Image Caption Generation using Vision Transformer and GPT Architecture

Full text for this resource is not available from the Research Repository.

Mishra, Swapneel, Seth, Saumya, Jain, Shrishti, Pant, Vasudev, Parikh, Jolly, Jain, Rachna and Islam, Sardar M. N ORCID: 0000-0001-9451-7390 (2024) Image Caption Generation using Vision Transformer and GPT Architecture. In: 2024 2nd International Conference on Advancement in Computation & Computer Technologies (InCACCT), 2 May 2024 - 3 May 2024.

Item type: Conference or Workshop Item (Paper)
URI: https://vuir.vu.edu.au/id/eprint/49084
DOI: 10.1109/InCACCT61598.2024.10551257
Official URL: https://ieeexplore.ieee.org/document/10551257
ISBN: 9798350371321
Subjects: Current > FOR (2020) Classification > 4602 Artificial intelligence
Current > FOR (2020) Classification > 4603 Computer vision and multimedia computation
Current > Division/Research > Institute for Sustainable Industries and Liveable Cities