Image Caption Generation using Vision Transformer and GPT Architecture
Download
Full text for this resource is not available from the Research Repository.
Export
Mishra, Swapneel, Seth, Saumya, Jain, Shrishti, Pant, Vasudev, Parikh, Jolly, Jain, Rachna and Islam, Sardar M. N ORCID: 0000-0001-9451-7390
(2024)
Image Caption Generation using Vision Transformer and GPT Architecture.
In: 2024 2nd International Conference on Advancement in Computation & Computer Technologies (InCACCT), 2 May 2024 - 3 May 2024.
Dimensions Badge
Altmetric Badge
Item type | Conference or Workshop Item (Paper) |
URI | https://vuir.vu.edu.au/id/eprint/49084 |
DOI | 10.1109/InCACCT61598.2024.10551257 |
Official URL | https://ieeexplore.ieee.org/document/10551257 |
ISBN | 9798350371321 |
Subjects | Current > FOR (2020) Classification > 4602 Artificial intelligence Current > FOR (2020) Classification > 4603 Computer vision and multimedia computation Current > Division/Research > Institute for Sustainable Industries and Liveable Cities |
Download/View statistics | View download statistics for this item |
CORE (COnnecting REpositories)