Generation and Evaluation of Hindi Image Captions of Visual Genome

Autor: Loitongbam Sanayai Meetei, Thoudam Doren Singh, Sivaji Bandyopadhyay, Alok Singh
Rok vydání: 2021
Předmět:
Zdroj: Proceedings of the International Conference on Computing and Communication Systems ISBN: 9789813340831
DOI: 10.1007/978-981-33-4084-8_7
Popis: The automatic image caption generation with proper fluency and expressiveness is an emerging area of research. A lot of research has been done on image caption generation for English, but very few work has been done in the area of generating and evaluating captions in Hindi. In this paper, the problem of generation and evaluation of captions in Hindi is addressed by using a framework based on convolutional neural network (CNN) and long short-term memory (LSTM). This model maximizes the likelihood of the target caption for an input image. The framework is experimented over Hindi Visual Genome dataset. Human evaluation and pre-defined automatic evaluation metrics are used for the evaluation of generated output. The experimental results of the framework manifest that the model is generating reasonably impressive Hindi captions.
Databáze: OpenAIRE