WAT2019: English-Hindi Translation on Hindi Visual Genome Dataset
Authors: | Loitongbam Sanayai Meetei, Thoudam Doren Singh, Sivaji Bandyopadhyay |
---|---|
Publication year: | 2019 |
Subject: |
Hindi; Text corpus; Computer science; Engineering and technology; Translation (geometry); Task (project management); Speech-language pathology & audiology; Medical and health sciences; Document and text processing; Electrical engineering, electronic engineering, information engineering; Artificial intelligence & image processing; Artificial intelligence; Other medical science; Natural language processing |
Source: | WAT@EMNLP-IJCNLP |
DOI: | 10.18653/v1/d19-5224 |
Description: | Multimodal translation is the task of translating a source language into a target language with the help of a parallel text corpus paired with images that represent the contextual details of the text. In this paper, we carry out an extensive comparison to evaluate the benefits of a multimodal approach to translating English text into Hindi, a low-resource language, as part of the WAT2019 shared task. We translated English to Hindi in three separate tasks on both the evaluation and challenge datasets: first using only the parallel text corpora, then through an image caption generation approach, and finally with the multimodal approach. Our experiments show a significant improvement in results with the multimodal approach over the other two approaches. |
Database: | OpenAIRE |
External link: |