A Comparison of Transformer, Recurrent Neural Networks and SMT in Tamil to Sinhala MT

Autor:	Ashmari Pramodya, Randil Pushpananda, Ruvan Weerasinghe
Rok vydání:	2020
Předmět:	Network architecture Machine translation business.industry Computer science Context (language use) 010501 environmental sciences Semantics computer.software_genre 01 natural sciences language.human_language 030507 speech-language pathology & audiology 03 medical and health sciences Recurrent neural network Tamil language Artificial intelligence State (computer science) 0305 other medical science business computer Natural language processing 0105 earth and related environmental sciences Transformer (machine learning model)
Zdroj:	2020 20th International Conference on Advances in ICT for Emerging Regions (ICTer).
DOI:	10.1109/icter51097.2020.9325431
Popis:	Neural Machine Translation (NMT) is currently the most promising approach for machine translation. The attention mechanism is a successful technique in modern Natural Language Processing (NLP), especially in tasks like machine translation. The recently proposed network architecture of the Transformer is based entirely on attention mechanisms and achieves a new state of the art results in neural machine translation, outperforming other sequence-to-sequence models. Although it is successful in a resource-rich setting, its applicability for low-resource language pairs is still debatable. Additionally when the language pair is morphologically rich and also when the corpora is multi-domain, the lack of a large parallel corpus becomes a significant barrier. In this study, we explore different NMT algorithms – Long Short Term Memory (LSTM) and Transformer based NMT, to translate the Tamil to Sinhala language pair. Where we clearly see transformer outperforms LSTM by 2.43 BLEU score for Tamil to Sinhala direction. And this work provides a preliminary comparison of statistical machine translation (SMT) and Neural Machine Translation (NMT) for Tamil to Sinhala in the open domain context.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::d4d472528e830f7a2166cdf1dfdb0026 https://doi.org/10.1109/icter51097.2020.9325431 Zobrazit plný text záznamu