Neural Machine Translation between Vietnamese and English: an Empirical Study

Autor: Hoang Vu Dang, Van Nam Nguyen, Hong-Hai Phan-Vu, Phan Thuan Do, Viet-Trung Tran
Rok vydání: 2019
Předmět:
Zdroj: Journal of Computer Science and Cybernetics. 35:147-166
ISSN: 1813-9663
Popis: Machine translation is shifting to an end-to-end approach based on deep neural networks. The state of the art achieves impressive results for popular language pairs such as English - French or English - Chinese. However for English - Vietnamese the shortage of parallel corpora and expensive hyper-parameter search present practical challenges to neural-based approaches. This paper highlights our efforts on improving English-Vietnamese translations in two directions: (1) Building the largest open Vietnamese - English corpus to date, and (2) Extensive experiments with the latest neural models to achieve the highest BLEU scores. Our experiments provide practical examples of effectively employing different neural machine translation models with low-resource language pairs.
Databáze: OpenAIRE