The TALP-UPC system description for WMT20 news translation task: multilingual adaptation for low resource MT

Authors: Escolano Peinado, Carlos; Ruiz Costa-Jussà, Marta; Rodríguez Fonollosa, José Adrián
Contributors: Universitat Politècnica de Catalunya. Doctorat en Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. Departament de Ciències de la Computació; Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions; Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Language: English
Publication year: 2020
Subject:
Description: In this article, we describe the TALP-UPC participation in the WMT20 news translation shared task for Tamil-English. Given the small amount of parallel training data, we adapt the task to a multilingual system to benefit from positive transfer from high-resource languages. We use iterative backtranslation to fine-tune the system and to exploit the available monolingual data. To measure the effectiveness of these methods, we compare our results with a bilingual baseline system. This work is supported in part by the Google Faculty Research Award 2019, by the Spanish Ministerio de Ciencia e Innovacion through the postdoctoral senior grant Ramon y Cajal, and by the Agencia Estatal de Investigacion through the projects EUR2019-103819, PCIN-2017-079 and PID2019-107579RB-I00 / AEI / 10.13039/501100011033.
Database: OpenAIRE