Zobrazeno 1 - 2
of 2
pro vyhledávání: '"José Carlos Rosales Núñez"'
Publikováno v:
Proceedings of the Seventh W-NUT workshop (colocated with EMNLP 2021)
W-NUT 2021-Seventh Workshop on Noisy User-generated Text (colocated with EMNLP 2021)
W-NUT 2021-Seventh Workshop on Noisy User-generated Text (colocated with EMNLP 2021), association for computational linguistics, Nov 2021, Punta Cana, Dominican Republic
W-NUT 2021-Seventh Workshop on Noisy User-generated Text (colocated with EMNLP 2021)
W-NUT 2021-Seventh Workshop on Noisy User-generated Text (colocated with EMNLP 2021), association for computational linguistics, Nov 2021, Punta Cana, Dominican Republic
International audience; This work takes a critical look at the evaluation of user-generated content automatic translation, the well-known specificities of which raise many challenges for MT. Our analyses show that measuring the average-case performan
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b3e286058ac3cfc7ca11ffdbe1fbd807
http://arxiv.org/abs/2110.12551
http://arxiv.org/abs/2110.12551
Publikováno v:
W-NUT@EMNLP
We present an approach to correct noisy User Generated Content (UGC) in French aiming to produce a pretreatement pipeline to improve Machine Translation for this kind of non-canonical corpora. In order to do so, we have implemented a character-based