Word2Vec vs DBnary ou comment (ré)concilier représentations distribuées et réseaux lexico-sémantiques ? Le cas de l’évaluation en traduction automatique

Autor:	Servan, Christophe, Elloumi, Zied, Blanchon, Hervé, Besacier, Laurent
Přispěvatelé:	Groupe d’Étude en Traduction Automatique/Traitement Automatisé des Langues et de la Parole (GETALP ), Laboratoire d'Informatique de Grenoble (LIG ), Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019]), Institut Universitaire de France (IUF), Ministère de l'Education nationale, de l’Enseignement supérieur et de la Recherche (M.E.N.E.S.R.), ANR-14-CE24-0016,KEHATH,Méthodes qualité avancées pour la post-édition de traduction automatique(2014)
Jazyk:	francouzština
Rok vydání:	2016
Předmět:	word embeddings lexical- semantic resources quality estimation évaluation Traduction automatique literary work représentations distribuées de mots [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] evaluation metrics word similarity ressources lexico-sémantiques similarité entre mots Machine translation post-edition
Zdroj:	Actes de la conférence conjointe JEP-TALN-RECITAL TALN 2016 TALN 2016, Jul 2016, Paris, France
Popis:	International audience; This paper presents an approach combining lexical-semantic resources and distributed representations of words applied to the evaluation in machine translation (MT). This study is made through the enrichment of a well-known MT evaluation metric : METEOR. METEOR enables an approximate match (synonymy or morphological similarity) between an automatic and a reference translation. Our experiments are made in the framework of the Metrics task of WMT 2014. We show that distributed representations are less efficient than lexical-semantic resources for MT evaluation but they can nonetheless bring interesting additional information.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::91065a316430ee4d054b990e5a321932 https://hal.archives-ouvertes.fr/hal-01350101 Zobrazit plný text záznamu