An adaptable lexical simplification architecture for major ibero-romance languages

Autor: Horacio Saggion, Xavier Gómez Guinovart, Daniel Ferrés
Jazyk: angličtina
Rok vydání: 2017
Předmět:
Zdroj: Recercat. Dipósit de la Recerca de Catalunya
instname
Popis: Comunicació presentada a: EMNLP 2017, Workshop Building Linguistically Generalizable NLP Systems; celebrat el 8 de setembre de 2017 a Copenhagen, Dinamarca. Lexical Simplification is the task of reducing the lexical complexity of textual documents by replacing difficult words with easier to read (or understand) expressions while preserving the original meaning. The development of robust pipelined multilingual architectures able to adapt to new languages is of paramount importance in lexical simplification. This paper describes and evaluates a modular hybrid linguistic-statistical Lexical Simplifier that deals with the four major Ibero-Romance Languages: Spanish, Portuguese, Catalan, and Galician. The architecture of the system is the same for the four languages addressed, only the language resources used during simplification are language specific. This work is (partly) supported by the Spanish Ministry of Economy and Competitiveness under the Maria de Maeztu Units of Excellence Programme (MDM-2015-0502) and by the TUNER project (TIN2015-65308-C5-5-R and TIN2015- 65308-C5-1-R, MINECO/FEDER, UE).
Databáze: OpenAIRE