An adaptable lexical simplification architecture for major Ibero-Romance languages
Author: | Horacio Saggion, Xavier Gómez Guinovart, Daniel Ferrés |
---|---|
Language: | English |
Publication year: | 2017 |
Subject: |
Lexical simplification
Computer science; Complex Word Detection; Romance languages; Modular design; Morphological Generation; Linguistics; Ibero-Romance Languages; Task (project management); Original meaning; Natural Language Generation; Catalan; Artificial intelligence; Portuguese; Architecture; Word Sense Disambiguation; Natural language processing |
Source: | Recercat. Dipòsit de la Recerca de Catalunya |
Description: | Paper presented at: EMNLP 2017, Workshop Building Linguistically Generalizable NLP Systems; held on 8 September 2017 in Copenhagen, Denmark. Lexical Simplification is the task of reducing the lexical complexity of textual documents by replacing difficult words with easier-to-read (or easier-to-understand) expressions while preserving the original meaning. The development of robust pipelined multilingual architectures able to adapt to new languages is of paramount importance in lexical simplification. This paper describes and evaluates a modular hybrid linguistic-statistical Lexical Simplifier that deals with the four major Ibero-Romance Languages: Spanish, Portuguese, Catalan, and Galician. The architecture of the system is the same for all four languages; only the language resources used during simplification are language-specific. This work is (partly) supported by the Spanish Ministry of Economy and Competitiveness under the Maria de Maeztu Units of Excellence Programme (MDM-2015-0502) and by the TUNER project (TIN2015-65308-C5-5-R and TIN2015-65308-C5-1-R, MINECO/FEDER, UE). |
Database: | OpenAIRE |
External link: |