The LIMA multilingual analyzer made free: FLOSS resources adaptation and correction

Autor: De Chalendar, G.
Přispěvatelé: Département Intelligence Ambiante et Systèmes Interactifs (DIASI), Laboratoire d'Intégration des Systèmes et des Technologies (LIST (CEA)), Direction de Recherche Technologique (CEA) (DRT (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Direction de Recherche Technologique (CEA) (DRT (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay, Calzolari N., Choukri K., Goggi S., Declerck T., Mariani J., Maegaard B., Moreno A., Odijk J., Mazo H., Piperidis S., Loftsson H., Laboratoire d'Intégration des Systèmes et des Technologies (LIST), Le Roy, Léna, Calzolari N., Choukri K., Goggi S., Declerck T., Mariani J., Maegaard B., Moreno A., Odijk J., Mazo H., Piperidis S., Loftsson H.
Jazyk: angličtina
Rok vydání: 2014
Předmět:
Zdroj: Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014, May 2014, Reykjavik, Iceland. pp.2932-2937
Popis: Conference of 9th International Conference on Language Resources and Evaluation, LREC 2014 ; Conference Date: 26 May 2014 Through 31 May 2014; Conference Code:131726; International audience; At CEA LIST, we have decided to release our multilingual analyzer LIMA as Free software. As we were not proprietary of all the language resources used we had to select and adapt free ones in order to attain results good enough and equivalent to those obtained with our previous ones. For English and French, we found and adapted a full-form dictionary and an annotated corpus for learning part-of-speech tagging models.
Databáze: OpenAIRE