Towards the automatic merging of language resources

Author: Necşulescu, Silvia; Bel Rafecas, Núria; Padró, Muntsa; Marimon, Montserrat; Revilla, Eva
Subject:
Source: Recercat. Dipósit de la Recerca de Catalunya
Description: Language resources are a critical component of Natural Language Processing applications. Over the years, many resources have been created manually for the same task, but with different granularity and coverage. To create richer resources for a broad range of potential reuses, the information from all resources has to be joined into one. The high cost of comparing and merging different resources by hand has been a bottleneck for merging existing resources. With the objective of reducing human intervention, we present a new method for automating the merging of resources. We have addressed the merging of two verb subcategorization frame (SCF) lexica for Spanish. The results achieved, a new lexicon with enriched information and with conflicting information signalled, reinforce our idea that this approach can be applied to other NLP tasks.
Database: OpenAIRE