Text Corpora, Local Grammars and Prediction

Authors: Traboulsi, Hayssam; Cheng, David; Ahmad, Khurshid
Contributors: Anderson Cancer Center, The University of Texas Health Science Center at Houston (UTHealth), Ligm, Lingu
Language: English
Year of publication: 2004
Subject:
Source: 4th International Conference on Language Resources and Evaluation (LREC'04), 2004, Lisbon, Portugal, pp. 749-752
Description: We present a corpus-based method for identifying and learning patterns that describe events in a specific domain by examining how (a) a small number of domain keywords are distributed throughout the corpus, and (b) a local grammar that is idiosyncratic to a class of events in the domain governs the usage of those keywords. We tested the method on a corpus of 3.63 million words. The results show promise. More importantly, the method can be applied to any arbitrary domain.
Database: OpenAIRE