Text Corpora, Local Grammars and Prediction
Autor: | Traboulsi, Hayssam, Cheng, David, Ahmad, Khurshid |
---|---|
Přispěvatelé: | Anderson Cancer Center, The University of Texas Health Science Center at Houston (UTHealth), Ligm, Lingu |
Jazyk: | angličtina |
Rok vydání: | 2004 |
Předmět: |
local grammar
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing [INFO.INFO-TT] Computer Science [cs]/Document and Text Processing [SHS] Humanities and Social Sciences [SHS.LANGUE]Humanities and Social Sciences/Linguistics computational lexicon [SHS.LANGUE] Humanities and Social Sciences/Linguistics [SHS]Humanities and Social Sciences |
Zdroj: | 4th International Conference on Language Resources and Evaluation (LREC'04) 4th International Conference on Language Resources and Evaluation (LREC'04), 2004, Lisbonne, Portugal. pp.749--752 |
Popis: | International audience; We present a corpus-based method for identifying and learning patterns describing events in a specific domain by examining the manner in which: (a) a small number of keywords in the domain are distributed throughout the corpus; and, (b) a local grammar that is idiosyncratic of a class of events in the domain, governs the usage of the keywords. We tested our method against a corpus of 3.63 million words. The results show promise. More importantly, the method can be applied to any arbitrary domains. |
Databáze: | OpenAIRE |
Externí odkaz: |