Identifying non-referentialit

Autor: Adriane Boyd, Donna Byron, Whitney Gegg-Harrison
Rok vydání: 2005
Předmět:
Zdroj: Proceedings of the ACL Workshop on Feature Engineering for Machine Learning in Natural Language Processing - FeatureEng '05.
DOI: 10.3115/1610230.1610238
Popis: In this paper, we present a machine learning system for identifying non-referential it. Types of non-referential it are examined to determine relevant linguistic patterns. The patterns are incorporated as features in a machine learning system which performs a binary classification of it as referential or non-referential in a POS-tagged corpus. The selection of relevant, generalized patterns leads to a significant improvement in performance.
Databáze: OpenAIRE