NEED4Tweet: A Twitterbot for Tweets Named Entity Extraction and Disambiguation

Autor: Mena Badieh Habib, Maurice van Keulen
Přispěvatelé: Databases (Former)
Jazyk: angličtina
Rok vydání: 2015
Předmět:
Zdroj: ISSUE=53;TITLE=53rd Annual Meeting of the Association for Computational Linguistics 2015
Proceedings of the System Demonstrations of The 53rd Annual Meeting of the Association for Computational Linguistics (ACL 2015), Beijing, China
ACL (System Demonstrations)
Popis: In this demo paper, we present NEED4Tweet, a Twitterbot for named entity extraction (NEE) and disambiguation (NED) for Tweets. The straightforward application of state-of-the-art extraction and disambiguation approaches on informal text widely used in Tweets, typically results in significantly degraded performance due to the lack of formal structure; the lack of sufficient context required; and the seldom entities involved. In this paper, we introduce a novel framework that copes with the introduced challenges. We rely on contextual and semantic features more than syntactic features which are less informative. We believe that disambiguation can help to improve the extraction process. This mimics the way humans understand language.
Databáze: OpenAIRE