An Automatic Participant Detection Framework for Event Tracking on Twitter

Autor: Nicholas Mamo, Joel Azzopardi, Colin Layfield
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Zdroj: Algorithms, Vol 14, Iss 3, p 92 (2021)
Druh dokumentu: article
ISSN: 1999-4893
DOI: 10.3390/a14030092
Popis: Topic Detection and Tracking (TDT) on Twitter emulates human identifying developments in events from a stream of tweets, but while event participants are important for humans to understand what happens during events, machines have no knowledge of them. Our evaluation on football matches and basketball games shows that identifying event participants from tweets is a difficult problem exacerbated by Twitter’s noise and bias. As a result, traditional Named Entity Recognition (NER) approaches struggle to identify participants from the pre-event Twitter stream. To overcome these challenges, we describe Automatic Participant Detection (APD) to detect an event’s participants before the event starts and improve the machine understanding of events. We propose a six-step framework to identify participants and present our implementation, which combines information from Twitter’s pre-event stream and Wikipedia. In spite of the difficulties associated with Twitter and NER in the challenging context of events, our approach manages to restrict noise and consistently detects the majority of the participants. By empowering machines with some of the knowledge that humans have about events, APD lays the foundation not just for improved TDT systems, but also for a future where machines can model and mine events for themselves.
Databáze: Directory of Open Access Journals