Measures for topical cohesion of user communities on Twitter
Autor: | Laurent Vercouter, Khaled Khelif, Nicolas Malandain, Stephan Brunessaux, Alexandre Pauchet, Guillaume Gadek |
---|---|
Přispěvatelé: | Airbus [France], Laboratoire d'Informatique, de Traitement de l'Information et des Systèmes (LITIS), Institut national des sciences appliquées Rouen Normandie (INSA Rouen Normandie), Institut National des Sciences Appliquées (INSA)-Normandie Université (NU)-Institut National des Sciences Appliquées (INSA)-Normandie Université (NU)-Université de Rouen Normandie (UNIROUEN), Normandie Université (NU)-Université Le Havre Normandie (ULH), Normandie Université (NU), Equipe Multi-agent, Interaction, Décision (MIND - LITIS), Normandie Université (NU)-Institut national des sciences appliquées Rouen Normandie (INSA Rouen Normandie), Vercouter, Laurent, Université Le Havre Normandie (ULH), Normandie Université (NU)-Normandie Université (NU)-Université de Rouen Normandie (UNIROUEN), Institut National des Sciences Appliquées (INSA)-Normandie Université (NU)-Institut National des Sciences Appliquées (INSA), Institut National des Sciences Appliquées (INSA)-Normandie Université (NU)-Institut National des Sciences Appliquées (INSA)-Université Le Havre Normandie (ULH) |
Jazyk: | angličtina |
Rok vydání: | 2017 |
Předmět: |
Engineering
Measure (data warehouse) Online social network Topical cohesion measure of communities Community detection business.industry Cohesion (computer science) 02 engineering and technology Domain (software engineering) World Wide Web Identification (information) [INFO.INFO-MA]Computer Science [cs]/Multiagent Systems [cs.MA] 020204 information systems Business intelligence 0202 electrical engineering electronic engineering information engineering Data analysis 020201 artificial intelligence & image processing [INFO.INFO-MA] Computer Science [cs]/Multiagent Systems [cs.MA] Baseline (configuration management) business Meaning (linguistics) |
Zdroj: | WI 2017-Proceedings of the International Conference on Web Intelligence WI 2017-Proceedings of the International Conference on Web Intelligence, Aug 2017, Leipzig, Germany. pp.211--218 WI |
Popis: | International audience; Nowadays, Online Social Networks (OSN) are commonly used by groups of users to communicate. Members of a family, colleagues, fans of a brand, political groups: the demand for a precise identification of these groups is increasing from brand monitoring, business intelligence and e-reputation management.However, a gap can be observed between the communities detected by many data analytics algorithms on OSN, and effective groups existing in real life: the detected communities often lack of meaning and internal semantic cohesion. Most of existing literature on OSN either focuses on the community detection problem in graphs without considering the topic of the messages exchanged, or concentrates exclusively on the messages without taking into account the social links.In this article, we support the hypothesis that communities extracted on OSN should be topically coherent. We therefore propose a model to represent the interaction between users on Twitter, the reference on micro-blogging OSN, and metrics to evaluate the topical cohesion of the detected communities. As an evaluation, we measure the topical cohesion of the groups of users detected by a baseline community detection algorithm, using two measures inspired from the classification domain, and one measure inspired from the NLP domain.A detailed analysis is performed on a big tweet dataset, from which a user graph is built. Introduced measures are compared with statistics to better picture the experiment, and yield interesting insights on a social and textual corpus. |
Databáze: | OpenAIRE |
Externí odkaz: |