A Classification Model to Analyze the Spread and Emerging Trends of the Zika Virus in Twitter

Autor: B. K. Tripathy, Rahul Chowdhury, Saurabh Singh Thakur
Rok vydání: 2017
Předmět:
Zdroj: Advances in Intelligent Systems and Computing ISBN: 9789811038730
Popis: The Zika disease is a 2015–16 virus epidemic and continues to be a global health issue. The recent trend in sharing critical information on social networks such as Twitter has been a motivation for us to propose a classification model that classifies tweets related to Zika and thus enables us to extract helpful insights into the community. In this paper, we try to explain the process of data collection from Twitter, the preprocessing of the data, building a model to fit the data, comparing the accuracy of support vector machines and Naive Bayes algorithm for text classification and state the reason for the superiority of support vector machine over Naive Bayes algorithm. Useful analytical tools such as word clouds are also presented in this research work to provide a more sophisticated method to retrieve community support from social networks such as Twitter.
Databáze: OpenAIRE