A new big data approach for topic classification and sentiment analysis of Twitter data

Autor:	Niranjan N. Chiplunkar, Anisha P. Rodrigues
Rok vydání:	2019
Předmět:	Service (systems architecture) Naive bayesian classifier Computer science business.industry Cognitive Neuroscience InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL Sentiment analysis Big data Hash function 020206 networking & telecommunications 02 engineering and technology computer.software_genre Lexicon Naive Bayes classifier ComputingMethodologies_PATTERNRECOGNITION Mathematics (miscellaneous) Artificial Intelligence 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Computer Vision and Pattern Recognition Artificial intelligence InformationSystems_MISCELLANEOUS business computer Natural language processing
Zdroj:	Evolutionary Intelligence. 15:877-887
ISSN:	1864-5917 1864-5909
DOI:	10.1007/s12065-019-00236-3
Popis:	Twitter is a major micro-blogging service, with millions of active users. These users use Twitter to post status messages called tweets and share their opinions using hash tags on various events. Hence, Twitter is considered a major real time streaming source and one of an effective and accurate indicator of opinions. The amount of data generated by Twitter is huge and it is difficult to scan entire data manually. This paper proposes a Hybrid Lexicon-Naive Bayesian Classifier (HL-NBC) method for sentimental analysis. In addition to that, Sentiment analysis engine is preceded by topic classification, which classifies tweets into different categories and filters irrelevant tweets. The proposed method is compared with Lexicon, Naive Bayesian classifier for uni-gram and bi-gram features. Out of the different approaches, the proposed HL-NBC method does sentiment classification in an improved way and gives accuracy of 82%, which is comparatively better than other methods. Also, the sentiment analysis is performed in a shorter time compared to traditional methods and achieves 93% improvement in processing time for larger datasets.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::7ee0f8bde00e795aae52dff1571f7d5e https://doi.org/10.1007/s12065-019-00236-3 Zobrazit plný text záznamu Full text from SpringerLink