Techniques for Sentiment Analysis and Topic Detection of Spanish Tweets: Preliminary Report

Autor: Fernández Anta, Antonio|||0000-0001-6501-2377, Morere, Philippe, Chiroque, Luis F., Santos, Agustín
Jazyk: angličtina
Rok vydání: 2012
Předmět:
Zdroj: IMDEA Networks Institute Digital Repository
instname
IMDEA Networks Institute
Popis: SEPLN 2012 – XXVIII Congreso de la Sociedad Española para el Procesamiento del Lenguaje Natural), co-located with the Spanish Informatics Conference (CEDI2012 – III Congreso Español de Informática) Sentiment analysis and topic detection are new problems that are at the intersection of natural language processing(NLP) and data mining. Sentiment analysis attempts to determine if a text is positive, negative, or neither, while topic detection attempts to identify the subject of the text. A significant amount of effort has been invested in constructing effective solutions for these problems, mostly for English texts. Using a corpus of Spanish tweets,we present a comparative analysis of different approaches and classification techniques for these problems.The data is preprocessed using techniques and tools proposed in the literature,together with others specifically proposed here that take into account the characteristics of Twitter.Then,popular classifiers have been used.(In particular,all classifiers of WEKA have been evaluated.)Due to its high number not all the results obtained will be presented here. TRUE pub
Databáze: OpenAIRE