Análisis léxico sobre los tweets de Twitter

Autor: Bográn, Astrid-Paola, Alonso-Berrocal, José-Luis, G. Figuerola, Carlos
Přispěvatelé: Cruz-Benito, Juan, García-Holgado, Alicia, García-Sánchez, Sergio, Hernández-Alfageme, Daniel, Navarro-Cáceres, María, Vega-Ruiz, Roberto
Jazyk: Spanish; Castilian
Rok vydání: 2013
Předmět:
Zdroj: Bográn, Astrid-Paola and Alonso-Berrocal, José-Luis and G. Figuerola, Carlos Análisis léxico sobre los tweets de Twitter., 2013 . In Avances en Informática y Automática. Séptimo Workshop, Salamanca, 2013. [Conference paper]
Druh dokumentu: Conference paper
Popis: This paper provides an approach on Lexical analysis, focused on the tweets of Twitter. Shows the development of a web application that can connect to Twitter involving the handling of a classifier text on the web for discover the essential characteristics tweets selected, either individually or in mass, all running in real time or adding content to a database, that allow the user reprocess the tweets. The use of stemming and tokenization techniques help process the tweet cleaner and without noise. For the classification have been proposed the Naïve Bayes algorithm, and created several dictionaries in XML based on the areas of Science and Technology, as well as dictionaries that help identify empty words.
Databáze: E-LIS (Eprints in Library & Information Science)