Análisis de la riqueza léxica en el contexto de la clasificación de atributos demográficos latentes
Authors: | Roberto Rodríguez, John Alexander; Martí Antonin, M. Antònia; Salamó Llorente, Maria |
---|---|
Contributors: | Universitat de Barcelona |
Subject: | |
Source: | Recercat. Dipósit de la Recerca de Catalunya; Dipòsit Digital de la UB, Universidad de Barcelona |
Description: | In this paper we analyse the utility of lexical richness measures for predicting latent user attributes from opinionated texts in Spanish. Our aim is to determine how useful lexical richness is for predicting a user's gender, age and regional origin. To this end, we applied 32 lexical richness measures to 1911 texts previously labeled with demographic information. This approach has the advantage of being domain-independent and having a modest computational cost. |
Database: | OpenAIRE |
External link: |