An Interactive Method for Inferring Demographic Attributes in Twitter
Autor: | Timothy Cribbin, Enza Messina, Daniele Maccagnola, Valentina Beretta |
---|---|
Přispěvatelé: | Beretta, V, Maccagnola, D, Cribbin, T, Messina, V |
Rok vydání: | 2015 |
Předmět: |
Semi-automatic classification
Information retrieval Computer science Process (engineering) Interface (Java) Sampling (statistics) Resolution (logic) Data science Demographic attribution Range (mathematics) Sample size determination Key (cryptography) Twitter analytics User interface demographic attributes sampling semi-automatic classification social research sociology text analysis user interface |
Zdroj: | HT 26th ACM Conference on Hypertext and Social Media (Hypertext 2015) |
DOI: | 10.1145/2700171.2791031 |
Popis: | Twitter data offers an unprecedented opportunity to study demographic differences in public opinion across a virtually unlimited range of subjects. Whilst demographic attributes are often implied within user data, they are not always easily identified using computational methods. In this paper, we present a semi-automatic solution that combines automatic classification methods with a user interface designed to enable rapid resolution of ambiguous cases. TweetClass employs a two-step, interactive process to support the determination of gender and age attributes. At each step, the user is presented with feedback on the confidence levels of the automated analysis and can choose to refine ambiguous cases by examining key profile and content data. We describe how a user-centered design approach was used to optimise the interface and present the results of an evaluation which suggests that TweetClass can be used to rapidly boost demographic sample sizes in situations where high accuracy is required. |
Databáze: | OpenAIRE |
Externí odkaz: |