The construction of Chinese microblog gender-specific thesauruses and user gender classification

Authors: Zhiliang Zhu, Zejun Ke, Jiayin Cui, Hai Yu, Guoqi Liu
Language: English
Publication year: 2018
Subject:
Source: Applied Network Science, Vol 3, Iss 1, Pp 1-17 (2018)
Document type: article
ISSN: 2364-8228
DOI: 10.1007/s41109-018-0104-1
Description: Abstract: Short text messages published by users of different genders differ statistically in the words and semantics they use. In this paper, a gender-specific thesaurus is constructed first, and two new features are then built from it. A new classification model is constructed by combining the traditional statistical features with the improved text implicitness feature. Experimental evaluation on the Sina Weibo dataset demonstrated the effectiveness of the gender-specific thesaurus-based features, and the improved text implicitness feature raised the accuracy of gender classification to 84.7%.
Database: Directory of Open Access Journals