Entity perception of Two-Step-Matching framework for public opinions

Autor: Ren-De Li, Hao-Tian Ma, Zi-Yi Wang, Qiang Guo, Jian-Guo Liu
Jazyk: angličtina
Rok vydání: 2020
Předmět:
Zdroj: Journal of Safety Science and Resilience, Vol 1, Iss 1, Pp 36-43 (2020)
Druh dokumentu: article
ISSN: 2666-4496
DOI: 10.1016/j.jnlssr.2020.06.005
Popis: Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by BiLSTM-CRF model and characteristic words by TF-IDF model from public comments. Secondly, the first matching is implemented between potential entities and an official business directory by Jaro–Winkler distance algorithm. Then, in order to find the precise one, an industry-characteristic dictionary is developed into the second matching process. The precise entity is identified according to the count of characteristic words matching to industry-characteristic dictionary. In addition, associated rate (global indicator) and accuracy rate (sample indicator) are defined for evaluation of matching accuracy. The results for three data sets of public opinions about major public health events show that the highest associated rate and accuracy rate arrive at 0.93 and 0.95, averagely enhanced by 32% and 30% above the case of using the first matching process alone. This framework provides the method to find the true target entity of really wanted expression from public opinions.
Databáze: Directory of Open Access Journals