Expert Opinion and Coherence Based Topic Modeling
Autor: | Natchanon Suaysom, Weiqing Gu |
---|---|
Rok vydání: | 2018 |
Předmět: |
Structure (mathematical logic)
Topic model Computer science Coherence (statistics) computer.software_genre Data structure Directed acyclic graph Latent Dirichlet allocation Non-negative matrix factorization Tree (data structure) symbols.namesake Expert opinion Metric (mathematics) symbols Data mining Control (linguistics) computer |
Zdroj: | SSRN Electronic Journal. |
ISSN: | 1556-5068 |
DOI: | 10.2139/ssrn.3414903 |
Popis: | In this paper, we propose a novel algorithm that rearrange the topic assignment results obtained from topic modeling algorithms, including NMF and LDA. The effectiveness of the algorithm is measured by how much the results conform to expert opinion, which is a data structure called TDAG that we defined to represent the probability that a pair of highly correlated words appear together. In order to make sure that the internal structure does not get changed too much from the rearrangement, coherence, which is a well known metric for measuring the effectiveness of topic modeling, is used to control the balance of the internal structure. We developed two ways to systematically obtain the expert opinion from data, depending on whether the data has relevant expert writing or not. The final algorithm which takes into account both coherence and expert opinion is presented. Finally we compare amount of adjustments needed to be done for each topic modeling method, NMF and LDA. |
Databáze: | OpenAIRE |
Externí odkaz: |