GOW-LDA: Applying Term Co-occurrence Graph Representation in LDA Topic Models Improvement

Autor: Phu Pham, Phuc Do, Chien D.C. Ta
Rok vydání: 2018
Předmět:
Zdroj: Lecture Notes in Electrical Engineering ISBN: 9789811082757
DOI: 10.1007/978-981-10-8276-4_40
Popis: In this paper, we demonstrate a novel approach in topic model exploration by applying word co-occurrence graph or g raph-o f-w ords (GOW) in order to produce more informative extracted latent topics from a large document corpus. According to the L atent D irichlet A llocation (LDA) algorithm, it only considers the words occurrence independently via probabilistic distributions. It leads to the failure in term’s relationship recognition. Hence in order to overcome this disadvantage of traditional LDA, we propose a novel approach, called GOW-LDA. The GOW-LDA is proposed that combines the GOW graph used in document representation, the frequent subgraph extracting and distribution model of LDA. For evaluation, we compare our proposed model with the traditional one in different classification algorithms. The comparative evaluation is performed in this study by using the standardized datasets. The results generated by the experiments show that the proposed algorithm yields performance respectably.
Databáze: OpenAIRE