Autor: |
Prabha, Sneh, Sardana, Neetu |
Zdroj: |
International Journal of Innovative Computing and Applications; 2024, Vol. 15 Issue: 1 p50-69, 20p |
Abstrakt: |
Community question and answer (Q&A) websites have become invaluable information and knowledge-sharing sources. Effective topic modelling on these platforms is crucial for organising and navigating the vast amount of user-generated content. To address these challenges, we propose a novel global-local term fusion with optimised community (GLOCOM) Q&A topic modelling approach that leverages both local and global term importance to enhance topic modelling on community Q&A websites. GLOCOM combines term frequency-inverse document frequency for local importance and entropy for global importance. Further, we employ fuzzy clustering to enhance the representation of multifaceted topics. Furthermore, clustering results are optimised using a genetic algorithm (GA) to refine cluster assignments and centroids. We compared the proposed model with baseline models LDA and FLSA. GLOCOM has performed consistently well for all topic numbers. It has shown an improvement of 8.86% in silhouette score as compared to LDA and excelled for datasets with size > 3 MB. |
Databáze: |
Supplemental Index |
Externí odkaz: |
|