Adaptive Markov Clustering Algorithm for Optimal Event Detection From Heterogeneous News Documents

Autor: Wafa Zubair Al-Dyani
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: IEEE Access, Vol 12, Pp 72079-72097 (2024)
Druh dokumentu: article
ISSN: 2169-3536
DOI: 10.1109/ACCESS.2024.3403506
Popis: Several Event Detection (ED) applications utilize various wrapper Feature Selection (FS) techniques based on wrapping the Markov Clustering Algorithm (MCL) with the Binary Bat Algorithm (BBA) or Adaptive Bat Algorithm (ABBA). These approaches have shown promising results in identifying relevant feature subset for MCL, leading to more precise event cluster from heterogeneous news articles. However, such wrapped FS methods involve coupling two methods (FS and ED) within ED model, with their performance influencing each other. While ABBA improved upon BBA’s limitations, MCL’s rapid convergence can hinder detection effectiveness. This fast convergence can lead to local optima and the detection of meaningless clusters. Additionally, MCL’s identification ability diminishes as the feature space grows. To address these issues, this paper develops two novel adaptive techniques to control MCL’s inflation (inf) and pruning (p) parameters, thereby managing its convergence behavior. Consequently, a new variant called Adaptive MCL (AMCL) is introduced and combined with ABBA. The effectiveness of the ABBA-AMCL method is evaluated using 10 benchmark datasets and two substantial Facebook news datasets. Various performance measures are employed to compare ABBA-AMCL against established methods. The empirical results demonstrate that ABBA-AMCL excels at extracting high-quality, real-world event clusters from various news text sources.
Databáze: Directory of Open Access Journals