Quantitative Comparison of Machine Learning Clustering Methods for Tuberculosis Data Analysis
Autor: | Marlen Kossakov, Assel Mukasheva, Gani Balbayev, Syrym Seidazimov, Dinargul Mukammejanova, Madina Sydybayeva |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2024 |
Předmět: | |
Zdroj: | Engineering Proceedings, Vol 60, Iss 1, p 20 (2024) |
Druh dokumentu: | article |
ISSN: | 20240600 2673-4591 |
DOI: | 10.3390/engproc2024060020 |
Popis: | In many fields, data-driven decision making has become essential due to machine learning (ML), which provides insights that improve productivity and quality of life. A basic machine learning approach called clustering helps find comparable data points. Clustering plays a critical role in the identification of patient subgroups and the customisation of treatment in the context of tuberculosis (TB) research. While prior studies have recognized its utility, a comprehensive comparative analysis of multiple clustering methods applied to TB data is lacking. Using TB data, this study thoroughly assesses and contrasts four well-known machine learning clustering algorithms: spectral clustering, DBSCAN, hierarchical clustering, and k-means. To evaluate the quality of a cluster, quantitative measures such as the silhouette score, Davies–Bouldin index, and Calinski–Harabasz index are utilised. The results provide quantitative insights that enhance comprehension of clustering and guide future research. |
Databáze: | Directory of Open Access Journals |
Externí odkaz: |