Clustering Molecules at a Large Scale: Integrating Spectral Geometry with Deep Learning

Autor: Ömer Akgüller, Mehmet Ali Balcı, Gabriela Cioca
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: Molecules, Vol 29, Iss 16, p 3902 (2024)
Druh dokumentu: article
ISSN: 1420-3049
DOI: 10.3390/molecules29163902
Popis: This study conducts an in-depth analysis of clustering small molecules using spectral geometry and deep learning techniques. We applied a spectral geometric approach to convert molecular structures into triangulated meshes and used the Laplace–Beltrami operator to derive significant geometric features. By examining the eigenvectors of these operators, we captured the intrinsic geometric properties of the molecules, aiding their classification and clustering. The research utilized four deep learning methods: Deep Belief Network, Convolutional Autoencoder, Variational Autoencoder, and Adversarial Autoencoder, each paired with k-means clustering at different cluster sizes. Clustering quality was evaluated using the Calinski–Harabasz and Davies–Bouldin indices, Silhouette Score, and standard deviation. Nonparametric tests were used to assess the impact of topological descriptors on clustering outcomes. Our results show that the DBN + k-means combination is the most effective, particularly at lower cluster counts, demonstrating significant sensitivity to structural variations. This study highlights the potential of integrating spectral geometry with deep learning for precise and efficient molecular clustering.
Databáze: Directory of Open Access Journals
Nepřihlášeným uživatelům se plný text nezobrazuje