A clustering-based sampling method for miRNA-disease association prediction

Autor:	Zheng Wei, Dengju Yao, Xiaojuan Zhan, Shuli Zhang
Jazyk:	angličtina
Rok vydání:	2022
Předmět:	miRNA-disease association ensemble learning clustering sampling computational methods Genetics QH426-470
Zdroj:	Frontiers in Genetics, Vol 13 (2022)
Druh dokumentu:	article
ISSN:	1664-8021
DOI:	10.3389/fgene.2022.995535
Popis:	More and more studies have proved that microRNAs (miRNAs) play a critical role in gene expression regulation, and the irregular expression of miRNAs tends to be associated with a variety of complex human diseases. Because of the high cost and low efficiency of identifying disease-associated miRNAs through biological experiments, scholars have focused on predicting potential disease-associated miRNAs by computational methods. Considering that the existing methods are flawed in constructing negative sample set, we proposed a clustering-based sampling method for miRNA-disease association prediction (CSMDA). Firstly, we integrated multiple similarity information of miRNA and disease to represent miRNA-disease pairs. Secondly, we performed a clustering-based sampling method to avoid introducing potential positive samples when constructing negative sample set. Thirdly, we employed a random forest-based feature selection method to reduce noise and redundant information in the high-dimensional feature space. Finally, we implemented an ensemble learning framework for predicting miRNA-disease associations by soft voting. The Precision, Recall, F1-score, AUROC and AUPR of the CSMDA achieved 0.9676, 0.9545, 0.9610, 0.9928, and 0.9940, respectively, under five-fold cross-validation. Besides, case study on three cancers showed that the top 20 potentially associated miRNAs predicted by the CSMDA were confirmed by the dbDEMC database or literatures. The above results demonstrate that the CSMDA can predict potential disease-associated miRNAs more accurately.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/601b5295a6774e17ba83883ef49d74ba Zobrazit plný text záznamu View record in DOAJ