An ensemble-based drug–target interaction prediction approach using multiple feature information with data balancing

Autor: Heba El-Behery, Abdel-Fattah Attia, Nawal El-Fishawy, Hanaa Torkey
Jazyk: angličtina
Rok vydání: 2022
Předmět:
Zdroj: Journal of Biological Engineering, Vol 16, Iss 1, Pp 1-14 (2022)
Druh dokumentu: article
ISSN: 1754-1611
DOI: 10.1186/s13036-022-00296-7
Popis: Abstract Background Recently, drug repositioning has received considerable attention for its advantage to pharmaceutical industries in drug development. Artificial intelligence techniques have greatly enhanced drug reproduction by discovering therapeutic drug profiles, side effects, and new target proteins. However, as the number of drugs increases, their targets and enormous interactions produce imbalanced data that might not be preferable as an input to a prediction model immediately. Methods This paper proposes a novel scheme for predicting drug–target interactions (DTIs) based on drug chemical structures and protein sequences. The drug Morgan fingerprint, drug constitutional descriptors, protein amino acid composition, and protein dipeptide composition were employed to extract the drugs and protein’s characteristics. Then, the proposed approach for extracting negative samples using a support vector machine one-class classifier was developed to tackle the imbalanced data problem feature sets from the drug–target dataset. Negative and positive samplings were constructed and fed into different prediction algorithms to identify DTIs. A 10-fold CV validation test procedure was applied to assess the predictability of the proposed method, in addition to the study of the effectiveness of the chemical and physical features in the evaluation and discovery of the drug–target interactions. Results Our experimental model outperformed existing techniques concerning the curve for receiver operating characteristic (AUC), accuracy, precision, recall F-score, mean square error, and MCC. The results obtained by the AdaBoost classifier enhanced prediction accuracy by 2.74%, precision by 1.98%, AUC by 1.14%, F-score by 3.53%, and MCC by 4.54% over existing methods.
Databáze: Directory of Open Access Journals
Nepřihlášeným uživatelům se plný text nezobrazuje