Data mining on carcinogenicity of chemical compounds by the JSM method
Autor: | M. Vrachko, V. G. Blinova, M. Novich, N. Fedorova, N. V. Kharchevnikova, D. A. Dobrynin |
---|---|
Rok vydání: | 2009 |
Předmět: | |
Zdroj: | Automatic Documentation and Mathematical Linguistics. 43:330-335 |
ISSN: | 1934-8371 0005-1055 |
DOI: | 10.3103/s000510550906003x |
Popis: | Prediction of the carcinogenicity of chemical compounds to rats was carried out by data mining analysis based on the logic of John Stuart Mill (JSM) and the fragmentary code of the substructure superposition (FCSS) data presentation language. The learning (608 compounds) and test (156 compounds) samples were taken from the database on the carcinogenicity of substances for laboratory animals developed by the Environmental Protection Agency of the USA (EPA USA). Predictions were made for 44% of the test samples. The prediction accuracy was 71%, sensitivity 73%, and specificity 67%. The causes for erroneous positive and negative predictions were studied, accounting for bioactivation of compounds in the course of metabolic biotransformations. |
Databáze: | OpenAIRE |
Externí odkaz: |