CheS-Mapper 2.0 for visual validation of (Q)SAR models
Autor: | Martin Gütlein, Stefan Kramer, Andreas Karwath |
---|---|
Rok vydání: | 2014 |
Předmět: |
Visualization methods
Computer science Feature vector Library and Information Sciences computer.software_genre 01 natural sciences (Q)SAR Model validation 03 medical and health sciences Software Validation Overall performance Physical and Theoretical Chemistry Visualization 030304 developmental biology 0303 health sciences business.industry Statistical validation Computer Graphics and Computer-Aided Design 0104 chemical sciences Computer Science Applications 010404 medicinal & biomolecular chemistry 3d space 3D space Data mining business computer |
Zdroj: | Journal of Cheminformatics |
ISSN: | 1758-2946 |
Popis: | Background Sound statistical validation is important to evaluate and compare the overall performance of (Q)SAR models. However, classical validation does not support the user in better understanding the properties of the model or the underlying data. Even though, a number of visualization tools for analyzing (Q)SAR information in small molecule datasets exist, integrated visualization methods that allow the investigation of model validation results are still lacking. Results We propose visual validation, as an approach for the graphical inspection of (Q)SAR model validation results. The approach applies the 3D viewer CheS-Mapper, an open-source application for the exploration of small molecules in virtual 3D space. The present work describes the new functionalities in CheS-Mapper 2.0, that facilitate the analysis of (Q)SAR information and allows the visual validation of (Q)SAR models. The tool enables the comparison of model predictions to the actual activity in feature space. The approach is generic: It is model-independent and can handle physico-chemical and structural input features as well as quantitative and qualitative endpoints. Conclusions Visual validation with CheS-Mapper enables analyzing (Q)SAR information in the data and indicates how this information is employed by the (Q)SAR model. It reveals, if the endpoint is modeled too specific or too generic and highlights common properties of misclassified compounds. Moreover, the researcher can use CheS-Mapper to inspect how the (Q)SAR model predicts activity cliffs. The CheS-Mapper software is freely available at http://ches-mapper.org. Graphical abstract Comparing actual and predicted activity values with CheS-Mapper. |
Databáze: | OpenAIRE |
Externí odkaz: |