Comparing Molecular Patterns Using the Example of SMARTS: Theory and Algorithms
Author: | Robert Schmidt, Emanuel S. R. Ehmki, Andriy Mashychev, Matthias Rarey, Hans-Christian Ehrlich, Farina Ohm |
---|---|
Year of publication: | 2019 |
Subject: | Similarity (geometry); 010304 chemical physics; business.industry; Computer science; Cheminformatics; General Chemical Engineering; General Chemistry; Filter (signal processing); Library and Information Sciences; 01 natural sciences; Pattern Recognition, Automated; 0104 chemical sciences; Computer Science Applications; Small Molecule Libraries; 010404 medicinal & biomolecular chemistry; Software; Analytics; 0103 physical sciences; Pattern recognition (psychology); business; Representation (mathematics); Algorithm; Algorithms |
Source: | Journal of Chemical Information and Modeling. 59:2560-2571 |
ISSN: | 1549-960X 1549-9596 |
DOI: | 10.1021/acs.jcim.9b00250 |
Description: | Molecular patterns are widely used for compound filtering in molecular design endeavors. They describe structural properties that are connected with unwanted physical or chemical properties like reactivity or toxicity. With filter sets comprising hundreds of structural filters, an analytic approach to compare those patterns is needed. Here we present a novel approach to solve the generic pattern comparison problem. We introduce chemically inspired fingerprints for pattern nodes and edges to derive an easy-to-compare pattern representation. On two annotated pattern graphs we apply a maximum common subgraph algorithm enabling the calculation of pattern inclusion and similarity. The resulting algorithm can be used in many different ways. We can automatically derive pattern hierarchies or search in large pattern collections for more general or more specific patterns. To the best of our knowledge, the presented algorithm is the first of its kind enabling these types of chemical pattern analytics. Our new tool named SMARTScompare is an implementation of the approach for the SMARTS language, which is the quasi-standard for structural filters. We demonstrate the capabilities of SMARTScompare on a large collection of SMARTS patterns from real applications. |
Database: | OpenAIRE |
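
The abstract describes node fingerprints that make pattern nodes easy to compare for inclusion and similarity. The following is only a minimal sketch of that idea, not the SMARTScompare implementation: the atom-type universe `ATOM_TYPES`, the bit-mask encoding, and the function names are all assumptions made for illustration, and the maximum common subgraph step over whole pattern graphs is omitted.

```python
# Toy illustration: a pattern node is summarized by the set of atom types
# it can match, encoded as a bit mask (a hypothetical "node fingerprint").
# The actual fingerprints used by the paper are not given in this record.

ATOM_TYPES = ("C", "N", "O", "S", "F", "Cl", "Br", "I")  # assumed universe


def fingerprint(allowed):
    """Encode the atom types a pattern node may match as an integer bit mask."""
    bits = 0
    for index, atom_type in enumerate(ATOM_TYPES):
        if atom_type in allowed:
            bits |= 1 << index
    return bits


def is_more_specific(fp_a, fp_b):
    """Return True if every atom type matched by node A is also matched by node B."""
    return (fp_a & fp_b) == fp_a


def similarity(fp_a, fp_b):
    """Jaccard similarity of the two sets of matched atom types."""
    union = fp_a | fp_b
    if union == 0:
        return 1.0  # two nodes that match nothing are treated as identical
    return bin(fp_a & fp_b).count("1") / bin(union).count("1")


# Node matching any halogen versus a node matching only the heavier halogens.
any_halogen = fingerprint({"F", "Cl", "Br", "I"})
heavy_halogen = fingerprint({"Cl", "Br", "I"})

print(is_more_specific(heavy_halogen, any_halogen))
print(is_more_specific(any_halogen, heavy_halogen))
print(similarity(heavy_halogen, any_halogen))
```

In this sketch, inclusion reduces to a subset test on bit masks, which is what makes a fingerprint representation cheap to compare; extending it to whole patterns would require matching nodes and edges between the two pattern graphs, as the abstract indicates.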