Optimized Multifidelity Machine Learning for Quantum Chemistry

Autor: Vinod, Vivin, Kleinekathöfer, Ulrich, Zaspel, Peter
Rok vydání: 2023
Předmět:
Druh dokumentu: Working Paper
DOI: 10.1088/2632-2153/ad2cef
Popis: Machine learning (ML) provides access to fast and accurate quantum chemistry (QC) calculations for various properties of interest such as excitation energies. It is often the case that high accuracy in prediction using an ML model, demands a large and costly training set. Various solutions and procedures have been presented to reduce this cost. These include methods such as $\Delta$-ML, hierarchical-ML, and multifidelity machine learning (MFML). MFML combines various $\Delta$-ML like sub-models for various fidelities according to a fixed scheme derived from the sparse grid combination technique. In this work we implement an optimization procedure to combine multifidelity models in a flexible scheme resulting in optimized MFML (o-MFML) that provides superior prediction capabilities. This hyper-parameter optimization is carried out on a holdout validation set of the property of interest. This work benchmarks the o-MFML method in predicting the atomization energies on the QM7b dataset, and again in the prediction of excitation energies for three molecules of growing size. The results indicate that o-MFML is a strong methodological improvement over MFML and provides lower error of prediction. Even in cases of poor data distributions and lack of clear hierarchies among the fidelities, which were previously identified as issues for multifidelity methods, the o-MFML provides advantage to the prediction of quantum chemical properties.
Comment: SI not included
Databáze: arXiv