Restricted-Variance Molecular Geometry Optimization Based on Gradient-Enhanced Kriging

Autor:	Christian L. Ritterhoff, Gerardo Raggi, Ignacio Fernández Galván, Roland Lindh, Morgane Vacher
Přispěvatelé:	Uppsala Universitet [Uppsala], Chimie Et Interdisciplinarité : Synthèse, Analyse, Modélisation (CEISAM), Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST), Université de Nantes (UN)-Université de Nantes (UN)-Centre National de la Recherche Scientifique (CNRS)-Institut de Chimie du CNRS (INC)
Jazyk:	angličtina
Rok vydání:	2020
Předmět:	Hessian matrix 010304 chemical physics Computer science Beräkningsmatematik Computational mathematics Variance (accounting) Function (mathematics) Energy minimization 01 natural sciences Stationary point Article Computer Science Applications symbols.namesake Computational Mathematics Molecular geometry Data point Surrogate model Kriging 0103 physical sciences symbols [CHIM]Chemical Sciences Point (geometry) Physical and Theoretical Chemistry Algorithm Mathematics
Zdroj:	Journal of Chemical Theory and Computation Journal of Chemical Theory and Computation, American Chemical Society, 2020, 16 (6), pp.3989-4001. ⟨10.1021/acs.jctc.0c00257⟩
ISSN:	1549-9618 1549-9626
DOI:	10.1021/acs.jctc.0c00257⟩
Popis:	Machine learning techniques, specifically Gradient-Enhanced Kriging (GEK), has been implemented for molecular geometry optimization.GEK has many advantages as compared to conventional -- step-restricted second-order truncated -- molecular optimization methods.In particular, the surrogate model associated with GEK can have multiple stationary points, will smoothly converge to theexact model as the size of the data set increases, and contains an explicit expression for the expected average error of the model functionat an arbitrary point in space.In this respect GEK can be of interest for methods used in molecular geometry optimizations.GEK is usually, however, associated with abundance of data, contrary to the situation desired forefficient geometry optimizations.In the paper we will demonstrate how the GEK procedure can be utilized in a fashion such that in the presence of few data points, thesurrogate surface will in a robust way guide the optimization to a minimum of a molecular structure.In this respect the GEK procedure will be used to mimic the behavior of a conventional second-order scheme, but retaining theflexibility of the superior machine learning approach -- GEK is an exact interpolator.Moreover, the expected variance will be used in the optimization to facilitate restricted-variance rational function optimizations (RV-RFO).A procedure which relates the eigenvalues of the Hessian-model-function Hessian with the individual characteristiclengths, used in the GEK, reduces the number of empirical parameters to two -- the value of the trend function and themaximum allowed variance. These parameters are determined using the extended Baker (e-Baker) test suite, at the Hartree-Fock level of approximation,and a single reaction of the Baker transition-state (Baker-TS) test suite as a training set. The so-created optimizationprocedure -- RV-RFO-GEK -- is tested using the e-Baker, the full Baker-TS, and the S22 test suites, at the density-functional-theory level for the two Baker test suitesand at the second order Møller-Plesset level of approximation for the S22 test suite, respectively.The tests show that the new method is generally on par with a state-of-the-art conventional method, while for difficult cases it exhibits a definite advantage.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f99109e7712cc51c8d7271407a9df790 http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-418833 Zobrazit plný text záznamu