Cross-validatory framework for optimal parameter estimation of KPCA and KPLS models
Autor: | Lei Xie, David Rooney, Jillian M. Thompson, Zhe Li, Yujia Fu, Uwe Kruger, Juergen Hahn, Huizhong Yang |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2017 |
Předmět: |
Mathematical optimization
Optimal parameter estimation 02 engineering and technology 01 natural sciences Kernel principal component analysis Analytical Chemistry Number of latent variable sets Kernel partial least squares Cross-validatory framework 0202 electrical engineering electronic engineering information engineering Kernel parameter Spectroscopy Mathematics Nonlinear models Estimation theory Process Chemistry and Technology 010401 analytical chemistry Contrast (statistics) 0104 chemical sciences Computer Science Applications Variable kernel density estimation Kernel (statistics) 020201 artificial intelligence & image processing Combined objective function Algorithm Software |
Zdroj: | Fu, Y, Kruger, U, Li, Z, Xie, L, Thompson, J, Rooney, D, Hahn, J & Yang, H 2017, ' Cross-validatory framework for optimal parameter estimation of KPCA and KPLS models ', Chemometrics and Intelligent Laboratory Systems, vol. 167, pp. 196-207 . https://doi.org/10.1016/j.chemolab.2017.06.007 |
DOI: | 10.1016/j.chemolab.2017.06.007 |
Popis: | This article revisits recently proposed methods to determine the kernel parameter and the number of latent components for identifying kernel principal component analysis (KPCA) and kernel partial least squares (KPLS) models. A detailed analysis shows that existing work is neither optimal nor efficient in determining these important parameters and may lead to erroneous estimates. In addition to that, most methods are not designed to simultaneously estimate both parameters, i.e. they require one parameter to be predetermined. To address these practically important issues, the article introduces a cross-validatory framework to optimally determine both parameters. Application studies to a simulation example and a total of three experimental or industrial data sets confirm that the cross-validatory framework outperforms existing methods and yields optimal estimations for both parameters. In sharp contrast, existing work has the potential to substantially overestimate the number of latent components and to provide inadequate estimates for the kernel parameter. |
Databáze: | OpenAIRE |
Externí odkaz: |