An ensemble method based on uninformative variable elimination and mutual information for spectral multivariate calibration.
Autor: | Tan C; Department of Chemistry and Chemical engineering, Yibin University, Yibin, Sichuan 644007, PR China. chaotan1112@163.com, Wang J, Wu T, Qin X, Li M |
---|---|
Jazyk: | angličtina |
Zdroj: | Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy [Spectrochim Acta A Mol Biomol Spectrosc] 2010 Dec; Vol. 77 (5), pp. 960-4. Date of Electronic Publication: 2010 Aug 27. |
DOI: | 10.1016/j.saa.2010.08.031 |
Abstrakt: | Based on the combination of uninformative variable elimination (UVE), bootstrap and mutual information (MI), a simple ensemble algorithm, named ESPLS, is proposed for spectral multivariate calibration (MVC). In ESPLS, those uninformative variables are first removed; and then a preparatory training set is produced by bootstrap, on which a MI spectrum of retained variables is calculated. The variables that exhibit higher MI than a defined threshold form a subspace on which a candidate partial least-squares (PLS) model is constructed. This process is repeated. After a number of candidate models are obtained, a small part of models is picked out to construct an ensemble model by simple/weighted average. Four near/mid-infrared (NIR/MIR) spectral datasets concerning the determination of six components are used to verify the proposed ESPLS. The results indicate that ESPLS is superior to UVEPLS and its combination with MI-based variable selection (SPLS) in terms of both the accuracy and robustness. Besides, from the perspective of end-users, ESPLS does not increase the complexity of a calibration when enhancing its performance. (Copyright © 2010 Elsevier B.V. All rights reserved.) |
Databáze: | MEDLINE |
Externí odkaz: |