Evaluation of calibration subsetting and new chemometric methods on the spectral prediction of key soil properties in a data‐limited environment

Autor: S. P. Wani, Sabine Grunwald, C. M. Clingensmith
Rok vydání: 2019
Předmět:
Zdroj: European Journal of Soil Science. 70:107-126
ISSN: 1365-2389
1351-0754
Popis: Few studies have systematically studied the effects of subsetting strategies on soil modelling or explored the potential of emergent methods from other fields not previously applied to pedometrics. This study considers smallholder agricultural villages in southern India that have been understudied in terms of chemometric modelling intended to support soil health, fertility and management. Therefore, the objective was to investigate the application of visible near‐infrared spectroscopy and chemometrics to predict soil properties in this setting. In addition, this study evaluated the effects of methods of calibration subsetting and new parametric models on the prediction of soil properties. These novel methods were transferred from the genomics field to soil science. Three strategic subsetting methods were used to produce calibration subsets that consider the variation in the soil properties, the spectra and both together; this is in addition to standard random calibration subsetting. Partial least squares regression (PLSR) and two methods from genomics that impose variable reduction were used for modelling; the latter were sparse PLSR (SPLSR) and the heteroscedastic effects model (HEM). Soil samples were collected from two villages and analysed for texture, soil carbon and available macro‐ and micro‐nutrients. Results showed that soil texture and carbon could be predicted moderately to strongly, whereas plant nutrient properties were predicted poorly to moderately. Random subsetting and subsetting by property distribution were more appropriate when spectra varied less overall, whereas subsetting that incorporates variation in spectra and properties improved results when spectral variation increased. The SPLSR and HEM models improved results over PLSR in some cases, or at least maintained prediction strength while using fewer predictors. Subsetting methods improved prediction results in 75% of cases. This study filled an important research gap by systematically studying local subsetting behaviour under different degrees of spectral and attribute variation.
Databáze: OpenAIRE