An ensemble machine learning approach for determination of the optimum sampling time for evapotranspiration assessment from high-throughput phenotyping data
Autor: | Lijalem Korbu, Soumyashree Kar, J. Adinarayana, Vikram Kumar Purbey, Jana Kholova, Vincent Vadez, Saurabh Suradhaniwar, Surya S. Durbha |
---|---|
Rok vydání: | 2021 |
Předmět: |
0106 biological sciences
Artificial neural network Computer science Forestry 04 agricultural and veterinary sciences Interval (mathematics) Horticulture computer.software_genre 01 natural sciences Ensemble learning Computer Science Applications Random forest Support vector machine 040103 agronomy & agriculture 0401 agriculture forestry and fisheries Overhead (computing) Data mining Time series Agronomy and Crop Science Throughput (business) computer 010606 plant biology & botany |
Zdroj: | Computers and Electronics in Agriculture. 182:105992 |
ISSN: | 0168-1699 |
DOI: | 10.1016/j.compag.2021.105992 |
Popis: | Efficient selection of drought-tolerant crops requires identification and high-throughput phenotyping (HTP) of the complex functional (especially canopy-conductance) traits that elicit plant responses to continually fluctuating environmental conditions. However, phenotyping of such dynamic physiology-based traits has been immensely challenging especially due to the limited availability of adequate methods that can provide continuous measurements of plant-water relations. Therefore, gravimetric phenotyping of plants is being increasingly used to allow one-to-one monitoring of plant-water relations and generate continuous evapotranspiration (ET) profiles. The gravimetric sensors or load cells can provide ET estimates at very high frequencies, e.g. 15-min interval, as chosen by the user. There is however, no study on understanding the optimum frequency or the sampling time at which ET needs to be monitored, such that data-redundancy, noise and processing overhead could be reduced. Hence, this paper makes a novel attempt in identifying the optimum sampling time for phenotyping ET from load cells time series. The proposed procedure includes an ensemble Machine-Learning (ML) approach for optimizing the sampling time through time series forecasting of ET profiles and classification of genotypes using the forecasted ET values. High-frequency load cells data from the LeasyScan, HTP platform, ICRISAT were used to derive the ET profiles at frequencies or scales varying from 15-min to 180-min, followed by ET forecasting and classification at each frequency. For both forecasting and classification, an ensemble of three ML algorithms i.e. Support Vector Machines (SVM), Artificial Neural Network (ANN) and Random Forests (RF) were leveraged. Consequently, the performance metrics (of both the operations) obtained from the ensemble were used to compute the entropy-based optimum sampling time. The results reveal that 60-min interval HTP data could be credibly used for both, forecasting ET as well as correctly classifying the genotypes. |
Databáze: | OpenAIRE |
Externí odkaz: |