Popis: |
The United States cotton industry is devoted to sustainable production strategies that reduce water, land, and energy consumption while enhancing soil health and cotton yield. Climate-smart agricultural solutions are being developed to increase yields and reduce operational costs. However, crop yield prediction is challenging because of the complex and nonlinear interactive effects of cultivar, soil type, management, pests and diseases, climate, and weather patterns on crops. To address this challenge, the machine learning (ML) method was used to predict yield, considering climatic change, soil diversity, cultivars, and fertilizer applications. Field data were collected over the southern US cotton belt in the 1980s and the 1990s. A second data source was generated from the process-based cotton model GOSSYM to reflect the most recent effects of climate change over the last six years (2017–2022). We focused on nine locations in three southern states: Texas, Mississippi, and Georgia. The accumulated heat for each set of experimental data was used as an analogue for the time-series weather data to reduce the number of computations. The Random Forest (RF) regressor, Support Vector Regression (SVR), Light Gradient Boosting Machine (LightGBM) regressor, Multiple Linear Regression (MLR), and neural networks were evaluated. Cross-validation was performed to obtain an improved model that did not suffer from overfitting. The RF regressor achieved an accuracy of 97.75%, with an $R^{2}$ of roughly 0.98 and a root mean square error of 55.05 kg/ha. The results demonstrate how a simple and robust model can be developed and utilized to help cotton climate-smart efforts. |