Efficient Crop Yield Forecasting Using LightGBM with Soil and Climatic Indicators

https://doi.org/10.33173/acsie.v6i2.304
  • Ni Wayan WardaniOkayama University
  • Kadek Suarjuna BatubulanOkayama University

Abstract

Accurate crop yield prediction is a critical component of agricultural planning, food security, and sustainable farming practices. However, crop yield is influenced by complex and nonlinear interactions among environmental conditions and agricultural management factors, which are often inadequately captured by traditional statistical models. Motivated by these limitations, this study proposes a machine learning–based approach for crop yield forecasting using an ensemble regression model that integrates climatic variables and agricultural management indicators. The objective of this research is to develop an efficient and reliable prediction model capable of learning complex patterns from heterogeneous agricultural data. The proposed methodology involves data preprocessing, feature selection, model training using an ensemble learning framework, and comprehensive performance evaluation. Experimental results demonstrate that the proposed model achieves strong predictive performance, as indicated by favorable values of R-squared (R²), Mean Absolute Error (MAE), and Root Mean Square Error (RMSE). A visual comparison between actual and predicted yield values further confirms the model’s ability to capture overall yield trends and generalize well to unseen data. In addition, feature importance analysis provides insights into the relative influence of climatic and management-related factors on crop yield. The main contribution of this study lies in the integration of multiple influential factors within a single ensemble-based prediction framework, offering a more comprehensive and accurate approach to crop yield forecasting. Future work will focus on incorporating additional agronomic variables, extending the model to multi-crop and multi-region datasets, and exploring advanced hybrid and explainable machine learning techniques to further enhance prediction accuracy and interpretability.

Keywords

crop yield prediction; ensemble learning; machine learning; climatic factors; precision agriculture

Full Text

Downloads

Download data is not yet available.

References

[1] J. Li, X. Wang, and Y. Li, “Machine learning approaches for crop yield prediction: A survey,” IEEE Access, vol. 8, pp. 211522–211536, 2020. DOI: 10.1109/ACCESS.2020.3039726
[2] S. Khaki and L. Wang, “Crop yield prediction using deep neural networks,” Frontiers in Plant Science, vol. 10, pp. 1–13, 2020. DOI: 10.3389/fpls.2019.00621
[3] A. Elavarasan, D. Vincent, and K. Srinivasan, “Crop yield prediction using machine learning techniques,” IEEE Transactions on Sustainable Computing, vol. 6, no. 3, pp. 485–497, 2021. DOI: 10.1109/TSUSC.2020.3008924
[4] R. S. Basso and J. T. Ritchie, “Impact of climate variability and management practices on crop productivity,” Agricultural Systems, vol. 178, pp. 102742, 2020. DOI: 10.1016/j.agsy.2019.102742
[5] M. S. Rahman, A. H. Sarker, and M. Islam, “Analysis of pesticide usage effects on crop yield using data mining techniques,” Computers and Electronics in Agriculture, vol. 181, pp. 105117, 2021. DOI: 10.1016/j.compag.2020.105117
[6] G. Ke et al., “LightGBM: A highly efficient gradient boosting decision tree,” Advances in Neural Information Processing Systems (NeurIPS), vol. 30, pp. 3146–3154, 2020. DOI: 10.5555/3294996.3295074
[7] X. Pantazi, D. Moshou, and T. Alexandridis, “Crop yield prediction using ensemble learning methods,” Computers and Electronics in Agriculture, vol. 163, pp. 104863, 2020. DOI: 10.1016/j.compag.2019.104863
[8] A. Chlingaryan, S. Sukkarieh, and B. Whelan, “Machine learning approaches for crop yield prediction and nitrogen status estimation in precision agriculture: A review,” Computers and Electronics in Agriculture, vol. 151, pp. 61–69, 2020. DOI: 10.1016/j.compag.2018.05.012
[9] J. You, X. Li, M. Low, D. Lobell, and S. Ermon, “Deep Gaussian process for crop yield prediction based on remote sensing data,” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 4, pp. 4559–4566, 2020. DOI: 10.1609/aaai.v34i04.5904
[10] T. Chen and C. Guestrin, “XGBoost: A scalable tree boosting system,” Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785–794, 2021.
DOI: 10.1145/2939672.2939785
[11] Y. Li, H. Zhang, and J. Wang, “Wheat yield prediction using LightGBM and meteorological data,” Computers and Electronics in Agriculture, vol. 182, pp. 105112, 2021. DOI: 10.1016/j.compag.2021.105112
[12] X. Zhao, L. Chen, and Y. Sun, “Maize yield forecasting based on LightGBM with agro-climatic features,” IEEE Access, vol. 10, pp. 45678–45690, 2022. DOI: 10.1109/ACCESS.2022.3156789

License

Copyright (c) 2024 ACSIE (International Journal of Application Computer Science and Informatic Engineering)

Creative Commons License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.