Efficient Wine Quality Prediction and Classification Using LightGBM Model
Abstract views: 11 , PDF downloads: 11Abstract
This study develops an efficient machine learning model using the Light Gradient Boosting Machine (LightGBM) algorithm to predict and classify wine quality based on physicochemical properties. The dataset used in this research consists of multiple chemical attributes, including alcohol content, acidity levels, sulphates, and phenolic compounds, which collectively influence wine quality. The preprocessing stage involved data cleaning, outlier treatment, feature scaling, and handling class imbalance using the Synthetic Minority Oversampling Technique (SMOTE). Feature selection was conducted using mutual information and recursive feature elimination to identify the most influential predictors. The optimized LightGBM model achieved superior performance with 100% accuracy, precision, recall, and F1-score across all quality classes, outperforming traditional algorithms such as Random Forest, SVM, and Logistic Regression. Feature importance analysis revealed that Proline, Flavanoids, and Magnesium were the most significant attributes contributing to wine classification. These findings demonstrate that LightGBM is a robust and scalable solution for wine quality prediction, offering an efficient, data-driven alternative to traditional sensory evaluations. The proposed model can enhance quality control processes in the wine industry by providing accurate and interpretable insights into the chemical determinants of wine quality.
Downloads

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
References
[2] Wang, H., Zhao, X., Liu, H., & Zhang, H. (2021). Application of machine learning in wine quality prediction: A comparative study. Journal of Food Engineering, 302, 110572.
[3] Sun, X., Li, Y., & Zhang, Y. (2021). Prediction of wine quality using feature engineering and ML models. Expert Systems with Applications, 186, 115754.
[4] Patel, R., & Desai, M. (2023). A novel approach to wine classification using ML. Computational Biology and Chemistry, 105, 107080.
[5] Liu, Y., Feng, X., & Yang, T. (2021). Key features selection for wine quality prediction using machine learning. Applied Artificial Intelligence, 35(5), 355-374.
[6] Martinez, S., Romero, R., & Delgado, J. (2022). A comparative study of ML techniques in wine classification. Journal of Food Science and Technology, 59(8), 3056-3067.
[7] Zhang, J., Li, C., & Xu, X. (2023). Enhancing wine quality classification using LightGBM with optimized hyperparameters. Food Chemistry, 415, 135747.
[8] Chen, Z., Hu, W., & He, Y. (2022). Application of gradient boosting models in agricultural quality prediction. Computers and Electronics in Agriculture, 199, 107074.
[9] Guo, J., Tang, L., & Wang, Q. (2020). Comparative analysis of gradient boosting models for wine classification. International Journal of Food Science, 55(6), 2503-2512.
[10] Feng, J., Zhou, M., & Wang, L. (2022). Wine quality prediction leveraging LightGBM and feature interactions. Food and Bioproducts Processing, 130, 179-188.







