Machine Learning-Based Hotel Booking Cancellation Prediction Using XGBoost
Abstract views: 7 , PDF downloads: 9Abstract
The rapid growth of online booking platforms has significantly increased the availability of hotel reservation data, enabling data-driven decision-making in the hospitality industry. However, high hotel booking cancellation rates remain a major challenge, leading to revenue loss and inefficient resource utilization. Accurately predicting booking cancellations is therefore essential to support effective reservation and revenue management strategies. Motivated by the limitations of traditional statistical and basic machine learning approaches in handling complex and imbalanced booking data, this study proposes a machine learning-based hotel booking cancellation prediction model using Extreme Gradient Boosting (XGBoost). The main contribution of this research lies in the systematic application of XGBoost combined with comprehensive data preprocessing, class imbalance handling, and hyperparameter optimization to improve prediction accuracy and robustness. The proposed approach is evaluated using a publicly available hotel booking demand dataset and assessed through multiple performance metrics, including accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (ROC-AUC). Experimental results demonstrate that the XGBoost model achieves strong and balanced classification performance in predicting both canceled and non-canceled bookings, outperforming conventional baseline methods reported in related studies. Despite the promising results, further improvements can be explored by incorporating additional contextual features and deploying explainable artificial intelligence techniques to enhance model transparency. Future work will also focus on real-time implementation and validation of the proposed model in operational hotel management systems to assess its effectiveness in dynamic booking environments.
Downloads

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.