Gradient boosting models like XGBoost are among the most popular models for tabular classification problems. Unfortunately, the greediness of gradient boosting algorithms can cause them to rely too heavily on some features, thereby starving the other features. We propose Iterative Feature eXclusion (IFX) to alleviate this problem by iteratively removing the most influential feature from the training data and continuing training. By forcing the model to learn from weaker features, we increase the diversity of the gradient boosting model and improve the predictive performance. Our experiments show that in most cases, IFX improves XGBoost predictive performance, sometimes by a large margin. All of the code and results from our experiments are freely available online. Iterative Feature eXclusion can be used as a drag-and-drop replacement for XGBoost, thereby easing the adoption of our work by machine learning researchers and practitioners.
Ismail B. Mustapha, Muyideen Abdulkareem, Taha M. Jassam, Ali Alateah, Khaled A. Alawi Al-Sodani, Mohammed M. H. Al-Tholaia, Hatem Nabus, Sophia C. Alih, Zainab Abdulkareem, Abideen Ganiyu
Discussion(0)
No comments yet. Be the first to comment.