Predicting Student Academic Success through Explainable Machine Learning Models: A Comparative Study of BRF, XGBoost, and CatBoost

Authors

  • Prasish Timalsina Department of Electronics and Computer Engineering, Thapathali, Kathmandu, Nepal
  • Rajad Shakya Department of Electronics and Computer Engineering, Thapathali, Kathmandu, Nepal

DOI:

https://doi.org/10.3126/injet.v3i1.87015

Keywords:

Academic success prediction, Machine learning, XGBoost, Balanced Random Forest, CatBoost, SHAP

Abstract

This study presents an explainable machine learning framework for predicting academic success among university students using Balanced Random Forest (BRF), XGBoost, and CatBoost. The dataset, collected from undergraduate computer science students in Bangladesh, encompasses academic, behavioral, and socio-demographic attributes. To handle class imbalance, SMOTE-based oversampling was applied, and model thresholds were optimized using the F1-score. Model interpretability was achieved using SHAP (SHapley Additive exPlanations), enabling transparent identification of influential features such as attendance, study habits, income, and academic progression. Among all models, CatBoost achieved the highest macro F1-score and ROC-AUC, demonstrating its robustness in handling heterogeneous educational data. The results highlight how interpretable ensemble learning can support early risk detection and data-driven academic interventions, thereby bridging the gap between predictive performance and educational transparency.

Downloads

Download data is not yet available.
Abstract
1
PDF
0

Downloads

Published

2025-12-24

How to Cite

Timalsina, P., & Shakya, R. (2025). Predicting Student Academic Success through Explainable Machine Learning Models: A Comparative Study of BRF, XGBoost, and CatBoost. International Journal on Engineering Technology, 3(1), 100–108. https://doi.org/10.3126/injet.v3i1.87015

Issue

Section

Articles