Predicting Secondary Student Academic Performance Using Stacked Regression Ensembles on UCI Datasets

Authors

  • Kamal Shrestha Department of Electronics and Computer Engineering, Thapathali Engineering Campus, Thapathali, Nepal
  • Bhagirath Padhya Aryal Department of Electronics and Computer Engineering, Thapathali Engineering Campus, Thapathali, Nepal

DOI:

https://doi.org/10.3126/injet.v3i1.86968

Keywords:

Ensemble Learning, Gradient Boosting (XGBoost), Stacking Ensemble, Student Performance Prediction.

Abstract

This study explores the use of stacked regression ensembles to predict secondary student academic performance using the UCI Portuguese student dataset. While previous works have focused on individual models such as Random Forest and XGBoost, this research investigates whether combining multiple regressors under a Theil-Sen meta-learner improves prediction accuracy. Among 15 models evaluated through cross-validation, XGBoost achieved the highest individual performance with R²  of 0.8420 and RMSE  of  1.2665. To further improve accuracy, stacking ensembles were created using 2 to 4 base regressors. The best-performing ensemble comprising XGBoost, ExtraTrees, LinearRegression, and Lasso achieved a cross-validated R² of 0.8553 and RMSE of 1.2082. These findings show that stacking diverse models offers enhanced predictive power and generalization, providing a robust solution for student performance prediction.

Downloads

Download data is not yet available.
Abstract
2
PDF
0

Downloads

Published

2025-12-24

How to Cite

Shrestha, K., & Aryal, B. . P. (2025). Predicting Secondary Student Academic Performance Using Stacked Regression Ensembles on UCI Datasets. International Journal on Engineering Technology, 3(1), 29–35. https://doi.org/10.3126/injet.v3i1.86968

Issue

Section

Articles