Comparative Analysis of Traditional and Ensemble Models for Water Quality Index Prediction with Explainable AI
DOI:
https://doi.org/10.3126/injet.v3i1.87014Keywords:
Water Quality Index (WQI), Machine Learning, XGBoost, SHAP, Ensemble Algorithms, Explainable AI (XAI), CCME WQI, Regression, Water Resource ManagementAbstract
Accurate prediction of water quality is vital for effective environmental management. This study presents a comparative analysis of traditional and ensemble machine-learning models for predicting the Canadian Council of Ministers of the Environment Water Quality Index (CCME-WQI) using EPA Ireland coastal monitoring data. A standardized and leakage-proof pipeline was employed with robust scaling and multiple cross-validation across multiple random seeds to ensure stable and reproducible performance. Among all models, XGBoost achieved the best performance (R2 = 0.991). Model interpretability was enabled by SHAP analysis supported by feature correlation that identified Dissolved Oxygen as the dominant factor of WQI. Overall, results illustrate the potential of ensemble learners combined with explainable AI in making accurate, interpretable, and generalizable water-quality predictions to enable data-driven environmental monitoring and decision-making.
Downloads
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 International Journal on Engineering Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.
This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.