Dr.A.Leo, meena sarumathi, mahila vasanthi thangam, Clement Sudhahar J, Kevin Joseph J and Lourdu Stepy P
Adv. Artif. Intell. Mach. Learn., XX (XX):-
1. Dr.A.Leo: Karunya Institute of Technology and Sciences
2. meena sarumathi: Karunya Institute of Technology and Science.
3. mahila vasanthi thangam: Karunya Institute of Technology and Sciences.
4. Clement Sudhahar J: ICFAI Business School, IFHE Deemed University.
5. Kevin Joseph J: Division of Electronics and Communication Engineering, Karunya Institute of Technology and Sciences, Tamil Nadu, India
6. Lourdu Stepy P: School of management studies, Karunya Institute of Technology and Science, Tamil Nadu, India.
DOI: 10.54364/AAIML.2026.63305
Article History: Received on: 14-Feb-26, Accepted on: 12-May-26, Published on: 19-May-26
Corresponding Author: Dr.A.Leo
Email: leoa@karunya.edu
Citation: Meena Sarumathi S, et al. A Hybrid Shannon Entropy-Driven Ensemble Framework Integrating Random Forest, XGBoost, and CatBoost for Robust Mental Stress Prediction Among School Students. Advances in Artificial Intelligence and Machine Learning.2026. (Ahead of Print) https://dx.doi.org/10.54364/AAIML.2026.63305
The academic stress,
anxiety and the uncertainty as regards the career matters have enormous
psychological effects on the students who are at this important juncture of
education. The prior support system and targeted guidance with the proper
mechanisms of early detection are required. The machine Learning algorithms
help facilitate in practical deployment and targeted guidance for students at
their age group. The most underappreciated mental health challenges in modern
education are mental stress among high school students. In this research
Shannon Entropy based feature selection has been used. Through that the three
ensemble classifiers namely Random Forest, XG Boost and Cat Boost were used to
predict mental stress levels from structured questionnaire. The data was
collected in five psychological domains from 500 respondents answering 15
questions through five-point Likert scale. The XG Boost delivered commendable
91% accuracy and AUC = 0.996, while random forest followed by 95% of accuracy
and AUC of 0.998. CatBoost achieved the highest test accuracy (97%) with AUC of
1.000, though five-fold cross-validation yielded a more conservative estimate
of 92.8% (±3.31%). The high test-set performance reflects the cluster-derived
target variable methodology; external clinical validation remains essential. In
this research the most discriminating features are of motivation related
questions, Q14 and Q 15. The framework is small enough to be deployed in
practice and comprehensible enough to be used in the context of target student
support programs.