TY - GEN
T1 - A Deep Learning Approach for Predicting Diabetes Stages
AU - Mokheleli, Tsholofelo
AU - Mbuya, Emmanuel
AU - Bokaba, Tebogo
AU - Ndayizigamiye, Patrick
AU - Idemudia, Efosa C.
N1 - Publisher Copyright:
© (2025) by Association for Information Systems (AIS) All rights reserved.
PY - 2025
Y1 - 2025
N2 - Diabetes is a global health challenge, requiring early detection to mitigate severe complications. This study explores a deep learning (DL) approach for predicting diabetes stages, incorporating social determinants of health (SDOH) and medical indicators. Using a multiclass dataset, the study addresses class imbalance through SMOTE-Tomek resampling. It adapts non-structural classifiers, Convolutional Neural Networks (CNN), and Long Short-Term Memory (LSTM) for structured data analysis. The study compares these models against a Feedforward Neural Network (FNN) baseline. Results indicate that while accuracy declined post-resampling, minority class predictions improved, enhancing model fairness. Notably, LSTM demonstrated the highest post-resampling accuracy (77.27%). This study advances DL applications in healthcare by integrating SDOH and reconfiguring CNN and LSTM for structured data, expanding their utility beyond traditional domains. These findings contribute to unbiased, clinically relevant diabetes prediction, aligning with Sustainable Development Goal 3 to promote well-being for all.
AB - Diabetes is a global health challenge, requiring early detection to mitigate severe complications. This study explores a deep learning (DL) approach for predicting diabetes stages, incorporating social determinants of health (SDOH) and medical indicators. Using a multiclass dataset, the study addresses class imbalance through SMOTE-Tomek resampling. It adapts non-structural classifiers, Convolutional Neural Networks (CNN), and Long Short-Term Memory (LSTM) for structured data analysis. The study compares these models against a Feedforward Neural Network (FNN) baseline. Results indicate that while accuracy declined post-resampling, minority class predictions improved, enhancing model fairness. Notably, LSTM demonstrated the highest post-resampling accuracy (77.27%). This study advances DL applications in healthcare by integrating SDOH and reconfiguring CNN and LSTM for structured data, expanding their utility beyond traditional domains. These findings contribute to unbiased, clinically relevant diabetes prediction, aligning with Sustainable Development Goal 3 to promote well-being for all.
KW - Class Imbalance
KW - Deep Learning
KW - Diabetes Prediction
KW - SMOTE-Tomek resampling
UR - https://www.scopus.com/pages/publications/105025160726
M3 - Conference contribution
AN - SCOPUS:105025160726
T3 - Americas Conference on Information Systems, AMCIS 2025
SP - 3141
EP - 3150
BT - Americas Conference on Information Systems, AMCIS 2025
PB - Association for Information Systems
T2 - 2025 Americas Conference on Information Systems, AMCIS 2025
Y2 - 14 August 2025 through 16 August 2025
ER -