Predicting Under-Five Year Mortality and its Determinants in South Africa: A Machine Learning Approach

Mokgoropo Makgaba, Koena Mabokela, Amusa Lateef, Michael Olusanya, Joash Mageto

Research output: Chapter in Book/Report/Conference proceeding · Conference contribution · peer-review

Abstract

Mortality among children under five years of age remains a pressing global health challenge. This study aims to develop a machine learning model that predicts under-five mortality in South Africa and to identify the key determinants of this mortality. Data from the 2016 South Africa Demographic and Health Survey were used to explore a model that optimally predicts under-five mortality. The study employed a chi-square test and analysis of variance for feature selection, and the synthetic minority oversampling technique (SMOTE) was used to manage class imbalance. The models were evaluated on multiple evaluation metrics, and the best-performing models were used to determine the key factors for predicting child mortality. Among the models tested, random forest, XGBoost and logistic regression performed best. Breastfeeding status and the number of children under five years in the household were identified as the most important predictors of child mortality. Other influential variables were being a twin, the total number of children born to the mother, and access to clean drinking water. The results show the potential of machine learning models to predict under-five mortality and identify key risk factors. They also show the need for targeted policy interventions that promote breastfeeding, improve access to basic services and ensure support for larger families with more children under the age of five in the household. The results provide policymakers with insights for designing strategies that will help the country achieve Sustainable Development Goal 3.
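
The oversampling step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it is a plain-Python rendering of the core SMOTE idea (interpolating between a minority-class sample and one of its nearest minority-class neighbours), and the function name `smote`, its parameters, and the toy data are all hypothetical. It also assumes purely numeric features, whereas survey data are often categorical and would need a variant designed for that.

```python
import math
import random


def smote(minority, n_synthetic, k=5, seed=0):
    """Create n_synthetic points by interpolating between minority samples
    and their k nearest minority-class neighbours (the core SMOTE idea)."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_synthetic):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base point.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment, in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical toy data: two numeric features per child record.
deaths = [[1.0, 0.0], [2.0, 1.0], [1.5, 0.5], [2.5, 0.0]]      # minority class
survivors = [[float(i % 7), float(i % 3)] for i in range(40)]  # majority class

# Oversample the minority class until both classes have the same size.
new_points = smote(deaths, n_synthetic=len(survivors) - len(deaths), k=3)
balanced_minority = deaths + new_points
print(len(balanced_minority), len(survivors))
```

In practice, oversampling of this kind is usually applied to the training split only, so that synthetic records do not leak into the data used to evaluate the models.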

Original language: English
Title of host publication: International Conference on Artificial Intelligence, Computer, Data Sciences, and Applications, ACDSA 2025
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9798331535629
Publication status: Published - 2025
Event: 2nd International Conference on Artificial Intelligence, Computer, Data Sciences, and Applications, ACDSA 2025 - Antalya, Turkey
Duration: 7 Aug 2025 to 9 Aug 2025

Publication series

Name: International Conference on Artificial Intelligence, Computer, Data Sciences, and Applications, ACDSA 2025

Keywords

  • Child mortality
  • Logistic Regression
  • Predict
  • Random Forest
  • SMOTE
  • South Africa
  • XGBoost
  • demographic and health
  • machine learning

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Information Systems
  • Software
  • Information Systems and Management
  • Health Informatics
