Utilizing Google Search Data with Deep Learning, Machine Learning and Time Series Modeling to Forecast Influenza-Like Illnesses in South Africa

Seun O. Olukanmi, Fulufhelo V. Nelwamondo, Nnamdi I. Nwulu

Research output: Contribution to journalArticlepeer-review

12 Citations (Scopus)

Abstract

Influenza-like illnesses (ILI) result in deaths and hospitalizations across the globe. Traditional surveillance systems rely on data from general medical practitioners. The process is resource-intensive and plagued with delay. Although recent studies have shown the potential utility of free and fast alternatives like web and social media data, the reliability cannot be generalized due to differences in technological culture. Meanwhile, there is a scarcity of studies exploring these free online data for (sub-Saharan) African countries. In this paper, we utilize Google trends (GT) data for ILI forecasting in South Africa. We study models based on deep learning (Long short-term memory (LSTM) and feedforward neural networks (FNN)), machine learning (Multiple linear regression (MLR), elastic net (EN), support vector machine (SVM)), and statistical time series (seasonal autoregressive integrated moving average (SARIMA)) algorithms. The FNN and SVM models using GT data alone, produce forecasts close in accuracy to those fitted to actual ILI data. The algorithms rank differently across various performance measures. Generally, the deep learning techniques perform better than the other algorithms in our study. However, tuning the former is quite intricate. Combining GT and historical ILI data enhances the models. The non-deep-learning algorithms benefit more from this enhancement. Furthermore, we observe that search volume increases proportional to and timeously with reported infection rates, suggesting that South Africans search Google in the week they feel flu symptoms. Thus, monitoring Google search trends is a reliable proxy for monitoring flu spread in South Africa.

Original languageEnglish
Pages (from-to)126822-126836
Number of pages15
JournalIEEE Access
Volume9
DOIs
Publication statusPublished - 2021

Keywords

  • Deep learning
  • Google Trends
  • ILI reporting
  • South Africa
  • flu surveillance
  • influenza forecasting
  • machine learning

ASJC Scopus subject areas

  • General Computer Science
  • General Materials Science
  • General Engineering

Fingerprint

Dive into the research topics of 'Utilizing Google Search Data with Deep Learning, Machine Learning and Time Series Modeling to Forecast Influenza-Like Illnesses in South Africa'. Together they form a unique fingerprint.

Cite this