Missing data: A comparison of neural network and expectation maximization techniques

Fulufhelo V. Nelwamondo, Shakir Mohamed, Tshilidzi Marwala

Research output: Contribution to journalArticlepeer-review

135 Citations (Scopus)

Abstract

Two techniques have emerged from the recent literature as candidate solutions to the problem of missing data imputation. These are the expectation maximization (EM) algorithm and the auto-associative neural network and genetic algorithm (GA) combination. Both these techniques have been discussed individually and their merits discussed at length in the available literature. However, they have not been compared with each other. This article provides a comparison of the two techniques using datasets of an industrial power plant, an industrial winding process and HIV seroprevalence survey data. Results show that the EM algorithm is more suitable and performs better in cases where there is little or no interdependency between the input variables, whereas the auto-associative neural network and GA combination is suitable when there are inherent nonlinear relationships between some of the given variables.

Original languageEnglish
Pages (from-to)1514-1521
Number of pages8
JournalCurrent Science
Volume93
Issue number11
Publication statusPublished - 10 Dec 2007
Externally publishedYes

Keywords

  • Expectation maximization algorithm
  • Genetic algorithm
  • Missing data
  • Neural network

ASJC Scopus subject areas

  • Multidisciplinary

Fingerprint

Dive into the research topics of 'Missing data: A comparison of neural network and expectation maximization techniques'. Together they form a unique fingerprint.

Cite this