A Machine Learning Approach for Estimating Missing Data in Nonstationary Environments

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

The assumption with most learning techniques and algorithms developed this far is that data is complete and will continuously be available or that data conforms to a stationary distribution. Real world applications are often streaming and their measurements are often sampled after an extended period of time thereby giving rise to the formation of a time series. Measurement devices assigned to measure nonstationary quantities are subject to failure. When a failure occurs, the process of approximating missing values becomes difficulty. The process of approximating missing values in such dynamic environments is further exacerbated by the chaotic and unpredictable nature of the evolving data. Typical examples include stock market, network intrusion detection systems and seismic waves. To estimate missing values with traditional statistical methods can lead to bias when applied to environments that evolve with time. This paper introduces an ensemble of regressors approach to approximate missing data in online nonstationary data. The approach learns new concepts incrementally and the current learnt concept is then used to approximate the missing values.

Original languageEnglish
Title of host publication2020 2nd International Multidisciplinary Information Technology and Engineering Conference, IMITEC 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728195209
DOIs
Publication statusPublished - 25 Nov 2020
Event2nd International Multidisciplinary Information Technology and Engineering Conference, IMITEC 2020 - Kimberley, South Africa
Duration: 25 Nov 202027 Nov 2020

Publication series

Name2020 2nd International Multidisciplinary Information Technology and Engineering Conference, IMITEC 2020

Conference

Conference2nd International Multidisciplinary Information Technology and Engineering Conference, IMITEC 2020
Country/TerritorySouth Africa
CityKimberley
Period25/11/2027/11/20

Keywords

  • concept drift
  • ensemble
  • imputation methods
  • missing data

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Hardware and Architecture
  • Information Systems
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'A Machine Learning Approach for Estimating Missing Data in Nonstationary Environments'. Together they form a unique fingerprint.

Cite this