Predictive modeling with missing data using an automatic relevance determination ensemble: A comparative study

Mlungisi Duma, Bhekisipho Twala, Fulufhelo Nelwamondo, Tshilidzi Marwala

Research output: Contribution to journalArticlepeer-review

6 Citations (Scopus)

Abstract

The objective of this article is to present an automatic relevance determination ensemble as an effective variable extraction method for insurance datasets with large numbers of variables. Automatic relevance determination is a method that uses a Bayesian neural network and the evidence framework to rank variables in the order of relevance to the target variable. The current approach uses a single Bayesian neural network that searches only for local minima or maxima. In large datasets with numerous variables, this is a concern because we cannot be certain that the outcome is an optimal one. The method used to address this issue in this study is an automatic relevance determination ensemble with various configurations (or structures) of the Bayesian neural networks. Each outcome in the ensemble is determined by using a confidence factor rather than by scrutinizing the most probable weights values or hyperparameters directly. The extraction method is used with the repeated incremental pruning to produce error reduction, logistic discriminant analysis, and k-nearest neighbor models to evaluate the performance. Furthermore, the datasets employed contain escalating missing data to measure the accuracy and resilience of the models when they are used with the proposed ensemble. The ensemble is compared with the principal component analysis method. The results show that with the automatic relevance determination ensemble, the models achieve higher accuracies in performance than when used with the principal component analysis. Furthermore, the resilience and strength of models is higher when using the ensemble, compared with the principal component analysis method.

Original languageEnglish
Pages (from-to)967-984
Number of pages18
JournalApplied Artificial Intelligence
Volume26
Issue number10
DOIs
Publication statusPublished - 2012

ASJC Scopus subject areas

  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Predictive modeling with missing data using an automatic relevance determination ensemble: A comparative study'. Together they form a unique fingerprint.

Cite this