Abstract
Neural Networks are used as pattern recognition tools in data mining to classify HIV status of individuals based on demographic and socio-economic characteristics. The data consists of seroprevalence survey information and contains variables such as age, education, location, race, parity and gravidity. The multilayer perceptron (MLP) neural network architecture was used for this study since as preliminary design showed this architecture to be the most optimal. The design of classifiers involves the assessment of classification performance, and this is based on the accuracy of the prediction using the confusion matrix. Three design approaches were implemented and a comparative analysis done. An accuracy of 84.24% was obtained for the genetic algorithms meanwhile an accuracy of 74% is obtained for the standard optimized network. The network structures for the different methodologies as well as the training and optimization times are also different. The gradient method proved to be the less computationally expensive but the most erroneous. A committee of neural networks was also analyzed and this obtained the same accuracy as the genetic algorithm network thus emphasizing the genetic algorithm as the better method.
Original language | English |
---|---|
Pages (from-to) | 684-690 |
Number of pages | 7 |
Journal | WSEAS Transactions on Information Science and Applications |
Volume | 3 |
Issue number | 4 |
Publication status | Published - Apr 2006 |
Externally published | Yes |
Keywords
- AIDS
- Bayesian classification
- Committee of networks
- Confusion matrix
- Conjugate gradient methods
- Genetic algorithms
- Multi layer perceptron
- Neural networks
ASJC Scopus subject areas
- Information Systems
- Computer Science Applications