TY - GEN
T1 - Improving the performance of the support vector machine in insurance risk classification
T2 - International Conference on Neural Computation Theory and Applications, NCTA 2011
AU - Duma, Mlungisi
AU - Twala, Bhekisipho
AU - Marwala, Tshilidzi
AU - Nelwamondo, Fulufhelo V.
PY - 2011
Y1 - 2011
N2 - The support vector machine is a classification technique used in linear and non- linear complex problems. It was shown that the performance of the technique decreases significantly in the presence of escalating missing data in the insurance domain. Furthermore the resilience of the technique when the quality of the data deteriorates is weak. When dealing with missing data, the support vector machine uses the mean-mode strategy to replace missing values. In this paper, we propose the use of the autoassociative network and the genetic algorithm as alternative strategies to help improve the classification performance as well as increase the resilience of the technique. A comparative study is conducted to see which of the techniques helps the support vector machine improve in performance and sustain resilience. The training data with completely observable data is used to construct the support vector machine and testing data with missing values is used to measuring the accuracy. The results show that both models help increase resilience with the autoassociative network showing better overall performance improvement.
AB - The support vector machine is a classification technique used in linear and non- linear complex problems. It was shown that the performance of the technique decreases significantly in the presence of escalating missing data in the insurance domain. Furthermore the resilience of the technique when the quality of the data deteriorates is weak. When dealing with missing data, the support vector machine uses the mean-mode strategy to replace missing values. In this paper, we propose the use of the autoassociative network and the genetic algorithm as alternative strategies to help improve the classification performance as well as increase the resilience of the technique. A comparative study is conducted to see which of the techniques helps the support vector machine improve in performance and sustain resilience. The training data with completely observable data is used to construct the support vector machine and testing data with missing values is used to measuring the accuracy. The results show that both models help increase resilience with the autoassociative network showing better overall performance improvement.
KW - Artificial neural network
KW - Autoassociative network
KW - Genetic algorithms
KW - Missing data
KW - Principal component analysis
KW - Support vector machine
UR - http://www.scopus.com/inward/record.url?scp=84862190134&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84862190134
SN - 9789898425843
T3 - NCTA 2011 - Proceedings of the International Conference on Neural Computation Theory and Applications
SP - 340
EP - 346
BT - NCTA 2011 - Proceedings of the International Conference on Neural Computation Theory and Applications
Y2 - 24 October 2011 through 26 October 2011
ER -