TY - GEN
T1 - Using optimisation techniques to granulise rough set partitions
AU - Crossingham, Bodie
AU - Marwala, Tshilidzi
PY - 2007
Y1 - 2007
N2 - This paper presents an approach to optimise rough set partition sizes using various optimisation techniques. Three optimisation techniques are implemented to perform the granularisation process, namely, genetic algorithm (GA), hill climbing (HC) and simulated annealing (SA). These optimisation methods maximise the classification accuracy of the rough sets. The proposed rough set partition method is tested on a set of demographic properties of individuals obtained from the South African antenatal survey. The three techniques are compared in terms of their computational time, accuracy and number of rules produced when applied to the Human Immunodeficiency Virus (HIV) data set. The optimised methods results are compared to a well known non-optimised discretisation method, equal-width-bin partitioning (EWB). The accuracies achieved after optimising the partitions using GA, HC and SA are 66.89%, 65.84% and 65.48% respectively, compared to the accuracy of EWB of 59.86%. In addition to rough sets providing the plausabilities of the estimated HIV status, they also provide the linguistic rules describing how the demographic parameters drive the risk of HIV.
AB - This paper presents an approach to optimise rough set partition sizes using various optimisation techniques. Three optimisation techniques are implemented to perform the granularisation process, namely, genetic algorithm (GA), hill climbing (HC) and simulated annealing (SA). These optimisation methods maximise the classification accuracy of the rough sets. The proposed rough set partition method is tested on a set of demographic properties of individuals obtained from the South African antenatal survey. The three techniques are compared in terms of their computational time, accuracy and number of rules produced when applied to the Human Immunodeficiency Virus (HIV) data set. The optimised methods results are compared to a well known non-optimised discretisation method, equal-width-bin partitioning (EWB). The accuracies achieved after optimising the partitions using GA, HC and SA are 66.89%, 65.84% and 65.48% respectively, compared to the accuracy of EWB of 59.86%. In addition to rough sets providing the plausabilities of the estimated HIV status, they also provide the linguistic rules describing how the demographic parameters drive the risk of HIV.
KW - Bioinformatics application
KW - Evolutionary optimisation techniques
KW - HIV modelling
KW - Rough set theory
UR - http://www.scopus.com/inward/record.url?scp=67649471564&partnerID=8YFLogxK
U2 - 10.1063/1.2816629
DO - 10.1063/1.2816629
M3 - Conference contribution
AN - SCOPUS:67649471564
SN - 9780735404663
T3 - AIP Conference Proceedings
SP - 248
EP - 257
BT - Computational Models For Life Sciences (CMLS '07) - 2007 International Symposium
T2 - 2007 International Symposium on Computational Models for Life Sciences, CMLS '07
Y2 - 17 December 2007 through 19 December 2007
ER -