TY - GEN
T1 - Rough set theory for the treatment of incomplete data
AU - Nelwamondo, Fulufhelo V.
AU - Marwala, Tshilidzi
PY - 2007
Y1 - 2007
N2 - This paper proposes an algorithm based on rough set theory for missing data estimation. This paper also applies a rough set technique for missing data estimation to a large and real database for the first time. It is envisaged in this work that in large databases, it is more likely that the missing values could be correlated to some other variables observed somewhere in the same data. Instead of approximating missing data, it might be cheaper to identify indiscernibility relations between the observed data instances and those that contain missing attributes. Results obtained using the HIV database are acceptable with accuracies ranging from 74.7% to 100%. One drawback of this method is that it makes no extrapolation or interpolation and as a result, can only be used if the missing case is simmilar or related to another case with more observations.
AB - This paper proposes an algorithm based on rough set theory for missing data estimation. This paper also applies a rough set technique for missing data estimation to a large and real database for the first time. It is envisaged in this work that in large databases, it is more likely that the missing values could be correlated to some other variables observed somewhere in the same data. Instead of approximating missing data, it might be cheaper to identify indiscernibility relations between the observed data instances and those that contain missing attributes. Results obtained using the HIV database are acceptable with accuracies ranging from 74.7% to 100%. One drawback of this method is that it makes no extrapolation or interpolation and as a result, can only be used if the missing case is simmilar or related to another case with more observations.
UR - http://www.scopus.com/inward/record.url?scp=50249131553&partnerID=8YFLogxK
U2 - 10.1109/FUZZY.2007.4295389
DO - 10.1109/FUZZY.2007.4295389
M3 - Conference contribution
AN - SCOPUS:50249131553
SN - 1424412102
SN - 9781424412105
T3 - IEEE International Conference on Fuzzy Systems
BT - 2007 IEEE International Conference on Fuzzy Systems, FUZZY
T2 - 2007 IEEE International Conference on Fuzzy Systems, FUZZY
Y2 - 23 July 2007 through 26 July 2007
ER -