TY - CHAP
T1 - Missing Data Estimation Using Bat Algorithm
AU - Leke, Collins Achepsah
AU - Marwala, Tshilidzi
N1 - Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
PY - 2019
Y1 - 2019
N2 - In recent years, several methods for missing data estimation have been developed. Real-world datasets possess the properties of big data such as volume, velocity and variety. With an increase in volume which includes sample size and dimensionality, existing imputation methods have become less effective and accurate. Much attention has been given to narrow artificial intelligence frameworks courtesy of their efficiency in low-dimensional settings. However, with an increase in dimensionality, these methods yield unrepresentative imputations with an impact on decision-making processes. Therefore, in this chapter, we present a new framework for missing data imputation in high-dimensional datasets. A deep learning technique is used in conjunction with a swarm intelligence algorithm. The performance of the proposed technique is experimentally tested and compared against other existing methods on an off-line dataset. The results obtained showed promising potential with slightly longer execution times, which are a worthy trade-off when accuracy is of importance.
AB - In recent years, several methods for missing data estimation have been developed. Real-world datasets possess the properties of big data such as volume, velocity and variety. With an increase in volume which includes sample size and dimensionality, existing imputation methods have become less effective and accurate. Much attention has been given to narrow artificial intelligence frameworks courtesy of their efficiency in low-dimensional settings. However, with an increase in dimensionality, these methods yield unrepresentative imputations with an impact on decision-making processes. Therefore, in this chapter, we present a new framework for missing data imputation in high-dimensional datasets. A deep learning technique is used in conjunction with a swarm intelligence algorithm. The performance of the proposed technique is experimentally tested and compared against other existing methods on an off-line dataset. The results obtained showed promising potential with slightly longer execution times, which are a worthy trade-off when accuracy is of importance.
UR - http://www.scopus.com/inward/record.url?scp=85132897687&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-01180-2_3
DO - 10.1007/978-3-030-01180-2_3
M3 - Chapter
AN - SCOPUS:85132897687
T3 - Studies in Big Data
SP - 41
EP - 56
BT - Studies in Big Data
PB - Springer Science and Business Media Deutschland GmbH
ER -