TY - GEN
T1 - Sensed Outlier Detection for Water Monitoring Data and a Comparative Analysis of Quantization Error Using Kohonen Self-Organizing Maps
AU - Dogo, E. M.
AU - Nwulu, N. I.
AU - Twala, B.
AU - Aigbavboa, C. O.
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/12
Y1 - 2018/12
N2 - Measurement values obtained from sensors deployed in the field are sometimes prone to deviation from known patterns of the sensed data which is referred to as outlier or anomalous readings. The reasons for this outlier may include noise, faulty sensor errors, environmental events and cyber-attack on the sensor network, resulting in faulty and missing data that greatly affects quality of the raw data and its subsequent analysis. This paper employs the Self-Organizing Maps (SOM) algorithm to visualise and interpret clusters of sensed data obtained from fresh water monitoring sites, with patterns of similar expressions in a graphical form. With the aim of detecting potential anomalous sensed data, so that they could be investigated and possibly removed to guarantee the quality of the overall dataset. Furthermore, a comparative study of the effects of four different well known neighborhood functions (gaussian, bubble, triangle and mexican hat) with varying neighborhood radius (σ) and learning rate (η) values on Quantization Error (QE) metric was conducted. From the experiment conducted a 3.45% potentially anomalous sensed data were discovered from the entire dataset, in addition, our initial finding suggests a very insignificant variation of the QE based on our dataset and the experiments conducted.
AB - Measurement values obtained from sensors deployed in the field are sometimes prone to deviation from known patterns of the sensed data which is referred to as outlier or anomalous readings. The reasons for this outlier may include noise, faulty sensor errors, environmental events and cyber-attack on the sensor network, resulting in faulty and missing data that greatly affects quality of the raw data and its subsequent analysis. This paper employs the Self-Organizing Maps (SOM) algorithm to visualise and interpret clusters of sensed data obtained from fresh water monitoring sites, with patterns of similar expressions in a graphical form. With the aim of detecting potential anomalous sensed data, so that they could be investigated and possibly removed to guarantee the quality of the overall dataset. Furthermore, a comparative study of the effects of four different well known neighborhood functions (gaussian, bubble, triangle and mexican hat) with varying neighborhood radius (σ) and learning rate (η) values on Quantization Error (QE) metric was conducted. From the experiment conducted a 3.45% potentially anomalous sensed data were discovered from the entire dataset, in addition, our initial finding suggests a very insignificant variation of the QE based on our dataset and the experiments conducted.
KW - Quantization Error (QE)
KW - Self Organizing Maps (SOM)
KW - outlier detection
KW - performance measure
KW - unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=85070378687&partnerID=8YFLogxK
U2 - 10.1109/CTEMS.2018.8769276
DO - 10.1109/CTEMS.2018.8769276
M3 - Conference contribution
AN - SCOPUS:85070378687
T3 - Proceedings of the International Conference on Computational Techniques, Electronics and Mechanical Systems, CTEMS 2018
SP - 427
EP - 430
BT - Proceedings of the International Conference on Computational Techniques, Electronics and Mechanical Systems, CTEMS 2018
A2 - Niranjan, S. K.
A2 - Desai, Veena
A2 - Rajpurohit, Vijay S.
A2 - Nadkatti, M N
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 1st International Conference on Computational Techniques, Electronics and Mechanical Systems, CTEMS 2018
Y2 - 21 December 2018 through 23 December 2018
ER -