K-Means-MIND: An Efficient Alternative to Repetitive k-Means Runs

Peter O. Olukanmi, Fulufhelo Nelwamondo, Tshilidzi Marwala

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Citations (Scopus)

Abstract

The problem of local minimum in k-means clustering, is commonly addressed by running the algorithm repeatedly in order to choose the best run. Although effective, the approach is computationally expensive. In this paper, we observe that the approach is effectively a comparison among different initializations. Thus, if there is a way to compare these initializations ab initio, there will be no need for repeated clustering. We propose such a technique in this paper. Specifically, we choose the initialization with the largest minimum inter-center distance (MIND), as the 'best' one. In other words, our technique is a general approach to improving existing seeding techniques. We demonstrate the concept with MIND-optimized versions of two standard algorithms: k-means and k-means++. Experiments show that in addition to drastic efficiency gain when compared to repetitive k-means, our approach improves the accuracy of the standard versions of these algorithms.

Original languageEnglish
Title of host publication2020 7th International Conference on Soft Computing and Machine Intelligence, ISCMI 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages172-176
Number of pages5
ISBN (Electronic)9781728175591
DOIs
Publication statusPublished - 14 Nov 2020
Event7th International Conference on Soft Computing and Machine Intelligence, ISCMI 2020 - Virtual, Stockholm, Sweden
Duration: 14 Nov 202015 Nov 2020

Publication series

Name2020 7th International Conference on Soft Computing and Machine Intelligence, ISCMI 2020

Conference

Conference7th International Conference on Soft Computing and Machine Intelligence, ISCMI 2020
Country/TerritorySweden
CityVirtual, Stockholm
Period14/11/2015/11/20

Keywords

  • clustering
  • k-means
  • k-means++
  • local optimization
  • multi-start
  • repeated
  • restart

ASJC Scopus subject areas

  • Artificial Intelligence
  • Software
  • Computational Mathematics
  • Modeling and Simulation
  • Numerical Analysis

Fingerprint

Dive into the research topics of 'K-Means-MIND: An Efficient Alternative to Repetitive k-Means Runs'. Together they form a unique fingerprint.

Cite this