Performance evaluation of sampling-based large-scale clustering algorithms

Peter O. Olukanmi, Fulufhelo Nelwamondo, Tshilidzi Marwala

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Using benchmark datasets, we study the performances of three efficient clustering algorithms which find cluster centers using a fixed number of random samples. The algorithms are also compared with two other (well-known) algorithms, namely k-means and PAM. One of the efficient algorithms, CLARA, is well-known while the other two, k-means-lite and PAM-lite, were introduced recently. CLARA and PAM-lite are based on the k-medoids approach, while k-means-lite adopts the k-means approach. The study shows that k-means-lite is the most efficient, followed by PAM-lite which is faster than CLARA. PAM-lite exhibits the best balance of efficiency and accuracy; it produces the most competitive results relative to PAM which is the most accurate but most inefficient.

Original languageEnglish
Title of host publicationProceedings - 2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa, SAUPEC/RobMech/PRASA 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages194-199
Number of pages6
ISBN (Electronic)9781728103693
DOIs
Publication statusPublished - 1 May 2019
Event2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa, SAUPEC/RobMech/PRASA 2019 - Bloemfontein, South Africa
Duration: 28 Jan 201930 Jan 2019

Publication series

NameProceedings - 2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa, SAUPEC/RobMech/PRASA 2019

Conference

Conference2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa, SAUPEC/RobMech/PRASA 2019
Country/TerritorySouth Africa
CityBloemfontein
Period28/01/1930/01/19

Keywords

  • CLARA
  • PAM
  • PAM-lite
  • accurate
  • efficient
  • k-means
  • k-means-lite
  • k-medoids
  • large datasets

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Energy Engineering and Power Technology
  • Mechanical Engineering
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Performance evaluation of sampling-based large-scale clustering algorithms'. Together they form a unique fingerprint.

Cite this