PAM-lite: Fast and accurate k-medoids clustering for massive datasets

Peter O. Olukanmi, Fulufhelo Nelwamondo, Tshilidzi Marwala

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Citations (Scopus)

Abstract

The Partitioning Around Medoids (PAM) clustering algorithm is well-known for its robustness and accuracy, but it is computationally expensive. This paper proposes a fast and accurate version, named PAM-lite. Like CLARA which also addresses PAM's inefficiency, PAM-lite applies PAM to random samples. However, unlike CLARA, it does not choose one of the obtained medoid sets (which would involve evaluating each set), but simply applies PAM again to the combination of all the obtained medoids. This simple change yields accuracy and speed improvement. We discuss the rationale behind PAM-lite's approach and evaluate the algorithm on benchmark datasets. In all cases tested, PAM-lite achieves better speed-up and clustering quality than CLARA; the speed-up margin increasing with problem size. PAM-lite competes so closely with the clustering quality produced by the full PAM algorithm, that in one high cluster variance case, it beats PAM's clustering quality slightly.

Original languageEnglish
Title of host publicationProceedings - 2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa, SAUPEC/RobMech/PRASA 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages200-204
Number of pages5
ISBN (Electronic)9781728103693
DOIs
Publication statusPublished - 1 May 2019
Event2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa, SAUPEC/RobMech/PRASA 2019 - Bloemfontein, South Africa
Duration: 28 Jan 201930 Jan 2019

Publication series

NameProceedings - 2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa, SAUPEC/RobMech/PRASA 2019

Conference

Conference2019 Southern African Universities Power Engineering Conference/Robotics and Mechatronics/Pattern Recognition Association of South Africa, SAUPEC/RobMech/PRASA 2019
Country/TerritorySouth Africa
CityBloemfontein
Period28/01/1930/01/19

Keywords

  • CLARA
  • PAM
  • accurate
  • clustering
  • efficient
  • k-medoids

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Energy Engineering and Power Technology
  • Mechanical Engineering
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'PAM-lite: Fast and accurate k-medoids clustering for massive datasets'. Together they form a unique fingerprint.

Cite this