Article
Accuracy of Clustering Prediction of PAM and K-Modes Algorithms
Advances in Information and Communication Networks, FICC 2018 (2018)
  • Marc Gregory Dixon, Rowan University
  • Stanimir Genov, Rowan University
  • Vasil Hnatyshin, Rowan University
  • Umashanger Thayasivam, Rowan University
Abstract
Grouping (or clustering) data points with similar characteristics is important when working with the data that frequently appears in everyday life. Data scientists cluster numerical data based on the notion of distance, usually computed using the Euclidean measure. However, many datasets consist of categorical values, which require alternative methods for grouping the data. For this reason, clustering of categorical data employs methods that rely on similarity between values rather than on distance. This work studies the ability of different clustering algorithms and several definitions of similarity to organize categorical data into groups.
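To illustrate the contrast between distance and similarity drawn above, the following minimal sketch uses simple matching dissimilarity, one common measure for categorical records. The abstract does not say which similarity definitions the paper evaluates, so the measure, the function names `matching_dissimilarity` and `column_modes`, and the sample records are all assumptions made for illustration only.

```python
from collections import Counter


def matching_dissimilarity(a, b):
    """Count the attributes on which two categorical records differ."""
    if len(a) != len(b):
        raise ValueError("records must have the same number of attributes")
    return sum(x != y for x, y in zip(a, b))


def column_modes(records):
    """Return the most frequent value of each attribute across the records."""
    return tuple(Counter(col).most_common(1)[0][0] for col in zip(*records))


# Hypothetical categorical records (colour, size, shape); not from the paper.
records = [
    ("red", "small", "round"),
    ("red", "large", "round"),
    ("blue", "small", "square"),
]

# Records 0 and 1 differ only in size; records 0 and 2 differ in colour and shape.
d01 = matching_dissimilarity(records[0], records[1])
d02 = matching_dissimilarity(records[0], records[2])

# A k-modes style cluster representative: the per-attribute mode.
mode = column_modes(records)
```

A Euclidean distance is undefined for values such as "red" and "blue", whereas counting mismatches needs only equality tests. In a k-modes style procedure the per-attribute mode plays the role that the mean plays for numerical data.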
Publication Date
April 5, 2018
DOI
10.1007/978-3-030-03402-3_22
Citation Information
Marc Gregory Dixon, Stanimir Genov, Vasil Hnatyshin and Umashanger Thayasivam. "Accuracy of Clustering Prediction of PAM and K-Modes Algorithms." Advances in Information and Communication Networks, FICC 2018 (2018), pp. 330-345.
Available at: http://works.bepress.com/umashanger-thayasivam/18/