Skip to main content
Cluster Rendering of Skewed Datasets via Visualization
SAC '03 Proceedings of the 2003 ACM Symposium on Applied Computing
  • Keke Chen, Wright State University - Main Campus
  • Ling Liu
Document Type
Conference Proceeding
Publication Date

Information Visualization is commonly recognized as a useful method for understanding sophistication in large datasets. In this paper, we introduce a flexible clustering approach with visualization techniques, aiming at the datasets that have skewed cluster distribution. This paper has three contributions. First, we propose a framework Vista that incorporates information visualization methods into the clustering process in order to enhance the understanding of the intermediate clustering results and allow user to revise the clustering results easily. Second, we develop a visualization model that maps multidimensional dataset to 2D visualizations while preserving or partially preserving clusters. Third, based on the visualization model, a set of operating rules are proposed to guide the user rendering clusters efficiently. Experiments show that the Vista system can yield lower error rates for real datasets than typical automated algorithms.


This paper was presented at the Symposium on Applied Computing(ACM SAC03), Melbourne, FL, March 2003.

Citation Information
Keke Chen and Ling Liu. "Cluster Rendering of Skewed Datasets via Visualization" SAC '03 Proceedings of the 2003 ACM Symposium on Applied Computing (2003) p. 909 - 916 ISSN: 1-58113-624-2
Available at: