Categorical data visualization and clustering using subjective factors

Chia Hui Chang, Zhi Kai Ding

研究成果: 雜誌貢獻期刊論文同行評審

24 引文 斯高帕斯(Scopus)


Clustering is an important data mining problem. However, most earlier work on clustering focused on numeric attributes which have a natural ordering to their attribute values. Recently, clustering data with categorical attributes, whose attribute values do not have a natural ordering, has received more attention. A common issue in cluster analysis is that there is no single correct answer to the number of clusters, since cluster analysis involves human subjective judgement. Interactive visualization is one of the methods where users can decide a proper clustering parameters. In this paper, a new clustering approach called CDCS (Categorical Data Clustering with Subjective factors) is introduced, where a visualization tool for clustered categorical data is developed such that the result of adjusting parameters is instantly reflected. The experiment shows that CDCS generates high quality clusters compared to other typical algorithms.

頁(從 - 到)243-262
期刊Data and Knowledge Engineering
出版狀態已出版 - 6月 2005


深入研究「Categorical data visualization and clustering using subjective factors」主題。共同形成了獨特的指紋。