K-ANMI: A Mutual Information Based Clustering Algorithm for Categorical Data
K-ANMI: A Mutual Information Based Clustering Algorithm for Categorical Data
lustering categorical data is an integral part of data mining and has attracted much attention recently. In this paper, we present k-ANMI, a new efficient algorithm for clustering categorical data. The k-ANMI algorithm works in a way that is similar to the popular k-means algorithm, and the goodness of clustering in each step is evaluated using a mutual information based criterion (namely, Average Normalized Mutual Information-ANMI) borrowed from cluster ensemble. Experimental results on real datasets show that k-ANMI algorithm is competitive with those state-of-art categorical data clustering algorithms with respect to clustering accuracy.
lustering categorical data is an integral part of data mining and has attracted much attention recently. In this paper, we present k-ANMI, a new efficient algorithm for clustering categorical data. The k-ANMI algorithm works in a way that is similar to the popular k-means algorithm, and the goodness of clustering in each step is evaluated using a mutual information based criterion (namely, Average Normalized Mutual Information-ANMI) borrowed from cluster ensemble. Experimental results on real datasets show that k-ANMI algorithm is competitive with those state-of-art categorical data clustering algorithms with respect to clustering accuracy.
Xiaofei Xu、何增友、Shengchun Deng
计算技术、计算机技术
lustering Categorical Data Mutual Information Cluster Ensemble Data Mining
lustering Categorical Data Mutual Information Cluster Ensemble Data Mining
Xiaofei Xu,何增友,Shengchun Deng.K-ANMI: A Mutual Information Based Clustering Algorithm for Categorical Data[EB/OL].(2005-11-04)[2025-08-02].http://www.paper.edu.cn/releasepaper/content/200511-76.点此复制
评论