|国家预印本平台
首页|K-ANMI: A Mutual Information Based Clustering Algorithm for Categorical Data

K-ANMI: A Mutual Information Based Clustering Algorithm for Categorical Data

K-ANMI: A Mutual Information Based Clustering Algorithm for Categorical Data

中文摘要英文摘要

lustering categorical data is an integral part of data mining and has attracted much attention recently. In this paper, we present k-ANMI, a new efficient algorithm for clustering categorical data. The k-ANMI algorithm works in a way that is similar to the popular k-means algorithm, and the goodness of clustering in each step is evaluated using a mutual information based criterion (namely, Average Normalized Mutual Information-ANMI) borrowed from cluster ensemble. Experimental results on real datasets show that k-ANMI algorithm is competitive with those state-of-art categorical data clustering algorithms with respect to clustering accuracy.

lustering categorical data is an integral part of data mining and has attracted much attention recently. In this paper, we present k-ANMI, a new efficient algorithm for clustering categorical data. The k-ANMI algorithm works in a way that is similar to the popular k-means algorithm, and the goodness of clustering in each step is evaluated using a mutual information based criterion (namely, Average Normalized Mutual Information-ANMI) borrowed from cluster ensemble. Experimental results on real datasets show that k-ANMI algorithm is competitive with those state-of-art categorical data clustering algorithms with respect to clustering accuracy.

Xiaofei Xu、何增友、Shengchun Deng

计算技术、计算机技术

lustering Categorical Data Mutual Information Cluster Ensemble Data Mining

lustering Categorical Data Mutual Information Cluster Ensemble Data Mining

Xiaofei Xu,何增友,Shengchun Deng.K-ANMI: A Mutual Information Based Clustering Algorithm for Categorical Data[EB/OL].(2005-11-04)[2025-08-02].http://www.paper.edu.cn/releasepaper/content/200511-76.点此复制

评论