一种新的聚类集成方法
New Method for Clustering Ensembles
近年来,聚类集成技术的优势已经得到众多研究者的认可,越来越多的研究者投入到聚类集成的研究中来,而聚类集成的关键问题是构造具有多样性的数据划分和设计合适的共识函数(Consensus Function)。本文给出了一种基于SEAM算法的共识函数计算方法,得到了一种新的集成聚类算法,并通过实际数据集的测试,验证了该算法的性能好于已有的基于图的集成聚类算法CSPA、HGPA和MCLA。
In recent years, clustering ensembles have attracted much attention since they outperforms the traditional clustering methods. A lot of researches have been done both on constructing the individual partitions and on designing the consensus functions. This paper focuses on the second aspect. Namely, how to combine the multiple data partitions to get a consistent partition for a given dataset using the information obtained in the different clusterings. In this paper, we propose a new method of combining multiple partitions by using the Squared Error Adjacent Matrix (SEAM) algorithm. We conducted several experiments of the proposed method both on the synthetic and the real-world datasets and compared the method with the graph-based consensus functions, CSPA, HGPA, and MCLA. Experimental results show that the proposed method is better or comparable to the graph-based methods.
杨丽丽、于剑、贾彩燕
计算技术、计算机技术
数据挖掘聚类聚类集成
data miningclusteringclustering ensemble
杨丽丽,于剑,贾彩燕.一种新的聚类集成方法[EB/OL].(2011-03-18)[2025-08-10].http://www.paper.edu.cn/releasepaper/content/201103-792.点此复制
评论