基于方形领域的网格密度聚类算法
针对大数据聚类低效的问题,提出一种方形邻域快速网格密度聚类算法SGBSCAN (square-neighborhood and Grid-based DBSCAN)。首先给出方形邻域密度聚类定义,利用方形邻域代替圆形邻域,降低时间复杂度;其次提出方形邻域密度聚类的Grid概念,快速确定高密度区域内核心点与数据点之间的密度关系;最后提出Grid密度簇,利用网格之间的关系加快密度簇的形成。算法应用于16个数据集,分别与已有文献算法进行对比,结果表明所提算法在聚类效率方面有显著提升,数据量越大算法效率提升越明显,且所提算法适用于多维数据的聚类。
o solve the problem of low efficiency of large data clustering, this paper proposes a fast grid density clustering algorithm SGBSCAN(Square-neighborhood and Grid-based DBSCAN) . Firstly, this paper gave the definition of square neighborhood density clustering , and used the square neighborhood instead of the circular neighborhood to reduce the time complexity. Secondly, this paper proposed the concept of grid of square neighborhood density clustering , to determine the density relationship between core points and data points in high density region quickly. Finally, this paper proposed the Grid density cluster, the method used the relationship between the grid to accelerate the formation of density clusters. The algorithm made 16 data sets and compared with the existing literature algorithms. The results shows that the algorithm has a significant improvement in clustering efficiency. The larger the data volume, the more obvious the efficiency of the algorithm, and the algorithm is suitable for multidimensional data clustering.
朱合隆、兰红
计算技术、计算机技术
聚类分析密度聚类方形邻域Grid 网格网格簇
朱合隆,兰红.基于方形领域的网格密度聚类算法[EB/OL].(2019-05-10)[2025-08-02].https://chinaxiv.org/abs/201905.00024.点此复制
评论