首页 | 官方网站   微博 | 高级检索  
     

自适应聚合策略优化的密度峰值聚类算法
引用本文:钱雪忠,金辉.自适应聚合策略优化的密度峰值聚类算法[J].计算机科学与探索,2020,14(4):712-720.
作者姓名:钱雪忠  金辉
作者单位:江南大学 物联网工程学院 物联网技术应用教育部工程研究中心,江苏 无锡 214122;江南大学 物联网工程学院 物联网技术应用教育部工程研究中心,江苏 无锡 214122
基金项目:The National Natural Science Foundation of China under Grant No. 61673193 (国家自然科学基金);the Fundamental Research Fundsfor the Central Universities of China under Grant Nos. JUSRP51635B, JUSRP51510 (中央高校基本科研业务费专项资金)
摘    要:针对密度峰值聚类算法受人为干预影响较大和参数敏感的问题,即不正确的截断距离dc会导致错误的初始聚类中心,而且在某些情况下,即使设置了适当的dc值,仍然难以从决策图中人为选择初始聚类中心。为克服这些缺陷,提出一种新的基于密度峰值的聚类算法。该算法首先根据K近邻的思想来确定数据点的局部密度,然后提出一种新的自适应聚合策略,即首先通过算法给出阈值判断初始类簇中心,然后依据离初始类簇中心最近分配剩余点,最后通过类簇间密度可达来合并相似类簇。在实验中,该算法在合成和实际数据集中的表现比DPC、DBSCAN、KNNDPC和K-means算法要好,能有效提高聚类准确率和质量。

关 键 词:密度峰  K近邻(KNN)  局部密度  合并策略  类簇间密度可达

Optimized Density Peak Clustering Algorithm by Adaptive Aggregation Strategy
QIAN Xuezhong,JIN Hui.Optimized Density Peak Clustering Algorithm by Adaptive Aggregation Strategy[J].Journal of Frontier of Computer Science and Technology,2020,14(4):712-720.
Authors:QIAN Xuezhong  JIN Hui
Affiliation:(Engineering Research Center of Internet of Things Technology Applications,Ministry of Education,School of Internet of Things Engineering,Jiangnan University,Wuxi,Jiangsu 214122,China)
Abstract:Aiming at the problem that the density peak clustering algorithm is greatly influenced by human intervention and parameter is sensitive,that is the improper selection of its parameter cutoff distance dc will lead to the wrong selection of initial cluster centers.And in some cases,even the proper value of dc is set,initial cluster centers are still difficult to be selected from the decision graph artificially.To overcome these defects,a new clustering algorithm based on density peak is proposed.Firstly,the algorithm determines the local density of data points according to the idea of K-nearest neighbors,and then a new adaptive aggregation strategy is proposed,which firstly determines the initial cluster center by the threshold of the algorithm,then allocates the remaining points according to the nearest cluster center,and finally merges the similar clusters by the density reachable between the clusters.In the experiment,the algorithm performs better than the DPC,DBSCAN,KNNDPC and K-means algorithm in the synthetic and actual datasets,and the algorithm can effectively improve clustering accuracy and quality.
Keywords:density peak  K-nearest neighbor(KNN)  local density  merging strategy  clustering density reachable
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号