首页 | 官方网站   微博 | 高级检索  
     

基于自适应近邻参数的密度峰聚类算法
引用本文:周欢欢,郑伯川,张征,张琦.基于自适应近邻参数的密度峰聚类算法[J].计算机应用,2022,42(5):1464-1471.
作者姓名:周欢欢  郑伯川  张征  张琦
作者单位:西华师范大学 数学与信息学院,四川 南充 637009
西华师范大学 计算机学院,四川 南充 637009
基金项目:国家自然科学基金资助项目(62176217)~~;
摘    要:针对基于共享最近邻的密度峰聚类算法中的近邻参数需要人为设定的问题,提出了一种基于自适应近邻参数的密度峰聚类算法。首先,利用所提出的近邻参数搜索算法自动获得近邻参数;然后,通过决策图选取聚类中心;最后,根据所提出的代表点分配策略,先分配代表点,后分配非代表点,从而实现所有样本点的聚类。将所提出的算法与基于共享最近邻的快速密度峰搜索聚类(SNN?DPC)、基于密度峰值的聚类(DPC)、近邻传播聚类(AP)、对点排序来确定聚类结构(OPTICS)、基于密度的噪声应用空间聚类(DBSCAN)和K-means这6种算法在合成数据集以及UCI数据集上进行聚类结果对比。实验结果表明,所提出的算法在调整互信息(AMI)、调整兰德系数(ARI)和FM指数(FMI)等评价指标上整体优于其他6种算法。所提算法能自动获得有效的近邻参数,且能较好地分配簇边缘区域的样本点。

关 键 词:共享最近邻  局部密度  密度峰聚类    content-type='mathml'>  xmlns:mml='http://www.w3.org/1998/Math/MathML'  id='M1'>k  xmlns:xlink='http://www.w3.org/1999/xlink'  xlink:href='1001-9081-2022-42-5-1464/A2D008F5-AF13-45b1-A2D6-559A197C5E19-M001.jpg'/>-近邻  逆近邻  
收稿时间:2021-05-11
修稿时间:2021-08-27

Density peak clustering algorithm based on adaptive nearest neighbor parameters
Huanhuan ZHOU,Bochuan ZHENG,Zheng ZHANG,Qi ZHANG.Density peak clustering algorithm based on adaptive nearest neighbor parameters[J].journal of Computer Applications,2022,42(5):1464-1471.
Authors:Huanhuan ZHOU  Bochuan ZHENG  Zheng ZHANG  Qi ZHANG
Affiliation:School of Mathematics and Information,China West Normal University,Nanchong Sichuan 637009,China
School of Computer Science,China West Normal University,Nanchong Sichuan 637009,China
Abstract:Aiming at the problem that the nearest neighbor parameters need to be set manually in density peak clustering algorithm based on shared nearest neighbor, a density peak clustering algorithm based on adaptive nearest neighbor parameters was proposed. Firstly, the proposed nearest neighbor parameter search algorithm was used to automatically obtain the nearest neighbor parameters. Then, the clustering centers were selected through the decision diagram. Finally, according to the proposed allocation strategy of representative points, all sample points were clustered through allocating the representative points and the non-representative points sequentially. The clustering results of the proposed algorithm was compared with those of the six algorithms such as Shared-Nearest-Neighbor-based Clustering by fast search and find of Density Peaks (SNN?DPC), Clustering by fast search and find of Density Peaks (DPC), Affinity Propagation (AP), Ordering Points To Identify the Clustering Structure (OPTICS), Density-Based Spatial Clustering of Applications with Noise (DBSCAN), and K-means on the synthetic datasets and UCI datasets. Experimental results show that, the proposed algorithm is better than the other six algorithms on the evaluation indicators such as Adjusted Mutual Information (AMI), Adjusted Rand Index (ARI) and Fowlkes and Mallows Index (FMI). The proposed algorithm can automatically obtain the effective nearest neighbor parameters, and can better allocate the sample points in the edge region of the cluster.
Keywords:shared nearest neighbor  local density  density peak clustering  k-neighbor  inverse neighbor  
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号