首页 | 官方网站   微博 | 高级检索  
     

一种基于网格划分的密度峰值聚类改进算法
引用本文:江平平,曾庆鹏.一种基于网格划分的密度峰值聚类改进算法[J].计算机应用与软件,2019,36(8):268-274,280.
作者姓名:江平平  曾庆鹏
作者单位:南昌大学信息工程学院 江西 南昌 330031;南昌大学信息工程学院 江西 南昌 330031
基金项目:江西省自然科学基金项目
摘    要:针对密度峰值聚类(Density Peak Clustering,DPC)算法具有时空复杂度高而降低了对大规模数据集聚类的有效性,以及依靠决策图人工选取聚类中心等缺点,提出基于网格的密度峰值聚类(G-DPC)算法。采用基于网格的方式进行网格划分,用网格代表点替换网格单元整体;对各代表点聚类,通过改进的自适应方法选出核心网格代表点作为聚类中心;将剩余点归类,剔除噪声点。仿真实验验证了该算法对大规模数据集和高维数据集聚类的有效性。

关 键 词:密度峰值  网格  聚类算法

AN IMPROVED DENSITY PEAK CLUSTERING ALGORITHM BASED ON GRID
Jiang Pingping,Zeng Qingpeng.AN IMPROVED DENSITY PEAK CLUSTERING ALGORITHM BASED ON GRID[J].Computer Applications and Software,2019,36(8):268-274,280.
Authors:Jiang Pingping  Zeng Qingpeng
Affiliation:(School of Information Engineering,Nanchang University,Nanchang 330031,Jiangxi,China)
Abstract:The high spatial complexity of density peak clustering algorithm(DPC) reduces the validity of large data clustering,and the cluster center needs to be manually selected by decision graph.To solve these problems,we proposed an improved DPC algorithm based on grid(G-DPC).This algorithm adopted the grid clustering algorithm to divide the grid and replaced the grid cell with grid representative point.Then,all grid representative points were clustered,and the core grid representative points were selected as the cluster center by the improved adaptive method.Finally,the remaining points were classified and the noise points were eliminated.Simulation results show that the algorithm is effective for large data sets and high dimensional data clustering.
Keywords:Density peaks  Grid  Clustering
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号