首页 | 官方网站   微博 | 高级检索  
     

快速搜索与发现密度峰值聚类算法的优化研究
引用本文:蒋礼青,张明新,郑金龙,戴娇,尚赵伟.快速搜索与发现密度峰值聚类算法的优化研究[J].计算机应用研究,2016,33(11).
作者姓名:蒋礼青  张明新  郑金龙  戴娇  尚赵伟
作者单位:常熟理工学院,常熟理工学院 计算机科学与工程学院,常熟理工学院 计算机科学与工程学院,中国矿业大学 计算机科学与技术学院,重庆大学 计算机科学与技术
基金项目:国家自然科学基金资助项目
摘    要:CFSFDP是基于密度的新聚类算法,可聚类非球形数据集,具有聚类速度快实现简单等优点。CFSFDP需人工尝试确定密度阈值dc且对一个类中存在多密度峰值的数据无法进行准确聚类,为解决该缺点,本文提出基于近邻距离曲线和类合并优化CFSFDP(简称 NM-CFSFDP)的聚类算法。首先,算法用近邻距离曲线变化情况自动确定密度阈值dc;然后,用本文提出自动确定dc的CFSFDP对数据聚类;最后用本文计算dc值的方法指导类的合并,引入内聚程度衡量参数解决了类合并后不能撤销的难题,从而实现对多密度峰值数据的正确聚类。通过实验对比,NM-CFSFDP算法确实比CFSFDP算法具有更加精确的聚类效果。

关 键 词:聚类  密度峰值  近邻距离曲线  类合并。
收稿时间:2015/7/17 0:00:00
修稿时间:2016/9/23 0:00:00

Optimization of Clustering by Fast Search and Find of Density Peaks
Jiang Liqing,Zhang Mingxin,Zheng Jinlong,Dai Jiao and Shang Zhaowei.Optimization of Clustering by Fast Search and Find of Density Peaks[J].Application Research of Computers,2016,33(11).
Authors:Jiang Liqing  Zhang Mingxin  Zheng Jinlong  Dai Jiao and Shang Zhaowei
Affiliation:School of Computer Science and Technology,China University of Mining and Technology,,School of Computer Science and Engineering,Changshu Institute of Technology,School of Computer Science and Technology,China University of Mining and Technology,Department of College of Computer Science,University of Chongqing,Chongqing
Abstract:CFSFDP algorithm was a new clustering algorithm based on density, which can cluster non spherical data sets, CFSFDP had the advantages of fast clustering speed and simple realization. But the CFSFDP algorithm need to perform multiple attempts to determine the density threshold dc and the existence of multiple density peaks of one class leaded to incorrect clustering, in view of the disadvantages, optimization of CFSFDP based on neighbor distance curve and merging clusters (for short NM-CFSFDP) algorithm was proposed. Firstly, the new algorithm gave the density threshold which named dc automatically, the dc was determined by the change of the nearest neighbor distance curve. Secondly, NM-CFSFDP used CFSFDP algorithm, which gave dc automatically, to cluster the data set, and then merged the classes that can be merged, merging operation can be dynamically revoked in the algorithm. Through the contrast experiment, the NM-CFSFDP algorithm is more accurate than the CFSFDP in clustering.
Keywords:clustering  density peaks  nearest neighbor distance curve  merging clusters
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号