基于模糊邻域的比较密度峰值算法 Clustering by Comparitive Density Peaks using FuzzyNeighborhood期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于模糊邻域的比较密度峰值算法

引用本文：	李昕,雷迎科.基于模糊邻域的比较密度峰值算法[J].信号处理,2019,35(11):1919-1928.

作者姓名：	李昕雷迎科

作者单位：	国防科技大学电子对抗学院

摘要：	聚类作为机器学习中一种重要的无监督学习方式，在图像处理及生物基因分类上具有广泛的应用。《Clustering by fast search and find of density peaks》(DPC)提出通过寻找密度峰对数据进行分类，它既不需要迭代过程，也不需要太多参数输入。但DPC算法在球形数据集上表现较差，容易忽略潜在的聚类中心，且需要人工参与聚类中心选取。针对上述问题，本文采用模糊邻域关系计算数据密度，采用比较距离代替DPC算法中的相对距离。通过对机器学习数据集的实验，将本文提出的算法同DBSCN、OPTICS、DPC在准确率和调整兰德指数上进行比较。实验结果表明本文提出的算法可行有效
关键词：	无监督机器学习密度峰值聚类算法模糊聚类算法比较距离
收稿时间：	2019-06-13
Clustering by Comparitive Density Peaks using FuzzyNeighborhood

Affiliation:	Electronic Countermeasures Institution of National University of Defense Technology

Abstract:	As an important unsupervised learning method in machine learning, clustering has a wide range of applications in image processing and biological gene classification. "Clustering by fast search and find of density peaks" (DPC) proposes to classify data by looking for density peaks, which does not require an iterative process or too many parameter inputs. However, the DPC algorithm performs poorly on the spherical dataset, and it is easy to ignore the potential cluster center, and needs to manually participate in the cluster center selection. In view of the above problems, this paper uses the fuzzy neighborhood relationship to calculate the data density, and uses the comparative distance instead of the relative distance in the DPC algorithm. Through the experiment of machine learning data set, we compared our algorithm with DBSCN, OPTICS and DPC in accuracy and ARI. The experimental results show that the proposed algorithm is feasible and effective.

Keywords:

	点击此处可从《信号处理》浏览原始摘要信息
	点击此处可从《信号处理》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏