首页 | 官方网站   微博 | 高级检索  
     

基于主成分分析的密度聚类算法
引用本文:范文,孙红跃,王萍. 基于主成分分析的密度聚类算法[J]. 天津城市建设学院学报, 2012, 0(1): 60-62,76
作者姓名:范文  孙红跃  王萍
作者单位:[1]天津城市建设学院电子与信息工程系,天津300384 [2]天津大学电气与自动化工程学院,天津300072
摘    要:密度聚类算法可以描述任意形状的聚类,可以有效地处理异常数据,适合处理大数据集,但不适用于高维数据集的聚类,因此提出了基于主成分分析的密度聚类算法,将DBSCAN算法应用于PCA的k个主成分张成的子空间,解决了DBSCAN算法用于高维数据集的问题.运用气象数据进行实验,结果表明:主成分个数k值的选择严重影响聚类效果,故提出k的基本选择方法,正确选择k值情况下,该算法具有较好的聚类效果.

关 键 词:聚类  密度聚类算法  主成分分析

Density Clustering Algorithm Based on Principal Component Analysis
FAN Wen,SUN Hong-yue,WANG Ping. Density Clustering Algorithm Based on Principal Component Analysis[J]. Journal of Tianjin Institute of Urban Construction, 2012, 0(1): 60-62,76
Authors:FAN Wen  SUN Hong-yue  WANG Ping
Affiliation:1.Department of Electronics and Information Engineering,TIUC,Tianjin 300384,China;2.College of Electronic En-gineering and Automation,Tianjin University,Tianjin 300072,China)
Abstract:Density clustering algorithm can describe any shape clustering and deal with abnormal data processing.It is suitable for large data set,but not for high dimensional data set,so density clustering algorithm based on principal component analysis is proposed.The DBSCAN algorithm is applied to the k principal component expanded subspace.The problem of the DBSCAN algorithm used for the high dimensional data set is solved.Meteorological data is used in the experiments.The experimental results show that principal component k influences clustering effect,so the selection method of k is put forward.With the correct choice of k,the proposed algorithm has good clustering effect.
Keywords:clustering  density clustering algorithm  principal component analysis
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号