首页 | 官方网站   微博 | 高级检索  
     

基于K-Means改进算法在微博话题发现中的应用研究
引用本文:张云伟,宋安军.基于K-Means改进算法在微博话题发现中的应用研究[J].计算机系统应用,2016,25(10):308-311.
作者姓名:张云伟  宋安军
作者单位:上海海事大学 信息工程学院, 上海 201306,上海海事大学 信息工程学院, 上海 201306
基金项目:国家自然科学基金(61502298)
摘    要:在传统的K-means算法中,聚类结果很大程度依赖于随机选择的初始聚类中心点以及人工指定的k值.为了提高聚类精度,本文提出了利用最小距离与平均聚集度来对初始聚类中心点进行选取,将层次聚类CURE算法得到的聚簇数作为k值,从而使聚类精度得到提高.最后,将改进后的K-means算法应用到微博话题发现中,通过对实验结果分析,证明该算法提高了聚类结果精度.

关 键 词:K-means  微博  话题  聚类
收稿时间:2016/2/19 0:00:00
修稿时间:2016/4/11 0:00:00

Application of Improved Algorithm Based on K-Means in Microblog Topic Discovery
ZHANG Yun-Wei and SONG An-Jun.Application of Improved Algorithm Based on K-Means in Microblog Topic Discovery[J].Computer Systems& Applications,2016,25(10):308-311.
Authors:ZHANG Yun-Wei and SONG An-Jun
Affiliation:College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China and College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
Abstract:In the traditional K-means algorithm, the clustering results greatly depend on the random selection of initial cluster centers and the artificial K values. In order to improve the clustering accuracy, this paper proposes to select the initial cluster centers by using the minimum distance and the average clustering degree. The number of clusters is obtained by the hierarchical clustering CURE algorithm as K value, so that the clustering accuracy can be improved. Finally, the improved K-means algorithm is applied to the micro-blog topic discovery. Through the analysis of the experimental results, it is proved that the algorithm can improve the accuracy of clustering results.
Keywords:K-means  microblog  topic  clustering
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号