首页 | 官方网站   微博 | 高级检索  
     

基于混合聚类的中文词聚类
引用本文:史金成,程转流.基于混合聚类的中文词聚类[J].微计算机信息,2010(15).
作者姓名:史金成  程转流
作者单位:铜陵学院数学与计算机科学系;
基金项目:安徽省高校省级自然科学研究项目(KJ2010B455); 铜陵学院院级科研项目(2009tlxy21)
摘    要:文本聚类在文本挖掘和信息检索系统中发挥着重要的作用,而词聚类是文本聚类的基础。提出了一种基于混合聚类的中文词聚类方法,它将层次聚类和概念聚类结合起来,以缩短整个聚类时间。首先对预处理后的词集进行初始聚类,然后从每个类中各取一个出现次数最多的词组成新的词集,最后对该词集进行再聚类。实验表明,这种方法有效降低了中文词聚类的时间复杂度。

关 键 词:词聚类  层次聚类  概念聚类  混合聚类  

Chinese Word Clustering Based on hybrid Clustering
SHI Jin-cheng CHENG Zhuan-liu.Chinese Word Clustering Based on hybrid Clustering[J].Control & Automation,2010(15).
Authors:SHI Jin-cheng CHENG Zhuan-liu
Affiliation:SHI Jin-cheng CHENG Zhuan-liu(Department of Mathematics , Computer Science,Tongling College,Tongling Anhui,244000,China)
Abstract:Text clustering based on word clustering plays an important role in text mining and information search system.A Chinese word clustering method based on hybrid clustering was proposed,which conjugates hierarchical clustering and conceptual clustering to reduce the whole clustering time.The method firstly clustered word set which had been pretreated,and then used the words whose occurence number was maximum in each cluster to compose a new word set.In the end,the new word set was clustered again.The experimen...
Keywords:word clustering  hierarchical clustering  conceptual clustering  hybrid clustering  
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号