首页 | 官方网站   微博 | 高级检索  
     

基于iTopicModel的关联文本分类算法
引用本文:梁鹏鹏,柴玉梅,王黎明.基于iTopicModel的关联文本分类算法[J].计算机工程,2011,37(21):124-125,130.
作者姓名:梁鹏鹏  柴玉梅  王黎明
作者单位:郑州大学信息工程学院,郑州,450001
基金项目:国家自然科学基金资助项目
摘    要:针对传统文本分类方法对文档间关联关系考虑不充分的问题,提出一种基于iTopicModel的关联文本分类算法。根据类信息已知的文档归属于各个主题的概率判断主题代表的类信息,利用待分类文档归属于各个主题的概率及文本信息对文档进行分类。实验结果表 明,当文档间的关联关系对类信息影响较大时,TC-iTM的分类性能优于传统文本分类方法。

关 键 词:文本分类  文档网络  主题模型  EM算法
收稿时间:2011-04-12

Relational Text Classification Algorithm Based on iTopicModel
LIANG Peng-peng,CHAI Yu-mei,WANG Li-ming.Relational Text Classification Algorithm Based on iTopicModel[J].Computer Engineering,2011,37(21):124-125,130.
Authors:LIANG Peng-peng  CHAI Yu-mei  WANG Li-ming
Affiliation:(School of Information Engineering,Zhengzhou University,Zhengzhou 450001,China)
Abstract:In order to solve the problem that traditional text classification methods do not emphasize the links among text documents enough,this paper proposes a novel text classification algorithm TC-iTM based on iTopicModel.TC-iTM uses the probability that the labeled documents are assigned to each topic to judge the category that each topic represents.TC-iTM classifies unlabelled documents by using the probability that the documents are assigned to each topic and the text information of these documents.Experimental result shows that TC-iTM outperforms the traditional text classification methods when links among documents are important to the categories of the documents in document network.
Keywords:text classification  document network  topic model  EM algorithm
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号