首页 | 官方网站   微博 | 高级检索  
     

一种新的兼类文本分类方法
引用本文:秦玉平,陈一荻,王春立,王秀坤.一种新的兼类文本分类方法[J].计算机科学,2011,38(11):204-205,224.
作者姓名:秦玉平  陈一荻  王春立  王秀坤
作者单位:1. 渤海大学工学院 锦州121000
2. 大连海事大学信息科学技术学院 大连116026
3. 大连理工大学计算机科学与技术学院 大连116024
基金项目:国家自然科学基金项目(60603023); 国家基础研究重大项目(973)研究专项(2001CCA00700); 辽宁省教育厅重点实验室项目(LS2010180)资助
摘    要:提出了一种基于超椭球的兼类文本分类算法。对每一类样本,在特征空间求得一个包围该类样本的最小超椭球,使得各类样本之间通过超椭球隔开。对待分类样本,通过判断其是否在超椭球内确定其类别。若没有超椭球包围待分类样本,则通过隶属度确定其所属类别。在标准数据集Reuters 21578上的实验结果表明,该方法较超球方法提高了分类精度和分类速度。

关 键 词:超椭球,兼类分类,缩放因子,隶属度

New Multi-label Text Classification Algorithm
QIN Yu-ping,CHEN Yi-di,WANG Chun-li,WANG Xiu-kun.New Multi-label Text Classification Algorithm[J].Computer Science,2011,38(11):204-205,224.
Authors:QIN Yu-ping  CHEN Yi-di  WANG Chun-li  WANG Xiu-kun
Affiliation:QIN Yu-ping1 CHEN Yi-di1 WANG Chun-li2 WANG Xiu-kun3(College of Engineering,Bohai University,Jinzhou 121000,China)1(College of Information Science and Technology,Dalian Maritime University,Dalian 116026,China)2(School of Computer Science and Technology,Dalian University of Technology,Dalian 116024,China)3
Abstract:A new multi-label text classification algorithm based on hyper ellipsoidal was proposed in this paper. For every class, the smallest hyper ellipsoidal that contains the samples of the class is structured, which can divide the class samples from others. For the sample to be classified, its class is confirmed by the hyper ellipsoidal that surrounds it. If the sample is not surrounded by any hyper ellipsoidal, the membership is used to confirmed its class. The experiments were done on Reuters 21578 and the experiment results show that the algorithm has a higher performance on classificalion speed and classification precision compare with hyper sphere algorithm.
Keywords:Hyper ellipsoidal  Multi-label classification  Extension factor  Membership
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机科学》浏览原始摘要信息
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号