首页 | 官方网站   微博 | 高级检索  
     

使用特征分辨率和差别对象对集的特征选择
引用本文:吴洪丽,朱颢东,周瑞琼.使用特征分辨率和差别对象对集的特征选择[J].计算机工程与应用,2010,46(16):160-162.
作者姓名:吴洪丽  朱颢东  周瑞琼
作者单位:1. 海南师范大学,信息科学技术学院,海口,571158;中国科学院,成都计算机应用研究所,成都,610041
2. 冲国科学院,成都计算机应用研究所,成都,610041
3. 海南师范大学,信息科学技术学院,海口,571158
基金项目:海南省自然科学基金,四川省科技计划项目 
摘    要:特征选择是文本分类的关键步骤之一,所选特征子集的优劣直接影响文本分类的结果。首先简单分析了几种经典的特征选择方法,总结了它们的不足,然后提出了特征分辨率的概念,并提出了一个基于差别对象对集的属性约简算法,最后把该属性约简算法同特征分辨率结合起来,提出了一个新的特征选择方法。该方法首先利用特征分辨率进行特征初选以过滤掉一些词条来降低特征空间的稀疏性,然后利用所提属性约简算法消除冗余,从而获得较具代表性的特征子集。实验结果表明此种特征选择方法效果良好。

关 键 词:特征选择  文本分类  特征分辨率  差别对象对集  属性约简
收稿时间:2009-10-12
修稿时间:2009-12-2  

Feature selection using feature distinguishability and discernibility object pair set
WU Hong-li,ZHU Hao-dong,ZHOU Rui-qiong.Feature selection using feature distinguishability and discernibility object pair set[J].Computer Engineering and Applications,2010,46(16):160-162.
Authors:WU Hong-li  ZHU Hao-dong  ZHOU Rui-qiong
Affiliation:1.College of Information Science and Technology,Hainan Normal University,Haikou 571158,China 2.Chengdu Institute of Computer Application,Chinese Academy of Sciences,Chengdu 610041,China
Abstract:Feature selection is one of the key steps in text categorization.The selected feature subset directly influences results of text categorization.Firstly,several classic feature selection methods are analyzed simply and their deficiencies are summarized.And then,the concept of feature distinguishability is presented.Next,an attribute reduction algorithm based on discernibility object pair set is provided.Finally,combining the attribute reduction algorithm with feature distinguishability,a new feature selection method is proposed.The new method firstly uses feature distinguishability to select feature and filter out some terms to reduce the sparsity of feature spaces,and then employs the attribute reduction algorithm to eliminate redundancy,so that the feature subsets which are more representative are acquired.The experimental results show that the new method is promising.
Keywords:feature selection  text categorization  feature distinguishability  discernibility object pair set  attribute reduction
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号