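Feature Selection Based on Mutual Information and Rough Set Theory (基于互信息和粗糙集理论的特征选择)
Cite this article: 朱颢东, 李红婵. 基于互信息和粗糙集理论的特征选择[J]. 计算机工程, 2011, 37(15): 181-183. DOI: 10.3969/j.issn.1000-3428.2011.15.057
Authors (Chinese names): 朱颢东, 李红婵
Affiliation (Chinese record): School of Computer and Communication Engineering, Zhengzhou University of Light Industry (郑州轻工业学院计算机与通信工程学院), Zhengzhou 450002
Funding: Henan Province Basic and Frontier Technology Research Program; Doctoral Research Fund of Zhengzhou University of Light Industry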
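Abstract (translated from the Chinese): To address the limited accuracy of the mutual information method, rough set theory is introduced and an attribute reduction algorithm based on relation product theory (关系积理论) is given. On this basis, a feature selection method suitable for massive text data sets is proposed. The method uses mutual information for initial feature selection and then applies the proposed attribute reduction algorithm to eliminate redundancy, yielding a more representative feature subset. Experimental results show that the method obtains feature subsets that have low redundancy and are more representative.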

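Keywords (translated from the Chinese): feature selection; mutual information; rough set; relation product theory (关系积理论); attribute reduction
Received: 2011-01-13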

Feature Selection Based on Mutual Information and Rough Set Theory
ZHU Hao-dong, LI Hong-chan. Feature Selection Based on Mutual Information and Rough Set Theory[J]. Computer Engineering, 2011, 37(15): 181-183. DOI: 10.3969/j.issn.1000-3428.2011.15.057
Authors: ZHU Hao-dong, LI Hong-chan
Affiliation: School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China
Abstract: Feature selection is a research hotspot in automatic text categorization. This paper analyzes Mutual Information (MI) and, to address its deficiencies, introduces Rough Set (RS) theory and proposes an attribute reduction algorithm based on relation union theory. A feature selection method built on MI and the proposed attribute reduction algorithm is then presented; it is suitable for massive text data sets. The method first uses MI to select features and then applies the proposed attribute reduction algorithm to eliminate redundancy, so that it obtains more representative feature subsets. Experimental results show that the method is promising.
Keywords: feature selection; Mutual Information (MI); Rough Set (RS); relation union theory; attribute reduction
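
The MI-based selection step mentioned in the abstract can be illustrated with a minimal sketch. The abstract gives no formula, so this assumes the pointwise estimate commonly used in text categorization, MI(t, c) = log(P(t, c) / (P(t) P(c))), computed from document counts and combined across categories by taking the maximum. The names `mi_scores` and `preselect` and the toy data are hypothetical. The sketch does not attempt the paper's attribute reduction algorithm, which the abstract does not specify.

```python
import math
from collections import Counter


def mi_scores(docs, labels):
    """Score each term by its maximum pointwise MI over the categories it occurs in.

    docs:   list of token lists, one per document
    labels: list of category names, same length as docs
    """
    n = len(docs)
    class_count = Counter(labels)   # N(c): documents in category c
    term_count = Counter()          # N(t): documents containing term t
    joint_count = Counter()         # N(t, c): documents in c containing t
    for tokens, label in zip(docs, labels):
        for term in set(tokens):    # count document frequency, not term frequency
            term_count[term] += 1
            joint_count[(term, label)] += 1

    scores = {}
    for (term, label), n_tc in joint_count.items():
        # MI(t, c) = log(P(t, c) / (P(t) P(c))) = log(N(t, c) * N / (N(t) * N(c)))
        mi = math.log(n_tc * n / (term_count[term] * class_count[label]))
        scores[term] = max(scores.get(term, float("-inf")), mi)
    return scores


def preselect(docs, labels, k):
    """Return the k highest-scoring terms (ties broken alphabetically)."""
    scores = mi_scores(docs, labels)
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


# Toy corpus (made up for illustration only).
docs = [
    ["goal", "match", "the"],
    ["match", "team", "the"],
    ["stock", "market", "the"],
    ["market", "price", "the"],
]
labels = ["sport", "sport", "finance", "finance"]
selected = preselect(docs, labels, 6)
```

On this toy corpus the term "the" occurs in every document and is dropped, while the category-specific terms are kept. A redundancy-elimination pass such as the one the abstract describes would then operate on the selected terms.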
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.