首页 | 官方网站   微博 | 高级检索  
     

基于最小联合互信息亏损的最优特征选择算法
引用本文:张逸石,陈传波. 基于最小联合互信息亏损的最优特征选择算法[J]. 计算机科学, 2011, 38(12): 200-205
作者姓名:张逸石  陈传波
作者单位:1. 华中科技大学软件学院 武汉430074
2. 华中科技大学软件学院 武汉430074;华中科技大学计算机科学与技术学院 武汉430074
基金项目:国家自然科学基金项目(60973085)资助
摘    要:提出了一种基于最小联合互信息亏损的最优特征选择算法。该算法首先通过一种动态渐增策略搜索一个特征全集的无差异特征子集,并基于最小条件互信息原则在保证每一步中联合互信息量亏损都最小的情况下筛选其中的冗余特征,从而得到一个近似最优特征子集。针对现有基于条件互信息的条件独立性测试方法在高维特征域上所面临的效率瓶颈问题,给出了一种用于估计条件互信息的快速实现方法,并将其用于所提算法的实现。分类实验结果表明,所提算法优于经典的特征选择算法。此外,执行效率实验结果表明,所提条件互信息的快速实现方法在执行效率上有着显著的优势。

关 键 词:特征选择,条件互信息,最小联合互信息亏损,快速实现

Minimum Joint Mutual Information Loss-based Optimal Feature Selection Algorithm
ZHANG Yi-shi,CHEN Chuan-bo. Minimum Joint Mutual Information Loss-based Optimal Feature Selection Algorithm[J]. Computer Science, 2011, 38(12): 200-205
Authors:ZHANG Yi-shi  CHEN Chuan-bo
Affiliation:ZHANG Yi-shi1 CHEN Chuan-bo1,2(School of Software Engineering,Huazhong University of Science and Technology,Wuhan 430074,China)1(School of Computer Science and Technology,China)2
Abstract:In this paper, a minimum joint mutual information loss-based optimal feature selection algorithm was proposed,which firstly finds a non-discriminate feature subset of the original set via a dynamic incremental searching stratagy,and then eliminates false positives by keeping minimum joint mutual information loss with class in each iteration using a minimal conditional mutual information criterion, in such a way as to obtain an approximate optimal feature subset. Furthermore, for the computationally intractable problem arising in high dimensional feature space that characterizes the existing method of conditional independence test with conditional mutual information, a fast implementation of conditional mutual information estimation was introduced and used to implement the proposed algorithm. Experimental resups for the classification task show that the proposed algorithm performs better than the representative feature selection algorithms. Experimental results for the execution task show that the proposed implementation of conditional mutual information estimation has a considerable advantage.
Keywords:Feature selection  Conditional mutual information  Minimum joint mutual information loss  Fast implementation
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机科学》浏览原始摘要信息
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号