首页 | 官方网站   微博 | 高级检索  
     

基于条件互信息的特征选择改进算法
引用本文:刘海燕,王超,牛军钰. 基于条件互信息的特征选择改进算法[J]. 计算机工程, 2012, 38(14): 135-137
作者姓名:刘海燕  王超  牛军钰
作者单位:复旦大学计算机科学技术学院,上海,201203
基金项目:国家“863”计划基金资助项目
摘    要:针对传统特征选择算法只专注于特征类相关性或者特征冗余性的问题,提出一种基于条件互信息的特征选择算法。该算法采用k-means的基本思想聚类特征,并从中选出类相关度最大的特征,从而去除不相关和冗余特征。实验使用5个数据集,结果表明,该算法的分类性能优于传统特征选择算法。

关 键 词:数据挖掘  特征选择  互信息  条件互信息  聚类  度量距离
收稿时间:2011-11-21

Improved Feature Selection Algorithm Based on Conditional Mutual Information
LIU Hai-yan , WANG Chao , NIU Jun-yu. Improved Feature Selection Algorithm Based on Conditional Mutual Information[J]. Computer Engineering, 2012, 38(14): 135-137
Authors:LIU Hai-yan    WANG Chao    NIU Jun-yu
Affiliation:(School of Computer Science,Fudan University,Shanghai 201203,China)
Abstract:Aiming at the shortcomings of traditional feature selection which are neglect of relevancy to the class and redundancy to the feature,this paper introduces a feature selection algorithm based on conditional mutual information.The algorithm clusters interdependent features into clusters and selects one feature which has maximum mutual information with class,the irrelevant and redundant features are removed.Experimental results show that the method is prior to traditional feature selection from the point of view of classification accuracy.
Keywords:data mining  feature selection  mutual information  conditional mutual information  clustering  metric distance
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号