首页 | 官方网站   微博 | 高级检索  
     

基于交互信息的混合特征选择算法
引用本文:姜文煊,段友祥,孙歧峰. 基于交互信息的混合特征选择算法[J]. 应用科学学报, 2021, 39(4): 545-558. DOI: 10.3969/j.issn.0255-8297.2021.04.003
作者姓名:姜文煊  段友祥  孙歧峰
作者单位:中国石油大学 (华东) 计算机科学与技术学院, 山东 青岛 266580
基金项目:国家科技重大专项基金(No.2017ZX05009-001,No.2016ZX05011-002);中央高校基本科研业务费项目基金(No.18CX02020A)资助
摘    要:针对传统的特征选择算法只专注于特征间的相关性和冗余性而没有考虑特征之间交互作用的问题,提出一种基于交互信息的混合特征选择(hybrid feature selection based on mutual information,MIHFS)算法,该算法以K-最近邻算法的分类准确率作为衡量所选特征分类性能的评价指标,有效...

关 键 词:特征选择  交互信息  混合特征选择  K-最近邻  灰色关联分析法  逼近理想解的排序技术
收稿时间:2020-08-28

Hybrid Feature Selection Algorithm Based on Mutual Information
JIANG Wenxuan,DUAN Youxiang,SUN Qifeng. Hybrid Feature Selection Algorithm Based on Mutual Information[J]. Journal of Applied Sciences, 2021, 39(4): 545-558. DOI: 10.3969/j.issn.0255-8297.2021.04.003
Authors:JIANG Wenxuan  DUAN Youxiang  SUN Qifeng
Affiliation:School of Computer Science and Technology, China University of Petroleum (East China), Qingdao 266580, Shandong, China
Abstract:Traditional feature selection algorithms only focus on feature correlation and feature redundancy without considering the interaction between features. This paper proposes a hybrid feature selection based on mutual information (MIHFS) algorithm. The algorithm takes the classification accuracy of K-nearest neighbor (KNN) algorithm as evaluation index to evaluate the classification performance of selected features, effectively removes redundant and irrelevant features, and retains the interactive features. In order to evaluate the performance of the proposed algorithm, the classification accuracy, the number of selected features and the stability of the algorithm are compared with seven other feature selection algorithms such as minimal redundancy maximal relevance (mRMR) and joint mutual information (JMI) in eight datasets. Experimental results show that the MIHFS algorithm has strong stability, which not only effectively reduces the dimension of feature space, but also has better classification performance than other feature selection algorithms. Finally, in combination with grey relation analysis (GRA) method-technique for order preference by similarity to ideal solution (TOPSIS) method, MIHFS algorithm is applied to the geological evaluation of the first member of Dainan Formation at Yong’an Area, Gaoyou Sag. Experimental results show that MIHFS algorithm performs an evaluation accuracy of 80% with high reliability, and this is basically consistent with actual drilling results and proves the effectiveness of MIHFS in oil and gas geological evaluation.
Keywords:feature selection  interactive information  hybrid feature selection  K-nearest neighbor (KNN)  gray relation analysis (GRA) method  technique for order preference by similarity to ideal solution (TOPSIS)  
本文献已被 CNKI 等数据库收录!
点击此处可从《应用科学学报》浏览原始摘要信息
点击此处可从《应用科学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号