首页 | 官方网站   微博 | 高级检索  
     

基于PCA和信息增益的肿瘤特征基因选择方法
引用本文:徐久成,黄方舟,穆辉宇,王云,徐战威.基于PCA和信息增益的肿瘤特征基因选择方法[J].河南师范大学学报(自然科学版),2018(2).
作者姓名:徐久成  黄方舟  穆辉宇  王云  徐战威
作者单位:河南师范大学计算机与信息工程学院河南省高校计算智能与数据挖掘工程技术研究中心;
摘    要:针对肿瘤基因数据因维度高和冗余基因较多而导致分类精度低的问题,提出一种基于PCA和信息增益的肿瘤特征基因选择方法.该方法首先使用PCA算法剔除冗余基因,获得预选特征基因子集;然后利用信息增益算法对预选特征基因子集进行优化选取,得到特征基因子集;最后采用不同分类模型对特征基因子集进行仿真实验.实验结果表明,所提方法提高了基因表达谱的分类精度,从而表明致病基因被有效地选取出来.

关 键 词:基因分类  主成分分析  信息增益  特征选择

Tumor feature gene selection method based on PCA and information gain
Affiliation:,College of Computer and Information Engineering,Henan Technology Research Center for Computational Intelligence and Data Mining,Henan Normal University
Abstract:Aiming at the low classification accuracy of tumor genetic data with the characterstic of high dimensional and unrelated genes,a tumor feature gene selection method based on PCA and information gain is proposed.Firstly,the PCA algorithm is used to eliminate miscellaneous genes and select the preselected feature gene subset in this method.Then,the information gain algorithm is used to optimize the subset of the preselected feature gene subset,and the feature gene subset is obtained.Finally,different sorting algorithms are used to simulate the feature gene subset.The experimental results show that the method proposed in this paper improves the classification accuracy of gene expression profile,thus indicating that the pathogenic gene is effectively selected.
Keywords:gene classification  PCA  information gain  feature selection
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号