首页 | 官方网站   微博 | 高级检索  
     

结合ICA和SVM进行蛋白质氧链糖基化位点的预测
引用本文:杨雪梅.结合ICA和SVM进行蛋白质氧链糖基化位点的预测[J].计算机与数字工程,2012,40(8):32-34,41.
作者姓名:杨雪梅
作者单位:咸阳师范学院数学与信息科学学院 咸阳 712000
基金项目:陕西省教育厅科学研究计划项目
摘    要:为了提高蛋白质氧链糖基化位点的预测准确率,提出了把独立成分分析和支持向量机相结合的方法。实验样本(蛋白质序列)用稀疏编码方式编码,窗口长度为w=21,对于训练样本和待测样本,首先用独立成分分析法(ICA)提取了120个独立成分(特征),把这些独立成分作为支持向量机的输入,在特征空间用支持向量机(SVM)进行预测(分类)。实验结果表明,ICA+SVM的方法比PCA+SVM和SVM的好。预测准确率为88%。更进一步,用同一个蛋白质序列在不同窗口长度下的样本做实验,结果表明,窗口长度越长,预测准确率越高。

关 键 词:蛋白质  糖基化  预测准确率  独立成分分析  支持向量机

Prediction of the Protein O-Glycosylation by Support Vector Machine Based on Independent Component Analysis
YANG Xuemei.Prediction of the Protein O-Glycosylation by Support Vector Machine Based on Independent Component Analysis[J].Computer and Digital Engineering,2012,40(8):32-34,41.
Authors:YANG Xuemei
Affiliation:YANG Xuemei(School of Mathematics and Information Science,Xianyang Normal University,Xianyang 712000)
Abstract:To improve the prediction accuracy of O-glycosylation sites,a new method of ICA+SVM is proposed.The samples(protein sequence) for experiment are encoded by the sparse coding with window size w=21,120 independent components(feature) are extracted by independent component analysis(ICA),then the prediction(classification) is done in feature space by support vector machines(SVM).The results of experiment show that the performance of ICA+SVM is better than that of PCA+SVM and SVM.The prediction accuracy is about 88%.Furthermore,we investigated the same protein sequence under various window size,the results indicate that the longer the length of protein sequence,the higher the prediction accuracy.
Keywords:protein  glycosylation  prediction accuracy  ICA  SVM
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号