首页 | 官方网站   微博 | 高级检索  
     

融合多数据源的蛋白质功能模块的挖掘算法
引用本文:张媛,贾克斌,张爱冬.融合多数据源的蛋白质功能模块的挖掘算法[J].北京工业大学学报,2014,40(6):837-842.
作者姓名:张媛  贾克斌  张爱冬
作者单位:1. 北京工业大学电子信息与控制工程学院,北京 100124;Computer Science and Engineering Department, The State University of New York at Buffalo, Buffalo, NY, 14260, USA
2. 北京工业大学电子信息与控制工程学院,北京,100124
3. Computer Science and Engineering Department, The State University of New York at Buffalo, Buffalo, NY, 14260, USA
基金项目:国家自然科学基金资助项目,教育部博士点基金资助项目
摘    要:针对蛋白质相互作用(protein-protein interaction,PPI)网络的信息不完善和高噪声问题,提出一种融合多生物数据的二分图聚类集成方法以检测网络中的功能模块.该方法结合了基因本体论(gene ontology,GO)、基因表达谱数据以及多种基础聚类算法,用一种新的二分图来组织多种基础聚类算法的中间结果,并结合对称非负矩阵分解(non-negative matrix factorization,NMF)算法挖掘其中功能意义上最一致蛋白质功能模块,同时,该算法能处理蛋白质功能重叠问题.实验结果表明:所提算法整体优于基准比较方法,是一种融合多种生物信息源和不同的聚类方法的有效途径.

关 键 词:蛋白质相互作用网络  网络模块挖掘  多数据集成  聚类集成  可重叠聚类

Bipartite Graph-based Integrative Method to Detect Consistent Protein Functional Modules from Multiple Sources
ZHANG Yuan , JIA Ke-bin , ZHANG Ai-dong.Bipartite Graph-based Integrative Method to Detect Consistent Protein Functional Modules from Multiple Sources[J].Journal of Beijing Polytechnic University,2014,40(6):837-842.
Authors:ZHANG Yuan  JIA Ke-bin  ZHANG Ai-dong
Abstract:A bipartite graph-based cluster ensemble method that integrates gene ontology( GO) and gene expression data with protein-protein interaction( PPI) networks is proposed. In this method,all different views of biological information and three basic clustering methods are contributed to a bipartite graph that comprehensively represents the relationships between the objects in this problem,including the proteins and the meta-clusters from the basic cluster methods. Furthermore,consistent modules are extracted using a symmetric non-negative matrix factorization( NMF)-based graph partition method and overlapping results are achieved. Extensive experimental results show that this method is superior to the baseline methods; further analysis is addressed to discuss the benefits of integrating multiple biological information sources and diverse clustering methods.
Keywords:protein-protein interaction (PPI) network  functional module detection  multiple data sources integration  cluster ensemble  soft clustering
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号