首页 | 官方网站   微博 | 高级检索  
     

分布式图处理系统技术综述
引用本文:王童童,荣垂田,卢卫,杜小勇.分布式图处理系统技术综述[J].软件学报,2018,29(3):569-586.
作者姓名:王童童  荣垂田  卢卫  杜小勇
作者单位:数据工程与知识工程教育部重点实验室(中国人民大学), 北京 100872;中国人民大学 信息学院, 北京 100872,天津工业大学 计算机科学与软件学院, 天津 300387,数据工程与知识工程教育部重点实验室(中国人民大学), 北京 100872;中国人民大学 信息学院, 北京 100872,数据工程与知识工程教育部重点实验室(中国人民大学), 北京 100872;中国人民大学 信息学院, 北京 100872
基金项目:国家自然科学基金项目(61502504,61402329,61732014,61472321);中国人民大学科学研究基金(中央高校基本科研业务费专项资金资助)项目成果(15XNLF09)收稿时间:2017-08-01;修改时间:2017-09-05
摘    要:图作为一种基本的数据类型,是对现实世界中对象及其关联关系的一种抽象.现实中许多的科学问题都可以被模型化为图的问题,因此对图数据进行分析非常的重要.图数据分析在语义web分析、社交网络、生物基因分析以及信息检索等领域有着广泛的应用.随着移动互联、物联网等信息技术的发展,图数据的规模处于持续增长的状态.为了能够应对大规模图数据的高效分析和计算,谷歌提出了Pregel分布式图处理框架,此后学术界和工业界提出了许多基于Pregel框架的优化技术和系统实现.在充分调研和分析的基础上,本文首先总结出分布式图处理系统的3个优化目标;其次,论文从计算粒度、任务调度、通信方式、负载划分等四个维度,对现有分布式图处理系统中的各类优化技术作一个详细的综述;最后,论文对该领域未来的研究内容和发展方向进行了探讨与展望.

关 键 词:分布式图处理系统  计算粒度  任务调度  通信方式  负载划分
收稿时间:2017/8/1 0:00:00
修稿时间:2017/9/5 0:00:00

Survey on Technologies of Distributed Graph Processing Systems
WANG Tong-Tong,RONG Chui-Tian,LU Wei and DU Xiao-Yong.Survey on Technologies of Distributed Graph Processing Systems[J].Journal of Software,2018,29(3):569-586.
Authors:WANG Tong-Tong  RONG Chui-Tian  LU Wei and DU Xiao-Yong
Affiliation:Key Laboratory of Data Engineering and Knowledge Engineering, MOE(Renmin University of China), Beijing 100872, China;School of Information, Renmin University of China, Beijing 100872, China,School of Computer Science and Software Engineering, Tianjin Polytechnic University Tianjin 300387, China,Key Laboratory of Data Engineering and Knowledge Engineering, MOE(Renmin University of China), Beijing 100872, China;School of Information, Renmin University of China, Beijing 100872, China and Key Laboratory of Data Engineering and Knowledge Engineering, MOE(Renmin University of China), Beijing 100872, China;School of Information, Renmin University of China, Beijing 100872, China
Abstract:Well recognized as a primitive data structure, graph is an abstraction of objects and their pairwise connections. Thus far, there exist a wide spectrum of graph aplications, including semantic web analysis, social network analysis, biological genetic analysis and information retrieval, which can be modeled as graphs. Therefore, it is of great importance to conduct data analysis over these applications. With the development of information technology such as Mobile Internet and Internet of things, the scale of graph data is increasing continuously and rapidly. To do fast analysis over large-scale graph data, Pregel was first proposed as a distributed graph processing framework by Google. Since then, based Pregel framework, a variety of optimization techniques and systems have been proposed by academic and industrial communities. After extensive investigation and analysis, in this paper, we first abstract three optimization objectives for the state-of-the-arts solutions to build distributed graph processing systems. Subsequently, we review mainstream optimizing techniques for the state-of-the-arts solutions from the perspective of computation granularity, task scheduling, communication mode and load balance. Finally, we discuss some open research problems and possible future research directions in this field.
Keywords:distributed graph processing systems  calculating granularity  task scheduling  communication mode  load balancing
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号