Distributed CIF Quadtree Indexing Method Based on Hadoop
Cite this article: XU Huan, FENG Jun, ZHANG Peng-cheng, TANG Zhi-xian, LIU Yi, CHEN Zhi-fei, ZHANG Li-xia. Distributed CIF Quadtree Indexing Method Based on Hadoop[J]. Computer and Modernization, 2016, 0(11): 12. DOI: 10.3969/j.issn.1006-2475.2016.11.003
Authors: XU Huan, FENG Jun, ZHANG Peng-cheng, TANG Zhi-xian, LIU Yi, CHEN Zhi-fei, ZHANG Li-xia
Funding: National Natural Science Foundation of China, General Program (61370091); National Key Technology R&D Program of China (2015BAB07B00)
Abstract: For rectangular spatial data objects, we design parallel algorithms for index creation, intersection query, and region deletion that are suited to distributed environments. The algorithms build on the traditional CIF quadtree indexing technique, use the Hadoop platform and the MapReduce parallel programming model, and follow a divide-and-conquer approach that partitions the data space. On this basis, we run experiments that vary the number of rectangle objects in the data set and the number of map tasks, and analyze the efficiency of parallel index creation and intersection queries. The experimental results show that, for large data sets and for multiple data sets, parallel creation and querying improve processing efficiency.

Keywords: Hadoop; MapReduce; CIF quadtree; distributed environment; parallel algorithm
Received: 2016-11-23
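
The abstract does not give the algorithms themselves, so the following is only a minimal single-machine sketch of the general idea, not the authors' method. It assumes the usual CIF quadtree rule (each rectangle is stored at the smallest quadrant that fully contains it) and imitates the divide-and-conquer step by grouping rectangles by top-level quadrant before building one subtree per group. All names (`Node`, `partition_key`, `build_partitioned`, `query_partitioned`) and the `max_depth` limit are hypothetical, and no Hadoop API is used.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

Rect = Tuple[float, float, float, float]  # (xmin, ymin, xmax, ymax)


def intersects(a: Rect, b: Rect) -> bool:
    """True if the two closed rectangles share at least one point."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def contains(outer: Rect, inner: Rect) -> bool:
    """True if inner lies entirely within outer."""
    return (outer[0] <= inner[0] and outer[1] <= inner[1]
            and inner[2] <= outer[2] and inner[3] <= outer[3])


@dataclass
class Node:
    bounds: Rect
    depth: int = 0
    items: List[Rect] = field(default_factory=list)
    children: Optional[List["Node"]] = None

    def quadrants(self) -> List[Rect]:
        x0, y0, x1, y1 = self.bounds
        mx, my = (x0 + x1) / 2, (y0 + y1) / 2
        return [(x0, y0, mx, my), (mx, y0, x1, my),
                (x0, my, mx, y1), (mx, my, x1, y1)]

    def insert(self, r: Rect, max_depth: int = 8) -> None:
        # Descend while some quadrant fully contains r; otherwise store r here.
        if self.depth < max_depth:
            for i, q in enumerate(self.quadrants()):
                if contains(q, r):
                    if self.children is None:
                        self.children = [Node(b, self.depth + 1)
                                         for b in self.quadrants()]
                    self.children[i].insert(r, max_depth)
                    return
        self.items.append(r)

    def query(self, window: Rect) -> List[Rect]:
        # Intersection query: check local items, then only overlapping children.
        hits = [r for r in self.items if intersects(r, window)]
        for child in self.children or []:
            if intersects(child.bounds, window):
                hits.extend(child.query(window))
        return hits


def partition_key(space: Rect, r: Rect) -> int:
    """Index of the top-level quadrant containing r, or -1 if r straddles."""
    for i, q in enumerate(Node(space).quadrants()):
        if contains(q, r):
            return i
    return -1


def build_partitioned(space: Rect, rects: List[Rect]) -> Dict[int, Node]:
    # "Map" phase (simulated): group rectangles by partition key.
    groups: Dict[int, List[Rect]] = {}
    for r in rects:
        groups.setdefault(partition_key(space, r), []).append(r)
    # "Reduce" phase (simulated): build one independent subtree per key.
    trees: Dict[int, Node] = {}
    for key, group in groups.items():
        bounds = space if key == -1 else Node(space).quadrants()[key]
        tree = Node(bounds)
        for r in group:
            tree.insert(r)
        trees[key] = tree
    return trees


def query_partitioned(trees: Dict[int, Node], window: Rect) -> List[Rect]:
    hits: List[Rect] = []
    for tree in trees.values():
        if intersects(tree.bounds, window):
            hits.extend(tree.query(window))
    return hits
```

In this sketch, rectangles that straddle the top-level partition boundaries are kept in a separate tree under key `-1`, so every query must also consult that tree. How the paper handles such objects is not stated in the abstract.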