
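Research and Design of Data De-duplication Architecture Based on Cloud Storage
Cite this article: SONG Yu, YI Lu, WANG Feng-Xia. Research and Design of Data De-duplication Architecture Based on Cloud Storage [J]. Computer Systems & Applications, 2013, 22(1): 208-211.
Authors: SONG Yu, YI Lu, WANG Feng-Xia
Affiliation: School of Control and Computer Engineering, North China Electric Power University, Baoding 071003, China (all three authors)
Abstract: With the development of cloud computing, cloud storage uses cluster applications, virtualization, and distributed file systems to make large numbers of heterogeneous storage devices across the network work together, relieving the storage pressure on traditional data centers. Data deduplication, in turn, reduces both storage space and network traffic, and as the cloud becomes widely adopted it is bound to be applied to cloud storage as well. Combining the two technologies should bring practical benefits to the IT storage industry. Based on a study of data deduplication and cloud storage, this paper designs a deduplication architecture for cloud storage and proposes a scheme in which the client performs in-line deduplication, combining block-level and byte-level techniques, before the data is stored in the cloud. In this architecture, the bulk data is stored in HDFS, while the hash values of file data blocks are stored in HBase.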

Keywords: data deduplication technology; cloud storage; hash value; HDFS; HBase
Received: 2012-06-21
Revised: 2012-08-06

Research and Design of Data De-duplication Architecture Based on Cloud Storage
SONG Yu, YI Lu, WANG Feng-Xia. Research and Design of Data De-duplication Architecture Based on Cloud Storage [J]. Computer Systems & Applications, 2013, 22(1): 208-211.
Authors: SONG Yu, YI Lu, WANG Feng-Xia
Affiliation: School of Control and Computer Engineering, North China Electric Power University, Baoding 071003, China (all three authors)
Abstract: With the development of cloud computing, cloud storage brings together a large number of heterogeneous networked storage devices and makes them work collaboratively through cluster applications, virtualization, and distributed file systems, relieving the storage pressure on traditional data centers. Data de-duplication, in turn, is a technology that reduces storage space and network traffic, and it is bound to be applied to cloud storage as the cloud becomes widely used. The combination of these two technologies will bring real benefits to the IT storage industry. This paper studies data de-duplication and cloud storage, designs a de-duplication architecture based on cloud storage, and proposes a scheme that runs in-line at the client to eliminate duplicate data at the chunk level combined with the byte level before the data is put into the cloud. Under this architecture, HDFS stores the bulk data while HBase stores the hash values of the file data blocks.
Keywords:data de-duplication technology  cloud storage  hash value  HDFS  HBase
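
The scheme summarized in the abstract (client-side, in-line, chunk-level de-duplication with a hash index kept apart from the bulk data) can be illustrated with a minimal sketch. This is not the authors' implementation. The fixed 4096-byte chunk size, the use of SHA-256, and the names `DedupClient`, `hash_index`, `chunk_store`, and `recipes` are all assumptions made for illustration. Plain dictionaries stand in for HBase (hash index) and HDFS (chunk data), and the byte-level stage mentioned in the abstract is not covered.

```python
import hashlib

CHUNK_SIZE = 4096  # assumed fixed chunk size; the abstract does not specify one


class DedupClient:
    """Illustrative client-side, in-line, chunk-level de-duplication."""

    def __init__(self):
        self.hash_index = {}   # stand-in for HBase: chunk hash -> chunk location
        self.chunk_store = {}  # stand-in for HDFS: chunk location -> chunk bytes
        self.recipes = {}      # file name -> ordered list of chunk hashes

    def put(self, name, data):
        """Store a file and return the number of bytes actually uploaded."""
        uploaded = 0
        recipe = []
        for offset in range(0, len(data), CHUNK_SIZE):
            chunk = data[offset:offset + CHUNK_SIZE]
            digest = hashlib.sha256(chunk).hexdigest()  # assumed hash function
            if digest not in self.hash_index:
                # New chunk: upload it and record its hash in the index.
                location = len(self.chunk_store)
                self.chunk_store[location] = chunk
                self.hash_index[digest] = location
                uploaded += len(chunk)
            # Duplicate chunks are only referenced, never uploaded again.
            recipe.append(digest)
        self.recipes[name] = recipe
        return uploaded

    def get(self, name):
        """Rebuild a file from its ordered list of chunk hashes."""
        return b"".join(
            self.chunk_store[self.hash_index[digest]]
            for digest in self.recipes[name]
        )
```

In this sketch the hash lookup happens before any chunk is sent, which is what makes the approach in-line: a chunk whose hash is already indexed costs no storage and no transfer beyond the lookup itself.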
This article is indexed in CNKI and other databases.