
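Data Synchronization Tool for Distributed Heterogeneous Database
Cite this article: XU Zi-Jian, YE Sheng, ZHANG Xiao. Data Synchronization Tool for Distributed Heterogeneous Database[J]. Journal of Software, 2019, 30(3): 684-699.
Authors: XU Zi-Jian, YE Sheng, ZHANG Xiao
Affiliation (all authors): Key Laboratory of Data Engineering and Knowledge Engineering of the Ministry of Education (Renmin University of China), Beijing 100872; School of Information, Renmin University of China, Beijing 100872
Funding: National Key Research and Development Program of China (2018YFB1004401); National Natural Science Foundation of China (61732014); Beijing Municipal Science and Technology Project (Z171100005117002)
Abstract: In general, read-write separation can solve part of the mismatch between read and write speeds in today's big data environments, but existing read-write separation techniques are mainly solutions for homogeneous databases. Because their storage structures differ, a heterogeneous distributed database system composed of a row-store database and a column-store database faces difficulties during data synchronization that a homogeneous distributed database system does not, such as format conversion and mismatched synchronization speeds. This paper proposes TD-Reduction, a method that restores SQL from the MySQL binary log (Binlog), and designs and implements a Binlog parser, BinParser, and a Binlog restorer, BinReducer. Working on Mixed-format Binlog, they parse the log according to the type of each event and restore it under the corresponding rules to generate executable SQL statements. Building on these techniques, the paper implements Cynomys, a data synchronization tool for distributed databases. In the experimental environment, Cynomys shows good performance. The method applies to data synchronization between all other heterogeneous databases that have a mechanism similar to Binlog.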

Keywords: data synchronization; read-write separation; SQL restoration
Received: 2018/7/20
Revised: 2018/9/20

Data Synchronization Tool for Distributed Heterogeneous Database
XU Zi-Jian, YE Sheng and ZHANG Xiao. Data Synchronization Tool for Distributed Heterogeneous Database[J]. Journal of Software, 2019, 30(3): 684-699.
Authors: XU Zi-Jian, YE Sheng and ZHANG Xiao
Affiliation (all authors): Key Laboratory of Data Engineering and Knowledge Engineering of the Ministry of Education (Renmin University of China), Beijing 100872, China; School of Information, Renmin University of China, Beijing 100872, China
Abstract: In general, read-write separation can solve part of the mismatch between read and write speeds in the current big data environment, but most existing read-write separation techniques are designed for homogeneous databases. Because their storage structures are inconsistent, heterogeneous distributed database systems composed of a row-store database and a column-store database face difficulties in data synchronization that homogeneous distributed database systems do not, such as format conversion and mismatched synchronization speeds. This study proposes TD-Reduction, a method that restores SQL from the MySQL binary log (Binlog). It designs and implements a Binlog parser, BinParser, and a Binlog restorer, BinReducer, which operate on Mixed-format Binlog: they parse the log according to the type of each event and restore it under the corresponding rules to generate executable SQL statements. Based on these techniques, the study implements Cynomys, a data synchronization tool for distributed databases. In the experimental environment, Cynomys shows sound performance. The method is suitable for data synchronization between all other heterogeneous databases that have a mechanism similar to Binlog.
Keywords: data synchronization; read/write separation; SQL reduction
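
The abstract describes restoring executable SQL from Binlog events by applying a rule per event type. The sketch below illustrates that general idea only; it is not the paper's BinParser or BinReducer, and the actual TD-Reduction rules are not given on this page. It assumes events have already been decoded into plain dictionaries. The record layout, the field names (`type`, `schema`, `table`, `row`, `before`, `after`, `sql`) and the functions `sql_literal`, `where_clause` and `restore_sql` are all hypothetical. The one grounded fact it relies on is that Mixed-format Binlog contains both statement events, which already carry SQL text, and row events, which carry row images.

```python
# Illustrative sketch only: hypothetical, already-decoded event records.
# Real Binlog events are binary and carry much more metadata.

def sql_literal(value):
    """Render a Python value as a SQL literal (simplified escaping)."""
    if value is None:
        return "NULL"
    if isinstance(value, (int, float)):
        return str(value)
    # Double single quotes; a real tool must handle more cases.
    return "'" + str(value).replace("'", "''") + "'"


def where_clause(row):
    """Build a WHERE clause that matches a full row image."""
    parts = []
    for col, val in row.items():
        if val is None:
            parts.append(f"`{col}` IS NULL")
        else:
            parts.append(f"`{col}` = {sql_literal(val)}")
    return " AND ".join(parts)


def restore_sql(event):
    """Turn one decoded event into an executable SQL statement."""
    kind = event["type"]
    if kind == "query":
        # Statement events already contain the SQL text.
        return event["sql"]
    table = f"`{event['schema']}`.`{event['table']}`"
    if kind == "write_rows":
        row = event["row"]
        cols = ", ".join(f"`{c}`" for c in row)
        vals = ", ".join(sql_literal(v) for v in row.values())
        return f"INSERT INTO {table} ({cols}) VALUES ({vals})"
    if kind == "update_rows":
        sets = ", ".join(
            f"`{c}` = {sql_literal(v)}" for c, v in event["after"].items()
        )
        return f"UPDATE {table} SET {sets} WHERE {where_clause(event['before'])}"
    if kind == "delete_rows":
        return f"DELETE FROM {table} WHERE {where_clause(event['row'])}"
    raise ValueError(f"unsupported event type: {kind}")
```

For example, calling `restore_sql` on a `write_rows` record for table `orders` yields an `INSERT` statement that could then be replayed on the target database. Adapting the generated SQL to a column-store target's dialect is a separate step that this sketch does not attempt.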