首页 | 官方网站   微博 | 高级检索  
     

基于特征路径的XML文档变化检测算法
引用本文:徐海渊,吴泉源,王怀民,贾焰.基于特征路径的XML文档变化检测算法[J].计算机研究与发展,2003,40(9):1375-1381.
作者姓名:徐海渊  吴泉源  王怀民  贾焰
作者单位:国防科学技术大学计算机学院,长沙,410073
基金项目:国家"八六三"高技术研究发展计划基金项目(2002AA116040)
摘    要:由于在线信息变化频繁,XML文档变化快速检测成为Internet查询系统、搜索引擎以及连续查询系统的关键技术。目前国际上的研究主要集中于有序模式的XML文档比较,针对有序模式最好的算法复杂度为O(nkgn),其中n为文档的长度,而针对无序模式为多项式时间复杂度,为提高处理效率,提出一种基于特征路径的变化检测算法,将传统标号树匹配问题转换为基于特征路径的无重复路径标号树的匹配问题,同时适于有序和无序两种模式,复杂度为O(n),其中n为文档结点的个数.实验证明KF-Diff 能够非常高效地比较XML文档。

关 键 词:XML  Delta  Key  算法

A Key-Path-Based Change Detection Algorithm for XML Documents
XU Hai Yuan,WU Quan Yuan,WANG Huai Min,and JIA Yan.A Key-Path-Based Change Detection Algorithm for XML Documents[J].Journal of Computer Research and Development,2003,40(9):1375-1381.
Authors:XU Hai Yuan  WU Quan Yuan  WANG Huai Min  and JIA Yan
Abstract:Since online information changes frequently, being able to quickly detect changes in XML documents is important to Internet query systems, search engines, and continuous query systems Most previous work in change detection on XML or other hierarchically structured data uses the ordered tree model, with the best complexity of O(n log n ), where n is the size of the document The best algorithm for unordered mode achieves polynomial time in complexity In this paper, a Key path based change detection algorithm named KF Diff+is represented, which transforms the traditional tree to tree correction into the comparing of the Key trees which are substantially label trees without duplicate paths, with the complexity of O(n) , where n is the total number of nodes in the documents In addition, KF Diff+is tailored to both ordered trees and unordered trees, which is also a significant difference compared with the previous work Experiments show that KF Diff+can handle XML documents at extreme speed
Keywords:XML  delta  Key  algorithm  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号