首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到17条相似文献,搜索用时 144 毫秒
1.
通过对有序XML文档进行编码,在不需要访问XML原始数据文件的条件下实现对XML数据的高效处理。但是目前提出的支持插入更新的编码方案存在牺牲查询性能或者编码空间偏大等问题。提出了一种基于素数的新的编码方案FOP(Float-Order based-on Prime),FOP在没有降低查询性能的前提下,实现了XML文档的插入更新计算,并且编码空间得到了控制。实验结果表明FOP优于同类型的编码方案。  相似文献   

2.
一种新的XML文档更新计算   总被引:1,自引:1,他引:0       下载免费PDF全文
当对XML文档进行插入操作时面临调整编码问题,目前提出的很多编码方案不能同时很好地支持XPath查询和XML文档更新。在分析现有编码方案的基础上,提出了基于完全树的编码方案,该编码方案采用序号冗余和虚拟节点两种冗余方式,不仅支持XPath的查询,而且能有效降低因插入节点需要对XML文档进行二次编码率。实验结果表明,完全树以及相应编码有效提高了XML文档插入节点的效率。  相似文献   

3.
摘  要:在XML查询中,为了快速判断节点之间祖先后裔关系和双亲孩子关系,提出了一些编码方案,然而,当对顺序敏感的有序XML文档进行插入更新操作的时候,现有的编码方案必须重新对存在的节点进行编码或者重新计算节点编码的值,导致了很高的更新代价。在路径编码方法的基础上,提出了一种新的编码方案BSEPS(Binary String Encoding based on Path Scheme),该方案支持在不重新编码或者重新计算的情况下完成顺序敏感的插入更新操作。实验结果表明,BSEPS编码能有效处理顺序敏感查询和叶节点/子树更新。  相似文献   

4.
基于分治策略的XML文档更新计算   总被引:1,自引:0,他引:1  
XML文档更新是XML数据管理的重要功能,对XML文档进行插入更新操作时面临编码调整问题,目前大部分编码方案并不能很好支持文档更新。在分析现有编码方案基础上,提出了OMPN(order, maxOrder, parentOrder, nextOrder)编码方案,采用分治策略和迟后更新方法,在插入节点过程中不需要调整编码,在计算资源空闲时,只需调整部分编码就可以完成对XML文档的更新。实验结果表明,基于分治策略和迟后更新方法提高了更新XML文档的性能。  相似文献   

5.
基于扩展Dewey编码的XML文档更新计算   总被引:1,自引:0,他引:1  
针对在进行XML文档插入、删除等更新时因需要频繁调整相应的子树编码而导致更新低效问题.分析了低效的原因,提出了一种扩展的Dewey编码方案.该编码方案对Dewey编码的连续性进行扩展,使兄弟节点之间的编码不再连续,为插入新节点准备冗余空间.在此基础上,设计了支持该扩展编码方案的XML文档节点插入算法和分裂算法.实验结果表明,提出的扩展Dewey编码方案有效地实现了XML文档的更新计算.  相似文献   

6.
任家东  尹晓鹏 《计算机工程》2006,32(18):79-80,8
为了提高查询效率,许多XML文档编码方案相继被提出。目前大部分编码方案并不能很好地支持文档更新。在分析比较现有编码方案的基础上,提出了一种新的动态编码方案(DNS)。该方案用实数表示XML文档树中的节点编码,能够利用连续数值间的区域为新插入的节点或子树编码,并能够根据文档的更新情况动态调整部分节点的编码。  相似文献   

7.
覃遵跃  蔡国民  张彬连  汤庸 《计算机科学》2015,42(2):157-160,181
对有序XML文档树进行编码,不需要访问XML原始文件就能够实现对XML数据的管理,提高了XML管理系统的效率。针对查询提出的编码方案具有很高的查询性能,但更新效率很低。为提高更新性能而设计的方案存在查询效率低或者编码空间大等问题。为了在提高更新XML文档效率的同时不对查询性能和编码空间产生负面影响,提出了一种新的编码方法VEMBP(Vector Encoding Method Based of Prime),该方法利用向量表示有序XML节点之间的顺序关系,采用素数表示有序XML文档节点之间的结构信息;并设计了一种算法来实现在没有牺牲查询性能的前提下完全避免更新过程中的二次编码和重新计算,降低了更新代价,同时编码空间也得到了控制。实验结果显示,VEMBP具有较好的查询和更新性能。  相似文献   

8.
针对可扩展标记语言(XML)数据的查询与更新问题,提出一种基于左儿子右兄弟节点链式关联的XML动态编码方案。通过左儿子右兄弟节点的链式相关信息,仅需在局部做简单的若干改动,就可实现XML数据的更新,并能方便快速地实现祖先后裔关系、父子关系和兄弟关系等各种轴操作。研究结果表明,该编码方案不仅能高效地支持结构查询,而且编码时间与插入节点的时间也较少,可快速准确地判断XML文档结构树中任意两节点之间的关系,从而避免更新操作带来的编码大量调整问题,且支持XML文档的查询与更新。  相似文献   

9.
目前现有的前缀编码、区间编码等编码方案均不能很好地支持XML文档的更新计算。为此,提出一种新的前缀编码方案TDE。将实数映射为二维元组,利用任意2个实数间存在无限个实数的特点,对XML文档进行插入节点操作而无需对其他节点进行二次编码,并采用压缩存储减小编码的存储空间。实验结果表明,该方案能有效支持XML文档的更新计算。  相似文献   

10.
一种分数前缀XML编码方案   总被引:1,自引:0,他引:1       下载免费PDF全文
针对XML数据的更新与查询问题,提出一种分数前缀编码方案(FPES)。将分数引入LSDX前缀编码中,在2个分数间可插入无穷多个分数,支持XML节点数据的无限更新,并可表示祖先后裔关系、父子关系和兄弟关系,避免二次编码。实验结果表明,FPES与LSDX相比,查询效率较高;与分数编码相比,编码时间与插入节点的时间较少。  相似文献   

11.
提出了适用于XML文档更新环境下的区间编码方法——DCLS(dynamic containment labeling scheme).DCLS将基于整数的编码泛化到基于向量的编码,扩展了传统静态区间编码方法,有效避免了XML文档更新时的重新编码.不论文档更新与否,DCLS都显示了良好的性能:DCLS利用基于整数的静态区间编码方法进行初始编码,在文档不更新的环境下,具有较高的存储效率和查询性能;同时,DCLS将整数视为特殊向量,不仅能够支持文档更新,而且更新效率高;特别是倾斜插入时,DCLS可以避免编码位长的快速增加.实验结果表明,与已有的动态区间编码方法相比,DCLS具有更好的性能.  相似文献   

12.
XML query processing based on labeling schemes has been thoroughly studied in the past several years. Recently efficient processing of updates in dynamic XML data has gained more attention. However, all the existing techniques have high update cost, they cannot completely avoid re-labeling in XML updates, and they will increase the label size which will influence the query performance. Thus, in this paper we propose a novel Compact Dynamic Binary String (CDBS) encoding to efficiently process updates. CDBS has two important properties which form the foundations of this paper: (1) CDBS supports that CDBS codes can be inserted between any two consecutive CDBS codes with orders kept and without re-encoding the existing codes; (2) CDBS is orthogonal to specific labeling schemes; thus it can be applied broadly to different labeling schemes or other applications to efficiently process updates. Moreover, because CDBS will encounter the overflow problem, we improve CDBS to Compact Dynamic Quaternary String (CDQS) encoding which can completely avoid re-labeling in XML leaf node updates no matter what the labeling schemes are. Meanwhile, we also discuss how to efficiently process internal node updates. We report the experimental results to show that our CDBS and CDQS are superior to previous approaches to process both leaf node and internal node updates.  相似文献   

13.
XML data can be represented by a tree or graph structure and XML query processing requires the information of structural relationships among nodes. The basic structural relationships are parent-child and ancestor-descendant, and finding all occurrences of these basic structural relationships in an XML data is clearly a core operation in XML query processing. Several node labeling schemes have been suggested to support the determination of ancestor-descendant or parent-child structural relationships simply by comparing the labels of nodes. However, the previous node labeling schemes have some disadvantages, such as a large number of nodes that need to be relabeled in the case of an insertion of XML data, huge space requirements for node labels, and inefficient processing of structural joins. In this paper, we propose the nested tree structure that eliminates the disadvantages and takes advantage of the previous node labeling schemes. The nested tree structure makes it possible to use the dynamic interval-based labeling scheme, which supports XML data updates with almost no node relabeling as well as efficient structural join processing. Experimental results show that our approach is efficient in handling updates with the interval-based labeling scheme and also significantly improves the performance of the structural join processing compared with recent methods.  相似文献   

14.
In order to facilitate the XML query processing, several labeling schemes have been proposed to directly determine the structural relationships between two arbitrary XML nodes without accessing the original XML documents. However, the existing XML labeling schemes have to re-label the pre-existing nodes or re-calculate the label values when a new node is inserted into the XML document during an update process. In this paper, we devise a novel encoding scheme based on the fractional number to encode the labels of the XML nodes. Moreover, we propose a mapping method to convert our proposed fractional number based encoding scheme to bit string based encoding scheme with the intention to minimize the label size and save the storage space. By applying our proposed bit string encoding scheme to the range-based labeling scheme and the prefix labeling scheme, the process of re-labeling the pre-existing nodes can be avoided when nodes are inserted as leaf nodes and sibling nodes without affecting the order of XML nodes. In addition, we propose an algorithm to control the increment of label size when new nodes are inserted frequently at a fix place of an XML tree. Experimental results show that our proposed bit string encoding scheme provides efficient support to the process of XML updating without sacrificing the query performance when it is applied to the range-based labeling schemes.  相似文献   

15.
现有XML文档树编码存储空间较大,动态更新困难。结合DeweyBFN提出一种新的基于素数的XML动态编码方法。新方法采用子树域映射机制,避免使用较大素数,平均编码长度较小。理论分析和实验结果表明,该编码方法可以有效降低编码存储空间,具有较高的查询效率,支持XML数据的动态更新。  相似文献   

16.
In this paper, we propose an efficient encoding and labeling scheme for XML, called EXEL, which is a variant of the region labeling scheme using ordinal and insert-friendly bit strings. We devise a binary encoding method to generate the ordinal bit strings, and an algorithm to make a new bit string inserted between bit strings without any influences on the order of preexisting bit strings. These binary encoding method and bit string insertion algorithm are the bases of the efficient query processing and the complete avoidance of re-labeling for updates. We present query processing and update processing methods based on EXEL. In addition, the Stack-Tree-Desc algorithm is used for an efficient structural join, and the String B-tree indexing is utilized to improve the join performance. Finally, the experimental results show that EXEL enables complete avoidance of re-labeling for updates while providing fairly reasonable query processing performance.  相似文献   

17.
Temporal XML: modeling, indexing, and query processing   总被引:1,自引:0,他引:1  
In this paper we address the problem of modeling and implementing temporal data in XML. We propose a data model for tracking historical information in an XML document and for recovering the state of the document as of any given time. We study the temporal constraints imposed by the data model, and present algorithms for validating a temporal XML document against these constraints, along with methods for fixing inconsistent documents. In addition, we discuss different ways of mapping the abstract representation into a temporal XML document, and introduce TXPath, a temporal XML query language that extends XPath 2.0. In the second part of the paper, we present our approach for summarizing and indexing temporal XML documents. In particular we show that by indexing continuous paths, i.e., paths that are valid continuously during a certain interval in a temporal XML graph, we can dramatically increase query performance. To achieve this, we introduce a new class of summaries, denoted TSummary, that adds the time dimension to the well-known path summarization schemes. Within this framework, we present two new summaries: LCP and Interval summaries. The indexing scheme, denoted TempIndex, integrates these summaries with additional data structures. We give a query processing strategy based on TempIndex and a type of ancestor-descendant encoding, denoted temporal interval encoding. We present a persistent implementation of TempIndex, and a comparison against a system based on a non-temporal path index, and one based on DOM. Finally, we sketch a language for updates, and show that the cost of updating the index is compatible with real-world requirements.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号