
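A Structure Index for Keyword Search over XML Documents
Cite this article: LOU Ying, LI Zhan-huai, GUO Wen-qi, CHEN Qun, HAN Meng. A Structure Index for Keyword Search over XML Documents [J]. Computer Science (计算机科学), 2010, 37(12): 120-124.
Authors: LOU Ying  LI Zhan-huai  GUO Wen-qi  CHEN Qun  HAN Meng
Affiliation: School of Computer Science, Northwestern Polytechnical University, Xi'an 710129
Funding: This work was supported by the National 863 Key Program (2009AA1Z134) and the National Natural Science Foundation of China (60803043, 60720106001).
Abstract: Indexing of XML data has a considerable impact on retrieval efficiency. After an in-depth analysis of existing XML structure indexes, and taking the characteristics of XML documents into account, this paper proposes a structure index for keyword search called LSS (Level Structure Summary). LSS merges nodes that share the same label path, which allows it to determine efficiently whether nodes are homogeneous or heterogeneous. The paper implements CSCAN, an algorithm for building the LSS index, and designs LSSearch, an XML keyword search algorithm built on the LSS index. Guided by the LSS index, LSSearch splits each keyword's original inverted list into subsets of different types and then runs the query over all of these subsets. Experimental results show that LSS helps reduce the size of keyword inverted lists for XML documents and improves retrieval efficiency.

Keywords: XML, keyword search, index, inverted list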

Structure Summary for Keyword Search over XML Documents
LOU Ying,LI Zhan-huai,GUO Wen-qi,CHEN Qun,HAN Meng.Structure Summary for Keyword Search over XML Documents[J].Computer Science,2010,37(12):120-124.
Authors:LOU Ying  LI Zhan-huai  GUO Wen-qi  CHEN Qun  HAN Meng
Affiliation:(School of Computer Science,Northwestern Polytechnical University,Xi'an 710129,China)
Abstract: Indexing of XML data is crucial to the retrieval efficiency of XML documents. After analyzing existing XML structure summaries, this paper proposes LSS, a structure summary for keyword search that takes the characteristics of XML documents into account. LSS merges the nodes of the XML tree that share the same label path, so that the homogeneity or heterogeneity of nodes can be determined efficiently. The paper implements CSCAN, an algorithm for constructing LSS, and designs LSSearch, an XML keyword retrieval algorithm based on LSS. This algorithm splits each keyword's inverted list into subsets of different types and then retrieves all results from these subsets. Experimental results demonstrate that LSS can dramatically reduce the size of keyword inverted lists for XML documents and improve retrieval efficiency.
Keywords: XML, keyword search, index, inverted list
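
The abstract describes two ideas: grouping nodes by label path, and splitting a keyword's inverted list into subsets according to that grouping. The Python sketch below illustrates both on a toy document. It is not the paper's CSCAN or LSSearch algorithm, whose details are not given on this page; the function names, the Dewey-style node ids, and the sample document are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

def build_label_path_summary(xml_text):
    """Group nodes by label path and build a keyword inverted list.

    Hypothetical helper, not the paper's CSCAN. Node ids are Dewey-style
    tuples (an assumption); only element text is indexed, tail text is ignored.
    """
    root = ET.fromstring(xml_text)
    summary = defaultdict(list)    # label path -> ids of nodes sharing that path
    inverted = defaultdict(list)   # keyword -> [(node id, label path), ...]

    def visit(elem, parent_path, node_id):
        label_path = parent_path + (elem.tag,)
        summary[label_path].append(node_id)
        for word in (elem.text or "").split():
            inverted[word.lower()].append((node_id, label_path))
        for i, child in enumerate(elem):
            visit(child, label_path, node_id + (i,))

    visit(root, (), (0,))
    return dict(summary), dict(inverted)

def split_by_label_path(postings):
    """Split one keyword's inverted list into one subset per label path."""
    subsets = defaultdict(list)
    for node_id, label_path in postings:
        subsets[label_path].append(node_id)
    return dict(subsets)

# Toy document, invented for this example.
DOC = """<bib>
  <book><title>XML index</title><author>Lou</author></book>
  <book><title>Keyword search</title><author>Li</author></book>
  <article><title>XML search</title></article>
</bib>"""

summary, inverted = build_label_path_summary(DOC)
xml_subsets = split_by_label_path(inverted["xml"])
for path, ids in xml_subsets.items():
    print("/".join(path), ids)
```

Here the postings for "xml" fall into two subsets, one for `bib/book/title` and one for `bib/article/title`, so a search can work on each smaller subset rather than on the full inverted list.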
This article is indexed in Wanfang Data and other databases.