首页 | 官方网站   微博 | 高级检索  
     

基于Zipf分布与属性相关性的选择性估计
引用本文:姜芳艽.基于Zipf分布与属性相关性的选择性估计[J].计算机科学,2010,37(11):184-189.
作者姓名:姜芳艽
作者单位:徐州师范大学智能信息处理研究所,徐州,221116;中国人民大学信息学院,北京,100872
基金项目:本文受国家自然科学基金(60773216)资助。
摘    要:在Deep Web数据集成中,集成查询接口和很多W cb数据库查询接口用合取谓词表达查询,但是也有相当一部分Web数据库的查询接口用互斥谓词表达查询,这意味着查询转换时每次只能选择一个谓词。因此,准确、高效地佑计每个互斥查询的选择性是优化查询转换的关键。提出了基于Zipf分布与属性相关性的选择性佑计方法。通过属性之间的相关性从Web数据库上获取该属性近似随机的属性级样本,在此基础上计算属性值的Zipf分布方程,进而推断该无限值属性的任意值的选择性。实验表明,该方法可以准确、高效地估计各互斥查询的选择性。

关 键 词:Zipf分布,属性相关性,选择性估计
收稿时间:2009/12/31 0:00:00
修稿时间:2010/3/18 0:00:00

Selectivity Estimation Based on 7ipf Distribution and Attribute Correlation
JIANG Fang-jiao.Selectivity Estimation Based on 7ipf Distribution and Attribute Correlation[J].Computer Science,2010,37(11):184-189.
Authors:JIANG Fang-jiao
Affiliation:(Institute of Intelligent Information Processing,Xuzhou Normal University,Xuzhou 221116,China);(School of Information,Renmin University of China,Beijing 100872,China)
Abstract:In Deep Web data integration,some Web database interfaces express exclusive predicates,which permit only one predicate to be selected. Accurately and efficiently estimating the selectivity of each exclusive query is of critical importance to optimal query translation. In this paper, we proposed a novel selectivity estimation method. Firstly, we computed the Attribute Correlation and access approximately random attributclevel sample through submitting the query on the least correlative attribute to the real Web database. hhen we computed Zipf equation aided by the information of word rank from the sample and the actual selectivity of several words from the real Web database. Finally, the selectivity of any word on the infinitcvaluc attribute was derived by the Zipf equation. An experimental evaluation of the proposed selectivity estimation method was provided and experimental results are highly accurate.
Keywords:Zipf distribution  Attribute correlation  Selectivity estimation
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机科学》浏览原始摘要信息
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号