首页 | 官方网站   微博 | 高级检索  
     

基于聚类的非一致性数据库查询重写
引用本文:谢东,杨路明,蒲保兴,刘波. 基于聚类的非一致性数据库查询重写[J]. 小型微型计算机系统, 2007, 28(12): 2199-2202
作者姓名:谢东  杨路明  蒲保兴  刘波
作者单位:中南大学,信息科学与工程学院,湖南,长沙,410083
基金项目:湖南省教育厅资助项目;中南大学创新基金
摘    要:在非一致性数据库上,以元组匹配技术所产生的聚类和概率数据库的元组概率为基础,提出了可信聚类概率和可重写查询判断方法.考虑了最普通的IC情况(key-to-key和nonkey-to-key),给出了无连接和有连接的查询重写方法.连接查询重写方法缩小了用于连接的中间结果集中可信聚类的元组数量,有效地提高了查询性能.实验使用TPC-H决策支持基准的数据和查询进行性能研究,分析了聚类基数和数据库尺寸等相关因素的影响,结果显示方法是有效的.

关 键 词:关系数据库  非一致性数据库  查询重写  聚类概率
文章编号:1000-1220(2007)12-2199-04
收稿时间:2006-08-31
修稿时间:2006-12-14

Query Rewritings Based on Clusters in Inconsistent Databases
XIE Dong,YANG Lu-ming,PU Bao-xing,LIU Bo. Query Rewritings Based on Clusters in Inconsistent Databases[J]. Mini-micro Systems, 2007, 28(12): 2199-2202
Authors:XIE Dong  YANG Lu-ming  PU Bao-xing  LIU Bo
Abstract:This paper presents the probability of believable cluster and the decision method for rewritable queries in inconsistent databases,they are based on clusters that are produced by the tuple matching technology and tuple probabilities of probabilistic databases.The non-join and join query rewriting methods are proposed by considering the prevailing IC situations(key-to-key and nonkey-to-key).The join query rewriting method condenses the number of tuples in believable clusters of middle result sets for join,and enhances the query performance effectively.The experiment presents a performance study using the data and queries of the TPC-H decision support benchmark and analyses some effects of relative factors such as the cluster cardinality and database size.The experiment results show that the method is effective.
Keywords:relation database   inconsistent database   query rewriting   cluster probability
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号