Mining Short-Rule Covers in Relational Databases
Authors:Claudio Carpineto, Giovanni Romano
Affiliation:Fondazione Ugo Bordoni, Rome, Italy
Abstract:An implication rule Q → R is a statement of the form "for all objects in the database, if an object has the attribute–value pairs Q, then it also has the attribute–value pairs R." This simple type of rule is theoretically interesting because it supports reasoning similar to that of functional dependencies in database theory, and it may be of practical significance because the set of implication rules that hold in a relation can remain substantially large even when mining real data and considering only most general covers; i.e., covers containing rules with unredundant right and left sizes. Motivated by these observations, we focus on the extraction of short-rule covers, which cannot be efficiently mined by standard rule miners. We present an algorithm driven by "negative examples" (i.e., objects that satisfy Q but not R) to prune the rule-candidate lattice associated with each "positive example" (i.e., an object that satisfies both Q and R). The algorithm scales up quite well with respect to the number of objects, and it is particularly suitable for databases with attributes described by large domains. Furthermore, a perfect hash function ensures extraction of short-rule covers even from databases containing a large number of attributes.
Keywords:knowledge discovery from databases, implication rules, data mining, hashing, computational complexity
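
The definitions in the abstract can be illustrated with a minimal Python sketch. It only shows what it means for an implication rule Q → R to hold and how objects split into positive and negative examples; it is not the authors' mining algorithm, and it does not model the lattice pruning or the perfect hash function. The function names (`satisfies`, `classify`, `rule_holds`) and the toy relation are hypothetical, chosen for illustration.

```python
from typing import Dict, List, Tuple

# A database object is modeled as a mapping from attribute to value (an assumption).
Row = Dict[str, str]


def satisfies(row: Row, pairs: Row) -> bool:
    """Return True if the row has every attribute-value pair in `pairs`."""
    return all(row.get(attr) == val for attr, val in pairs.items())


def classify(rows: List[Row], q: Row, r: Row) -> Tuple[List[Row], List[Row]]:
    """Split rows into positive examples (satisfy Q and R)
    and negative examples (satisfy Q but not R)."""
    positives = [row for row in rows if satisfies(row, q) and satisfies(row, r)]
    negatives = [row for row in rows if satisfies(row, q) and not satisfies(row, r)]
    return positives, negatives


def rule_holds(rows: List[Row], q: Row, r: Row) -> bool:
    """Q -> R holds in the relation when there is no negative example."""
    _, negatives = classify(rows, q, r)
    return not negatives


# Toy relation, invented for illustration only.
rows = [
    {"color": "red", "shape": "round", "size": "small"},
    {"color": "red", "shape": "round", "size": "large"},
    {"color": "blue", "shape": "square", "size": "small"},
]
```

With this toy relation, `rule_holds(rows, {"color": "red"}, {"shape": "round"})` is true because every red object is round, while `rule_holds(rows, {"color": "red"}, {"size": "small"})` is false because the second object is a negative example.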