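Mining Negatively Correlated Pair Rules Beyond the Support-Confidence Framework
Cite this article: QIAN Tie-Yun, FENG Xiao-Nian, WANG Yuan-Zhen. Mining Negatively Correlated Pair Rules Beyond the Support-Confidence Framework[J]. Computer Science, 2005, 32(10): 124-127.
Authors: QIAN Tie-Yun  FENG Xiao-Nian  WANG Yuan-Zhen
Affiliations: [1] Institute of Database and Multimedia Technology, School of Computer Science, Huazhong University of Science and Technology, Wuhan 430074 [2] China Power Finance Company, Huazhong Branch, Wuhan 430062
Funding: This research was supported by the Specialized Research Fund for the Doctoral Program of Higher Education, project "OLAP and Gradient Mining Based on Condensed Data Cubes" (project No. 20030487032).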
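Abstract: Correlation rules are of more practical significance than traditional association rules. However, existing correlation rule mining algorithms all rely on Apriori-like algorithms to mine high-support itemsets first and then apply a correlation test to those itemsets to obtain correlation rules, which makes rules with low support but high correlation hard to discover. The difficulty in mining correlation rules directly is that candidate correlated items cannot be pruned with an Apriori-like property, so the search space grows explosively. The algorithm MNI proposed in this paper uses the lower bound of the Phi correlation coefficient to generate candidate negatively correlated items, thereby shrinking the search space of negatively correlated items, and its completeness and correctness are proved. For generating negative correlation rules from negatively correlated item pairs by means of rule reliability, a method is proposed that uniformly converts the counting of negatively correlated pairs into the counting of positively correlated pairs. Experimental results on real datasets show that MNI effectively speeds up the mining of negatively correlated item pairs.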

Keywords: Association rules  Correlation rules  Phi correlation coefficient  Rule reliability
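
The pruning idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' MNI implementation: the abstract does not give the algorithm's details, so the function names (`phi`, `phi_lower_bound`, `candidate_negative_pairs`), the `threshold` parameter, and the plain pairwise loop are assumptions made for illustration. The sketch relies only on the standard definition of the Phi coefficient for two binary items and on the fact that the joint support of two items can never be smaller than max(0, supp(A) + supp(B) - 1).

```python
from math import sqrt


def phi(supp_a, supp_b, supp_ab):
    """Phi correlation coefficient of two binary items, from relative supports."""
    denom = sqrt(supp_a * supp_b * (1 - supp_a) * (1 - supp_b))
    if denom == 0:
        raise ValueError("phi is undefined when an item occurs in no or in all transactions")
    return (supp_ab - supp_a * supp_b) / denom


def phi_lower_bound(supp_a, supp_b):
    """Smallest phi a pair can reach, given only the single-item supports.

    phi grows with the joint support, and the joint support cannot drop
    below max(0, supp_a + supp_b - 1), so that minimum yields a lower bound.
    """
    min_joint = max(0.0, supp_a + supp_b - 1.0)
    return phi(supp_a, supp_b, min_joint)


def candidate_negative_pairs(item_supports, threshold):
    """Keep only pairs that could still satisfy phi <= -threshold.

    Hypothetical helper: a pair whose lower bound is already above
    -threshold can never be that strongly negatively correlated, so it is
    dropped without counting its joint support.
    """
    items = sorted(item_supports)
    candidates = []
    for i, a in enumerate(items):
        for b in items[i + 1:]:
            if phi_lower_bound(item_supports[a], item_supports[b]) <= -threshold:
                candidates.append((a, b))
    return candidates


if __name__ == "__main__":
    supports = {"a": 0.5, "b": 0.5, "c": 0.01}  # made-up supports
    print(candidate_negative_pairs(supports, threshold=0.3))
```

With these made-up supports, the pair of two frequent items has a lower bound of -1 and is kept, while any pair involving the rare item `c` has a lower bound of about -0.10 and is pruned at a threshold of 0.3. Note that the filter needs single-item supports only, which is why no minimum-support constraint on the pair is involved.
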

Mining Negative Correlation Rules Beyond Support-Confidence Framework
QIAN Tie-Yun, FENG Xiao-Nian, WANG Yuan-Zhen. Mining Negative Correlation Rules Beyond Support-Confidence Framework[J]. Computer Science, 2005, 32(10): 124-127.
Authors: QIAN Tie-Yun  FENG Xiao-Nian  WANG Yuan-Zhen
Affiliation: 1 Computer Science Department, Huazhong University of Science and Technology, Wuhan 430074; 2 China Power Finance Company, Huazhong Branch, Wuhan 430062
Abstract: Correlation rules are of more practical value than traditional association rules, but almost all existing correlation rule mining algorithms are Apriori-based, which makes correlation rules with low support but high correlation difficult to find. This paper introduces a new algorithm, MNI, which uses the lower bound of the Phi correlation coefficient to generate all candidate negatively correlated items and to curb the explosive growth of the search space. Both the completeness and the correctness of MNI are proved. Negative correlation rules are then mined using a reliability measure, without directly counting the negatively correlated pairs. Experiments on real datasets show that the algorithm mines negatively correlated items efficiently.
Keywords: Association rules  Correlation rules  Phi correlation coefficient  Rule reliability measure
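
The remark about avoiding a direct count of negatively correlated pairs can be made concrete with a short Python sketch. The abstract does not spell out the paper's conversion or its reliability measure, so what follows is only the standard inclusion-exclusion identity for a 2x2 contingency table; the function names `positive_counts` and `negated_counts` and the toy transactions are assumptions introduced here.

```python
from collections import Counter
from itertools import combinations


def positive_counts(transactions):
    """Count single items and co-occurring (positive) item pairs in one pass."""
    item_counts = Counter()
    pair_counts = Counter()
    for transaction in transactions:
        items = sorted(set(transaction))
        item_counts.update(items)
        # Pairs are stored as alphabetically ordered tuples.
        pair_counts.update(combinations(items, 2))
    return item_counts, pair_counts


def negated_counts(n, count_a, count_b, count_ab):
    """Derive the counts involving absent items from positive counts only.

    n is the total number of transactions; no transaction is rescanned.
    """
    return {
        "a_not_b": count_a - count_ab,                    # A present, B absent
        "not_a_b": count_b - count_ab,                    # A absent, B present
        "not_a_not_b": n - count_a - count_b + count_ab,  # both absent
    }


if __name__ == "__main__":
    # Toy data, invented for illustration.
    transactions = [["tea", "milk"], ["tea"], ["coffee"], ["coffee", "milk"], ["tea"]]
    items, pairs = positive_counts(transactions)
    # A Counter returns 0 for a pair that never co-occurs.
    joint = pairs[("coffee", "tea")]
    print(negated_counts(len(transactions), items["tea"], items["coffee"], joint))
```

Because every cell of the table follows from the transaction total, the two item counts, and the co-occurrence count, any measure defined on "A present, B absent" style events can be evaluated from positive counts alone.
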
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.