基于线索频繁模式树的关联规则产生算法 Generating Association Bules Based on Threaded Frequent Pattern Tree期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于线索频繁模式树的关联规则产生算法

引用本文：	李川,范明.基于线索频繁模式树的关联规则产生算法[J].计算机工程与应用,2004,40(4):188-192.

作者姓名：	李川范明

作者单位：	郑州大学计算机系,郑州,450052

基金项目：	河南省自然科学基金的资助(编号:0111060700)

摘要：	虽然FP-Growth算法能够有效地从数据库中挖掘频繁模式,但如何由其挖掘出的频繁模式中高效地产生关联规则仍是一个相当复杂的问题。该文提出了用于组织频繁模式的线索频繁模式树(TFPT)和一个从TFPT中挖掘关联规则的高效算法—最短模式优先算法(SPF)。挖掘模式Y的关联规则时,SPF算法应用了两个优化策略,避免了对大量的不可能成为规则XY-X左部的Y的子集的检查,从而获得了很好的性能。实验表明:与类FP-Growth算法结合时,SPF算法运行速度远远快于Apriori算法,并有相当好的可伸缩性。
关键词：	数据挖掘频繁模式关联规则
文章编号：	1002-8331-(2004)04-0188-05
Generating Association Bules Based on Threaded Frequent Pattern Tree

Li Chuan Fan Ming.Generating Association Bules Based on Threaded Frequent Pattern Tree[J].Computer Engineering and Applications,2004,40(4):188-192.

Authors:	Li Chuan Fan Ming

Abstract:	FP-Growth algorithm can mine frequent patterns from database effectively,but how to generate association rules from its resulting patterns efficiently remains a complicated problem.This paper proposes a data structure,called threaded frequent pattern tree(TFPT),to maintain frequent patterns,and an efficient algorithm,called Shortest-Pattern-First(SPF)algorithm,to mine association rules from TFPT.While mining association rules from pattern Y,two optimization strategies are used to avoid examining a lot of subsets of Y that can't be the left of any association rule of the form XY-X.In this way,SPF algorithm achieves a good performance.Our experiments shows that combined with a FP-Growth-like algorithm,SPF runs far faster than Apriori algorithm and has excellent scalability.

Keywords:	Data mining Frequent pattern Association rule
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏