首页 | 官方网站   微博 | 高级检索  
     

关联规则挖掘中Apriori算法的研究与改进
引用本文:崔贯勋,李梁,王柯柯,苟光磊,邹航.关联规则挖掘中Apriori算法的研究与改进[J].计算机应用,2010,30(11):2952-2955.
作者姓名:崔贯勋  李梁  王柯柯  苟光磊  邹航
作者单位:1. 重庆理工大学2.
基金项目:教育部科学研究项目,重庆市科技攻关计划项目,重庆市自然科学基金,重庆理工大学科研青年基金
摘    要:经典的产生频繁项目集的Apriori算法存在多次扫描数据库可能产生大量候选及反复对候选项集和事务进行模式匹配的缺陷,导致了算法的效率较低。为此,对Apriori算法进行以下3方面的改进:改进由k阶频繁项集生成k+1阶候选频繁项集时的连接和剪枝策略;改进对事务的处理方式,减少Apriori算法中的模式匹配所需的时间开销;改进首次对数据库的处理方法,使得整个算法只扫描一次数据库,并由此提出了改进算法。实验结果表明,改进算法在性能上得到了明显提高。

关 键 词:数据挖掘  关联规则  Apriori算法  频繁项集  候选项集  
收稿时间:2010-05-18
修稿时间:2010-07-12

Research and improvement on Apriori algorithm of association rule mining
CUI Guan-xun,LI Liang,WANG Ke-ke,GOU Guang-lei,ZOU Hang.Research and improvement on Apriori algorithm of association rule mining[J].journal of Computer Applications,2010,30(11):2952-2955.
Authors:CUI Guan-xun  LI Liang  WANG Ke-ke  GOU Guang-lei  ZOU Hang
Abstract:The classic Apriori algorithm for discovering frequent itemsets scans the database many times and the pattern matching between candidate itemsets and transactions is used repeatedly, so a large number of candidate itemsets were produced, which results in low efficiency of the algorithm. The improved Apriori algorithm improved it from three aspects: firstly, the strategy of the join step and the prune step was improved when candidate frequent (k+1)-itemsets were generated from frequent k-itemsets; secondly, the method of dealing with transaction was improved to reduce the time of pattern matching to be used in the Apriori algorithm; in the end, the method of dealing with database was improved, which lead to only once scanning of the database during the whole course of the algorithm. According to these improvements, an improved algorithm was introduced. The efficiency of Apriori algorithm got improvement both in time and in space. The experimental results of the improved algorithm show that the improved algorithm is more efficient than the original.
Keywords:data mining                                                                                                                        association rule                                                                                                                        apriori algorithm                                                                                                                        frequent itemsets                                                                                                                        candidate itemsets
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号