首页 | 官方网站   微博 | 高级检索  
     

利用抽样技术和元学习的分布式关联规则挖掘算法
引用本文:李梅花,王黎明,许红涛.利用抽样技术和元学习的分布式关联规则挖掘算法[J].计算机应用,2006,26(4):872-874.
作者姓名:李梅花  王黎明  许红涛
作者单位:郑州大学,信息工程学院,河南,郑州,450052
摘    要:结合动态项集计数技术和抽样的思想,利用元学习策略来产生频繁项集,提出了一个不共享内存的分布式关联规则挖掘算法DASM;引进了相似度的概念,并用之提高了挖掘的精确度。理论分析以及在IBM数据生成器生成的数据集上的实验均表明,DASM算法具有较高的挖掘效率和较低的通信量,适用于对效率要求较高的应用领域。

关 键 词:抽样  元学习  动态项集计数  相似度  分布式关联规则挖掘
文章编号:1001-9081(2006)04-0872-03
收稿时间:2005-10-17
修稿时间:2005-10-17

Distributed association rules mining algorithm by sampling and meta-learning
LI Mei-hua,WANG Li-ming,XU Hong-tao.Distributed association rules mining algorithm by sampling and meta-learning[J].journal of Computer Applications,2006,26(4):872-874.
Authors:LI Mei-hua  WANG Li-ming  XU Hong-tao
Affiliation:College of Information Engineering, Zhengzhou University, Zhengzhou Henan 450052, China
Abstract:A new distributed association rule mining algorithm of DASM was presented. It adopted the ideas of dynamic itemset counting and sampling, and produced frequent itemsets by meta-learning method. Different sites that applied DASM needn't share the same memory. To assure the completeness of the results, the concept of similar degree was introduced. Theory analysis and experiments on the datasets generated using the generator from the IBM Almaden Quest research group show that DASM has better performance and less communication loads. DASM is applicable to those applications where the efficiency could be more important than accuracy results.performance and less communication loads. DASM is applicable to those applications where the efficiency could be more important than accuracy results.
Keywords:sampling  meta-learning  dynamic itemset counting  similar degree  distributed association rule mining
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号