首页 | 官方网站   微博 | 高级检索  
     

频繁情景并行挖掘算法研究
引用本文:王云岚,周兴社,侯正雄.频繁情景并行挖掘算法研究[J].西北工业大学学报,2007,25(2):173-176.
作者姓名:王云岚  周兴社  侯正雄
作者单位:西北工业大学,高性能计算研究与发展中心,陕西,西安,710072
摘    要:频繁情景可用于挖掘蕴藏在事件序列数据库中的知识,为了提高算法的时间性能,提出了一种挖掘频繁情景的并行算法PRE。研究了局部频繁情景与全局频繁情景的关系;通过研究频繁情景挖掘中事件可删除的条件,提出了逐步缩减数据库的技术,使得算法在迭代过程中需要扫描的数据库大小逐渐减少。数据实验表明,仅采用数据库缩减技术时算法PRE的时间性能比算法WINEPI提高25%,并行挖掘时算法PRE的并行效率与处理器个数近似成线性关系。

关 键 词:数据挖掘  频繁情景  并行算法
文章编号:1000-2758(2007)02-0173-04
修稿时间:2006-06-12

A Parallel Algorithm for Mining Frequent Episodes
Wang Yunlan,Zhou Xingshe,Hou Zhengxiong.A Parallel Algorithm for Mining Frequent Episodes[J].Journal of Northwestern Polytechnical University,2007,25(2):173-176.
Authors:Wang Yunlan  Zhou Xingshe  Hou Zhengxiong
Affiliation:Center for High Performance Computing, Northwestern Polytechnical University, Xilan 710072, China
Abstract:To our knowledge,there is no paper in the open literature concerning parallel algorithm for mining frequent episodes.We now present an algorithm,called PRE(parallel algorithm using database reduction technique for mining frequent episodes) by us,that is more efficient than the existing WINEPI algorithm because:(1) PRE utilizes the efficiency of parallel computing and(2) the size of database can be gradually reduced during mining.In the full paper,we explain PRE algorithm in detail;in this abstract,we just add some pertinent remarks to listing the three topics of explanation:(1) the important properties in parallel mining episodes of frequent occurrence;(2) the database reduction techniques in parallel mining frequent episodes;and(3) the iterative procedure of PRE algorithm;in topic 1,we give Theorems 1,2,and 3 in the full paper that make clear the relations among global frequent episodes and local frequent episodes under various conditions;in topic 2,we give Theorems 4 and 5 in the full paper for reducing the database gradually during mining;in topic 3,we give a four-step iterative procedure.Finally we give some numerical examples,whose results,shown in Figs.1,2,and 3 in the full paper,show preliminarily that,by using the database reduction techniques alone,algorithm PRE is faster than WINEPI about 25%.The experiment results also show that algorithm DRE has good speedup performance.
Keywords:data mining  frequent episode  parallel algorithm
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号