首页 | 官方网站   微博 | 高级检索  
     

基于多核并行的海量数据序列模式挖掘*
引用本文:俞东进,郑苏杭,李万清.基于多核并行的海量数据序列模式挖掘*[J].计算机应用研究,2012,29(2):478-481.
作者姓名:俞东进  郑苏杭  李万清
作者单位:杭州电子科技大学计算机学院,杭州,310018
基金项目:国家自然科学基金资助项目(60903053);浙江省重大科技计划资助项目(2008C11099-1)
摘    要:为了在多核处理器上充分利用多核资源以提升挖掘性能,提出了一种动态与静态任务分配机制相结合的基于多核的并行序列模式挖掘算法。该算法采用数据并行与任务并行相结合的策略,在各处理器核生成局部序列模式后,再与其他处理器核协同,以最终获得所有的全局序列模式。算法通过并行局部归约技术消除了局部序列的重复生成与计算,并可结合静态与动态任务分配机制解决处理器的负载不均衡问题。理论分析和实验都证实了该算法可有效利用多核计算平台及多核体系结构优势,具有较高的运行效率和加速比。

关 键 词:并行  多核  序列模式  海量数据挖掘

Parallel massive mining of sequential patterns based on multi-core processors
YU Dong-jin,ZHENG Su-hang,LI Wan-qing.Parallel massive mining of sequential patterns based on multi-core processors[J].Application Research of Computers,2012,29(2):478-481.
Authors:YU Dong-jin  ZHENG Su-hang  LI Wan-qing
Affiliation:(School of Computer, Hangzhou Dianzi University, Hangzhou 310018, China)
Abstract:To fully utilize the multi-core resources on multi-core processors to improve mining performance, this paper presented a novel algorithm of mining parallel sequential patterns based on the multi-core processors. It combined the data parallelism and task parallelism, with global sequential patterns obtained by combining local patterns discovered in different processor cores. Through local parallel reduction, it eliminated the repetitive patterns and reduced computational effort. Besides, it achieved the workload balancing by static and dynamic task distribution mechanisms. Both theoretical analysis and practical experiments show that the algorithm takes good advantage of multi-core computing platform, having higher operating efficiency and speedup.
Keywords:parallel  multi-core  sequential patterns  massive data mining
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号