首页 | 官方网站   微博 | 高级检索  
     


Sliding window based weighted maximal frequent pattern mining over data streams
Affiliation:1. Department of Computer Engineering, Sejong University, Seoul, South Korea;2. Department of Computer Science, Chungbuk National University, South Korea;1. Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan;2. Department of Electrical Engineering, Chang Gung University, Taipei, Taiwan;1. Fraunhofer INT, Appelsgarten 2, D-53879 Euskirchen, Germany;2. Ghent University, Faculty of Economics and Business Administration, Tweekerkenstraat 2, B-9000 Gent, Belgium;1. School of IOT Engineering, Jiangnan University, Wuxi 214122, China;2. Department of Electronics and Information Engineering, Chonbuk National University, Jeonju, Jeonbuk 561756, Republic of Korea;1. College of Biomedical Engineering and Instrument Science, Zhejiang University, 310008 Zhou Yiqing Building 510, Zheda road 38#, Hangzhou, Zhejiang, China;2. Department of Information and Communication Engineering, University of Murcia, Spain;1. Innovative Information Industry Research Center, School of Computer Science and Technology, Shenzhen Graduate School, Harbin Institute of Technology, Shenzhen, China;2. Information and Communications Research Laboratories, ITRI, Hsinchu, Taiwan, ROC;3. CyLab, Carnegie Mellon University, Pittsburgh, USA;4. Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan, ROC
Abstract:As data have been accumulated more quickly in recent years, corresponding databases have also become huger, and thus, general frequent pattern mining methods have been faced with limitations that do not appropriately respond to the massive data. To overcome this problem, data mining researchers have studied methods which can conduct more efficient and immediate mining tasks by scanning databases only once. Thereafter, the sliding window model, which can perform mining operations focusing on recently accumulated parts over data streams, was proposed, and a variety of mining approaches related to this have been suggested. However, it is hard to mine all of the frequent patterns in the data stream environment since generated patterns are remarkably increased as data streams are continuously extended. Thus, methods for efficiently compressing generated patterns are needed in order to solve that problem. In addition, since not only support conditions but also weight constraints expressing items’ importance are one of the important factors in the pattern mining, we need to consider them in mining process. Motivated by these issues, we propose a novel algorithm, weighted maximal frequent pattern mining over data streams based on sliding window model (WMFP-SW) to obtain weighted maximal frequent patterns reflecting recent information over data streams. Performance experiments report that MWFP-SW outperforms previous algorithms in terms of runtime, memory usage, and scalability.
Keywords:Data mining  Data stream  Sliding window  Weighted maximal frequent pattern mining
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号