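A load-shedding strategy for frequent item mining over data streams*
Cite this article: ZOU Yong-gui, GONG Hai-ping, XIA Ying, SONG Qiang. A load-shedding strategy for frequent item mining over data streams*[J]. Application Research of Computers (计算机应用研究), 2011, 28(4): 1304-1307.
Authors: ZOU Yong-gui  GONG Hai-ping  XIA Ying  SONG Qiang
Affiliations: 1. Sino-Korean Chongqing GIS Research Institute, Chongqing 400065, China
2. College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
Funding: Open Fund of the Chongqing Key Laboratory of Computer Network and Communication Technology; Chongqing Science and Technology Program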
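Abstract: The arrival rate of a data stream is unpredictable; when it exceeds the system's processing capacity, some data elements cannot be processed in real time. Load shedding is one of the key techniques for dealing with this problem. This paper analyzes the shortcomings of existing load-shedding techniques and proposes a load-shedding strategy for mining frequent itemsets over data streams. The strategy uses semantic dropping based on tuple frequency, preferentially discarding tuples that occur relatively infrequently, which addresses the problems that arise when the system becomes overloaded while mining frequent items from a data stream. It also decides automatically, according to the stream's arrival rate, whether to activate load shedding, which addresses the adaptability of load shedding. Finally, experiments and analysis demonstrate the effectiveness of the strategy for frequent item mining over data streams.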

Keywords: data stream  data stream management system  load shedding  frequent item
Received: 2010-08-31
Revised: 2011-03-14

Load-shedding strategy for data stream frequent item mining
ZOU Yong-gui,GONG Hai-ping,XIA Ying,SONG Qiang.Load-shedding strategy for data stream frequent item mining[J].Application Research of Computers,2011,28(4):1304-1307.
Authors:ZOU Yong-gui  GONG Hai-ping  XIA Ying  SONG Qiang
Affiliation: (1. Sino-Korean Chongqing GIS Research Institute, Chongqing 400065, China; 2. College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)
Abstract: Because the generation rate of a data stream is unpredictable, some data elements cannot be processed in real time when the rate exceeds system capacity. Load shedding is one of the key techniques for dealing with this issue. This paper analyzes the deficiencies of current load-shedding techniques and proposes a new load-shedding strategy for frequent item mining over data streams. The strategy applies semantic tuple deletion based on item frequency, deleting tuples with relatively low frequency first, so it handles the problems that arise when the system is overloaded while mining frequent items. Moreover, load shedding is started and stopped automatically according to the data stream rate, which addresses the problem of load-shedding adaptability. Experiments and analysis show that the proposed strategy is effective for mining frequent items in data streams.
Keywords:data stream  DSMS  load shedding  frequent item
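
The abstract describes two mechanisms: shedding is activated only when the arrival rate exceeds capacity, and tuples whose items have been seen least often are dropped first. A minimal Python sketch of that idea follows. It is an illustration under stated assumptions, not the authors' algorithm: the function name shed_batch, the per-interval batching, the integer capacity, and the use of a running Counter of kept tuples as the frequency estimate are all hypothetical choices made here.

```python
from collections import Counter


def shed_batch(batch, counts, capacity):
    """Return the tuples of `batch` to forward to the frequent-item miner.

    batch    -- items that arrived during one time interval (arrival order)
    counts   -- running Counter of items forwarded so far (updated in place);
                assumed here to serve as the frequency estimate
    capacity -- assumed maximum number of tuples processable per interval
    """
    if len(batch) <= capacity:
        # Arrival rate is within capacity: load shedding stays off.
        kept = list(batch)
    else:
        # Overload: rank positions by the item's frequency so far, highest
        # first. sorted() is stable, so ties keep their arrival order.
        order = sorted(range(len(batch)), key=lambda i: -counts[batch[i]])
        # Keep the `capacity` best-ranked tuples, restored to arrival order.
        keep_idx = sorted(order[:capacity])
        kept = [batch[i] for i in keep_idx]
    counts.update(kept)
    return kept


if __name__ == "__main__":
    counts = Counter()
    print(shed_batch(["a", "b", "a", "c"], counts, 10))
    print(shed_batch(["c", "a", "d", "b", "a", "a"], counts, 4))
```

In this sketch a previously unseen item has an estimated frequency of zero and is the first to be dropped under overload; a real system would need some policy for newly emerging items, which the abstract does not specify.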
This article is indexed in Wanfang Data (万方数据) and other databases.