首页 | 官方网站   微博 | 高级检索  
     

基于时间序列分析的杀手级任务在线识别方法
引用本文:唐红艳,李影,贾统,袁小雍. 基于时间序列分析的杀手级任务在线识别方法[J]. 计算机科学, 2017, 44(4): 43-46
作者姓名:唐红艳  李影  贾统  袁小雍
作者单位:北京大学软件与微电子学院 北京100871,北京大学软件与微电子学院 北京100871;北京大学软件工程国家工程研究中心 北京100871,北京大学软件与微电子学院 北京100871,北京大学软件与微电子学院 北京100871
基金项目:本文受深圳市科技计划重点项目(JSGG20140516162852628)资助
摘    要:通过分析Google集群中任务的失效次数和失效模式,找到具有高失效频次和连续失效特征的杀手级任务。杀手级任务不仅影响云计算系统上应用运行的可靠性与可用性,而且会浪费大量资源并显著增加调度负载。在杀手级任务资源使用模式的基础上,提出一种基于时间序列的在线识别方法,以利用资源使用时间序列在失效早期准确识别出杀手级任务并通知云计算系统采取前摄性失效恢复措施,从而避免不必要的重复调度和资源浪费。实验结果表明,该方法能够以98.5%的准确率在平均3%的失效时间内识别出杀手级任务,同时节约96.75%的系统资源。

关 键 词:云计算系统  杀手级任务  在线识别  时间序列  资源使用模式  失效频率
收稿时间:2015-11-30
修稿时间:2016-02-29

Time Series Based Killer Task Online Recognition Approach
TANG Hong-yan,LI Ying,JIA Tong and YUAN Xiao-yong. Time Series Based Killer Task Online Recognition Approach[J]. Computer Science, 2017, 44(4): 43-46
Authors:TANG Hong-yan  LI Ying  JIA Tong  YUAN Xiao-yong
Affiliation:School of Software and Microelectronics,Peking University,Beijing 100871,China,School of Software and Microelectronics,Peking University,Beijing 100871,China;National Engineering Center of Software Engineering,Peking University,Beijing 100871,China,School of Software and Microelectronics,Peking University,Beijing 100871,China and School of Software and Microelectronics,Peking University,Beijing 100871,China
Abstract:By analyzing failure frequency and failure patterns in Google cluster dataset,this paper fond what are called as killer tasks that suffer from frequent and continuous failure.Killer task is a big concern of cloud system as it causes unnecessary resource wasting and significant increase of scheduling overhead.In this paper,an online recognition approach was proposed to make use of the resource usage time series to recognize killer tasks precisely at the very early stage of their occurrence so that proactive actions can be taken to avoid rescheduling and resource wasting.The experiment results show that the proposed approach performs a 98.5% precision in recognizing killer tasks at 3% of failure duration,with a 96.75% resource saving for the cloud system averagely.
Keywords:Cloud system  Killer tasks  Online recognition  Time series  Resource usage pattern  Failure frequency
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号