首页 | 官方网站   微博 | 高级检索  
     

基于异构Flink集群的节点优先级调度策略
引用本文:汪文豪,史雪荣.基于异构Flink集群的节点优先级调度策略[J].计算机工程,2022,48(3):197-203.
作者姓名:汪文豪  史雪荣
作者单位:1. 南京工业大学 计算机科学与技术学院, 南京 211816;2. 盐城师范学院 数学与统计学院, 江苏 盐城 224002
基金项目:国家自然科学基金(11872327);;江苏省高等学校自然科学研究项目(20KJA190001);
摘    要:Flink流处理系统默认的任务调度策略在一定程度上忽略了集群异构和节点可用资源,导致集群整体负载不均衡。研究分布式节点的实时性能和集群作业环境,根据实际作业环境的异构分布情况,设计结合异构Flink集群的节点优先级调整方法,以基于Ganglia可扩展分布式集群资源监控系统的集群信息为依据,动态调整适应当前作业环境的节点优先级指数。基于此提出Flink节点动态自适应调度策略,通过实时监测节点的异构状况,并在任务执行过程中根据实时作业环境更新节点优先级指数,为系统任务找到最佳的执行节点完成任务分配。实验结果表明,相比于Flink默认的任务调度策略,基于节点优先级调整方法的自适应调度策略在WorldCount基准测试中的运行时间约平均减少6%,可使异构Flink集群在保持集群低延迟的同时,节点资源利用率和任务执行效率更高。

关 键 词:Flink集群  异构集群  负载不均衡  节点优先级  自适应调度  
收稿时间:2020-11-09
修稿时间:2021-02-20

Node Priority Scheduling Strategy Based on Heterogeneous Flink Cluster
WANG Wenhao,SHI Xuerong.Node Priority Scheduling Strategy Based on Heterogeneous Flink Cluster[J].Computer Engineering,2022,48(3):197-203.
Authors:WANG Wenhao  SHI Xuerong
Affiliation:1. College of Computer Science and Technology, Nanjing Tech University, Nanjing 211816, China;2. College of Mathematics and Statistics, Yancheng Teachers University, Yancheng, Jiangsu 224002, China
Abstract:The default task scheduling strategy of the Flink stream processing system ignores the cluster heterogeneity and available resources of nodes to a certain extent,resulting in an unbalanced overall cluster load. This study investigates the real-time performance of distributed nodes and the cluster operation environment and designs a node priority-adjustment method based on heterogeneous Flink clusters according to the heterogeneous distribution problem of the actual operation environment. The method dynamically adjusts the node priority index that adapts to the current operating environment based on the cluster information of the Ganglia scalable distributed cluster resource-monitoring system. Based on this,a dynamic adaptive scheduling strategy for link nodes is proposed. By monitoring the heterogeneous status of nodes in real time and updating the node priority index according to the real-time working environment during the task execution process,the best execution node for the system task to complete the task assignment can be found.The experimental results show that compared with Flink’s default task scheduling strategy,the adaptive scheduling strategy based on the node priority adjustment method reduces the running time of the WorldCount benchmark by approximately 6% on average.This enables the heterogeneous Flink cluster to maintain low cluster latency while maintaining higher node resource utilization and task execution efficiency.
Keywords:Flink cluster  heterogeneous cluster  load unbalancing  node priority  adaptive scheduling
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号