基于深度强化学习的综合能源业务通道优化机制 A Integrated Energy Service Channel Optimization Mechanism Based on Deep Reinforcement Learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于深度强化学习的综合能源业务通道优化机制

引用本文：	马庆刘,喻鹏,吴佳慧,熊翱,颜拥.基于深度强化学习的综合能源业务通道优化机制[J].北京邮电大学学报,2020,43(2):87-93.

作者姓名：	马庆刘喻鹏吴佳慧熊翱颜拥

作者单位：	1. 北京邮电大学网络与交换技术国家重点实验室, 北京 100876;2. 国网浙江省电力有限公司, 杭州 310007

基金项目：	国家电网公司科技项目"高可信智能感知互动综合服务系统关键技术研发及应用示范"（52110418002V）

摘要：	为了保障综合能源系统的稳定运行，承载综合能源业务的通信网络需要具备高可靠、低风险等特征.依据综合能源业务的通道要求，提出了一种深度强化学习的算法，旨在对大规模综合能源业务在承载的电力通信网上寻找到整体最优的路径.该方法以整体时延和网络负载均衡度为目标，对网络拓扑进行训练，并保存模型，然后通过迭代学习获取最优的结果.仿真结果表明，该方法找到的路径既可以保证整体时延较短，又可以保证网络的整体负载均衡.同时，在网络规模很大、业务数量很多的情况下，深度强化学习算法可有效提高计算效率.
关键词：	深度强化学习路径优化时延负载均衡
收稿时间：	2019-05-31
A Integrated Energy Service Channel Optimization Mechanism Based on Deep Reinforcement Learning

MA Qing-liu,YU Peng,WU Jia-hui,XIONG Ao,YAN Yong.A Integrated Energy Service Channel Optimization Mechanism Based on Deep Reinforcement Learning[J].Journal of Beijing University of Posts and Telecommunications,2020,43(2):87-93.

Authors:	MA Qing-liu YU Peng WU Jia-hui XIONG Ao YAN Yong

Affiliation:	1. State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China;2. State Grid Zhejiang Electric Power Company Limited, Hangzhou 310007, China

Abstract:	In order to ensure the stable operation of the integrated energy system, the integrated energy service needs to have high reliability and low risk when being carried by the communication network. According to the channel requirements of the integrated energy service, an algorithm of deep reinforcement learning is proposed, aiming to find the overall optimal path for the large-scale integrated energy service on the carried power communication network. The method that aims at the overall delay and network load balance, trains the network topology and saves the model, and then obtains the optimal result through iterative learning. The simulation results show that the routing found by this method can ensure the overall delay is short and guarantee the overall load balance of the network. At the same time, for scenarios with a large network size and a large number of services, the deep reinforcement learning algorithm can effectively improve the computational efficiency.

Keywords:	deep reinforcement learning routing optimization time delay load balancing

	点击此处可从《北京邮电大学学报》浏览原始摘要信息
	点击此处可从《北京邮电大学学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏