首页 | 官方网站   微博 | 高级检索  
     

基于深度强化学习的无人机自组网路由算法
引用本文:乔冠华,吴麒,王翔,潘俊男,张易新,丁建.基于深度强化学习的无人机自组网路由算法[J].重庆邮电大学学报(自然科学版),2023,35(2):335-342.
作者姓名:乔冠华  吴麒  王翔  潘俊男  张易新  丁建
作者单位:中国西南电子技术研究所 成都 610036;重庆邮电大学 通信与信息工程学院, 重庆 400065
基金项目:中国电子科技集团公司第十研究所基金(SSJ-2125-XY-002)
摘    要:针对无人机自组网节点密度大、拓扑变换频繁,导致移动自组网复杂的问题,提出了一种基于深度强化学习(deep-reinforcement learning, DRL)的分布式无人机自组网路由算法。利用DRL感知学习无人机特征,使节点不断与环境交互、探索学习最优行动(路由)策略;通过存储经验知识,维护端到端路由,赋予无人机网络智能化重构和快速修复的能力,从而提高路径的稳定性,降低路由建立和维护开销,增强网络的鲁棒性能。仿真结果表明,提出的算法具有较好的收敛性能;在路由修复时间、端到端时延,以及网络适应性、扩展性方面都优于传统的路由算法。

关 键 词:无人机自组网  深度强化学习  路由算法  路由修复
收稿时间:2022/11/7 0:00:00
修稿时间:2023/3/20 0:00:00

Reinforcement learning based routing algorithm for UAV ad-hoc networks
QIAO Guanhu,WU Qi,WANG Xiang,PAN Junnan,ZHANG Yixin,DING Jian.Reinforcement learning based routing algorithm for UAV ad-hoc networks[J].Journal of Chongqing University of Posts and Telecommunications,2023,35(2):335-342.
Authors:QIAO Guanhu  WU Qi  WANG Xiang  PAN Junnan  ZHANG Yixin  DING Jian
Affiliation:China Southwest Electronic Technology Research Institute, Chengdu 610036, P.R. China;School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, P.R. China
Abstract:Aiming at the problem that the density of nodes in UAV ad hoc network is large and the topology transformation is frequent, which leads to the complexity of mobile ad hoc network, a distributed UAV ad hoc network routing algorithm based on deep reinforcement learning (DRL) is proposed. Using DRL perception to learn the characteristics of UAV, the node can continuously interact with the environment and explore and learn the optimal action (routing) strategy. By storing experience knowledge and maintaining end-to-end routes, the UAV network is endowed with the ability of intelligent reconfiguration and rapid repair, thus improving the stability of the path, reducing the cost of route establishment and maintenance, and enhancing the robustness of the network. Simulation results show that the proposed algorithm has good convergence performance. It is superior to traditional routing algorithms in routing repair time, end-to-end delay, network adaptability and scalability.
Keywords:unmanned aerial vehicle ad-hoc network  deep-reinforcement learning  routing algorithm  route repair
点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆邮电大学学报(自然科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号