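Marine Unmanned Rescue Technology Based on the DDPG Algorithm
Cite this article: Zheng Shuai, Jia Baozhu, Zhang Kunyang, Zhang Cheng. Marine Unmanned Rescue Technology Based on the DDPG Algorithm[J]. Computer Applications and Software, 2021, 38(4): 159-164, 255.
Authors: Zheng Shuai  Jia Baozhu  Zhang Kunyang  Zhang Cheng
Affiliations: Marine Engineering College, Dalian Maritime University, Dalian 116026, Liaoning; Marine Engineering College, Dalian Maritime University, Dalian 116026, Liaoning; College of Maritime, Guangdong Ocean University, Zhanjiang 524088, Guangdong
Abstract: To address the drift of distress targets and the need to approach them quickly during unmanned rescue at sea, a target tracking algorithm based on deep reinforcement learning is proposed. Through interaction with the environment, the unmanned search-and-rescue vessel learns the optimal driving decisions for autonomously tracking a drifting distress target. With the assistance of SART, autonomous learning enables the vessel to reach the drifting distress target in the shortest time. A three-dimensional simulation environment is built in the Gazebo physics simulator, and simulation experiments with a linear drift trajectory and an irregular drift trajectory are designed on ROS. Repeated autonomous learning and training runs verify the effectiveness of the proposed method.

Keywords: Deep reinforcement learning  Unmanned surface vehicle  Maritime rescue  Target tracking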

MARINE UNMANNED RESCUE TECHNOLOGY BASED ON DDPG ALGORITHM
Zheng Shuai,Jia Baozhu,Zhang Kunyang,Zhang Cheng.MARINE UNMANNED RESCUE TECHNOLOGY BASED ON DDPG ALGORITHM[J].Computer Applications and Software,2021,38(4):159-164,255.
Authors:Zheng Shuai  Jia Baozhu  Zhang Kunyang  Zhang Cheng
Affiliation:(Marine Engineering College,Dalian Maritime University,Dalian 116026,Liaoning,China;College of Maritime,Guangdong Ocean University,Zhanjiang 524088,Guangdong,China)
Abstract:To address the drift of distress targets and the need to approach them quickly during unmanned rescue at sea, a target tracking algorithm based on deep reinforcement learning is proposed. It enables an unmanned rescue vessel to learn, through interaction with the environment, the optimal driving decisions for autonomously tracking a drifting distress target. With the assistance of SART, self-learning allows the vessel to reach the drifting distress target in the shortest time. A three-dimensional simulation environment is established in the Gazebo physics simulator, and simulation experiments with a linear drift trajectory and an irregular drift trajectory are designed on ROS. The effectiveness of the proposed method is verified through repeated self-learning and training runs.
Keywords:Deep reinforcement learning  Unmanned surface vehicle  Maritime rescue  Target tracking
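
The abstract does not give the update rules or the reward design. As a minimal sketch of the two standard DDPG ingredients (the bootstrapped critic target and the soft target-network update), together with one possible distance-based tracking reward, the following may help. It is not the authors' implementation: the function names, the default values of `gamma` and `tau`, and the reward shaping are all assumptions made for illustration.

```python
import numpy as np


def td_target(reward, next_q, done, gamma=0.99):
    """Critic target y = r + gamma * (1 - done) * Q'(s', mu'(s')).

    next_q is assumed to come from the target critic evaluated at the
    target actor's action; gamma=0.99 is an illustrative default.
    """
    return reward + gamma * (1.0 - done) * next_q


def soft_update(target_params, online_params, tau=0.005):
    """Polyak averaging: theta_target <- tau * theta + (1 - tau) * theta_target."""
    return [tau * w + (1.0 - tau) * w_t
            for w, w_t in zip(online_params, target_params)]


def tracking_reward(ship_xy, target_xy, prev_distance):
    """Hypothetical shaped reward: positive when the vessel closes the gap.

    Returns (reward, new_distance); the paper's actual reward is not
    stated in the abstract.
    """
    offset = np.asarray(target_xy, dtype=float) - np.asarray(ship_xy, dtype=float)
    distance = float(np.linalg.norm(offset))
    return prev_distance - distance, distance
```

In a full agent these pieces would sit inside a training loop with a replay buffer and actor/critic networks, none of which is sketched here.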
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).