首页 | 官方网站   微博 | 高级检索  
     

基于改进深度强化学习的三维环境路径规划
引用本文:封硕,舒红,谢步庆.基于改进深度强化学习的三维环境路径规划[J].计算机应用与软件,2021,38(1):250-255.
作者姓名:封硕  舒红  谢步庆
作者单位:长安大学工程机械学院 陕西 西安 710064;长安大学理学院 陕西 西安 710064;长安大学工程机械学院 陕西 西安 710064
基金项目:陕西省自然科学基金项目;陕西省科技计划项目自然科学基础研究计划一般项目
摘    要:提出一种改进深度强化学习算法(NDQN),解决传统Q-learning算法处理复杂地形中移动机器人路径规划时面临的维数灾难。提出一种将深度学习融于Q-learning框架中,以网络输出代替Q值表的深度强化学习方法。针对深度Q网络存在严重的过估计问题,利用更正函数对深度Q网络中的评价函数进行改进。将改进深度强化学习算法与DQN算法在同样的三维环境下进行仿真实验,从最优路径长度、损失函数值、得到稳定的奖励值、收敛速度等方面进行对比,改进深度强化学习算法比DQN算法得到很大的改善,说明改进的算法比DQN算法寻得了更优的策略。

关 键 词:深度Q网络  强化学习  过估计问题  三维环境的路径规划

3D ENVIRONMENT PATH PLANNING BASED ON IMPROVED DEEP REINFORCEMENT LEARNING
Feng Shuo,Shu Hong,Xie Buqing.3D ENVIRONMENT PATH PLANNING BASED ON IMPROVED DEEP REINFORCEMENT LEARNING[J].Computer Applications and Software,2021,38(1):250-255.
Authors:Feng Shuo  Shu Hong  Xie Buqing
Affiliation:(School of Construction Machinery,Chang’an University,Xi’an 710064,Shaanxi,China;School of Sciences,Chang’an University,Xi’an 710064,Shaanxi,China)
Abstract:This paper proposes an improved deep reinforcement learning algorithm(NDQN)to solve the dimensionality disaster faced by the traditional Q-learning algorithm in dealing with mobile robot path planning in complex terrain.A deep reinforcement learning method that incorporates deep learning into the Q-learning framework and replaces the Q-value table with network output was proposed;due to the serious overestimation problem for the depth Q network,a correction function was used to improve the evaluation function in the deep Q network.The improved deep reinforcement learning algorithm and DQN algorithm were tested in the same 3 D environment.The optimal path length,the loss function value,the stable reward value and the convergence speed were compared.The improved deep reinforcement learning algorithm is much better than the DQN algorithm.It shows that the improved algorithm finds a better strategy than the DQN algorithm.
Keywords:Depth Q network  Reinforcement learning  Overestimation problem  Path planning for 3D environment
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号