首页 | 官方网站   微博 | 高级检索  
     

基于线性滤波器的四旋翼无人机强化学习控制策略
引用本文:华和安,方勇纯,钱辰,张雪涛.基于线性滤波器的四旋翼无人机强化学习控制策略[J].电子与信息学报,2021,43(12):3407-3417.
作者姓名:华和安  方勇纯  钱辰  张雪涛
作者单位:南开大学人工智能学院 天津 300350;大连理工大学智能机器人实验室 大连 116024
基金项目:国家自然科学基金(61873132, 61633012)
摘    要:针对四旋翼无人机(UAVs)系统,该文提出一种基于线性降阶滤波器的深度强化学习(RL)策略,进而设计了一种新型的智能控制方法,有效地提高了旋翼无人机对外界干扰和未建模动态的鲁棒性。首先,基于线性降阶滤波技术,设计了维数更少的滤波器变量作为深度网络的输入,减小了策略的探索空间,提高了策略的探索效率。在此基础上,为了增强策略对稳态误差的感知,该文结合滤波器变量和积分项,设计集总误差作为策略的新输入,提高了旋翼无人机的定位精度。该文的新颖之处在于,首次提出一种基于线性滤波器的深度强化学习策略,有效地消除了未知干扰和未建模动态对四旋翼无人机控制系统的影响,提高了系统的定位精度。对比实验结果表明,该方法能显著地提升旋翼无人机的定位精度和对干扰的鲁棒性。

关 键 词:四旋翼无人机  智能控制  强化学习  未知干扰
收稿时间:2021-03-26

Reinforcement Learning Control Strategy of Quadrotor Unmanned Aerial Vehicles Based on Linear Filter
He’an HUA,Yongchun FANG,Chen QIAN,Xuetao ZHANG.Reinforcement Learning Control Strategy of Quadrotor Unmanned Aerial Vehicles Based on Linear Filter[J].Journal of Electronics & Information Technology,2021,43(12):3407-3417.
Authors:He’an HUA  Yongchun FANG  Chen QIAN  Xuetao ZHANG
Affiliation:1.College of Artificial Intelligence, Nankai University, Tianjin 300350, China2.Intelligent Robotic Laboratory, Dalian University of Technology, Dalian 116024, China
Abstract:In this paper, based on linear filter, a deep Reinforcement Learning (RL) strategy is proposed, then a novel intelligent control method is put forward for quadrotor Unmanned Aerial Vehicles (UAVs), which improves effectively the robustness against disturbance and unmodeled dynamics. First of all, based on linear reduced-order filtering technology, filter variables with fewer dimensions are designed as the input of the deep network, which reduces the exploration space of the strategy and improves the exploration efficiency. On this basis, to enhance strategy perception of steady-state errors, the filter variables and integration terms are combined to design the lumped error as the new network input, which improves the positioning accuracy of quadrotor UAVs. The novelty of this paper lies in that it is the first intelligent approach based on linear filtering technology, to eliminate successfully the influence of unknown disturbance and unmodeled dynamics of quadrotor UAVs, which improves the positioning accuracy. The results of comparative experiments show the effectiveness of the proposed method in terms of improving positioning accuracy and enhancing robustness.
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号