首页 | 官方网站   微博 | 高级检索  
     

基于深度时空Q网络的机器人疏散人群算法
引用本文:谭嵋,刘士豪,周婉,陈国文,胡学敏.基于深度时空Q网络的机器人疏散人群算法[J].计算机工程,2021,47(6):305-311.
作者姓名:谭嵋  刘士豪  周婉  陈国文  胡学敏
作者单位:湖北大学 计算机与信息工程学院, 武汉 430062
摘    要:针对目前人群疏散方法中机器人灵活性低、场景适应性有限与疏散效率低的问题,提出一种基于深度强化学习的机器人疏散人群算法。利用人机社会力模型模拟突发事件发生时的人群疏散状态,设计一种卷积神经网络结构提取人群疏散场景中复杂的空间特征,将传统的深度Q网络与长短期记忆网络相结合,解决机器人在学习中无法记忆长期时间信息的问题。实验结果表明,与现有基于人机社会力模型的机器人疏散人群方法相比,该算法能够提高在不同仿真场景中机器人疏散人群的效率,从而验证了算法的有效性。

关 键 词:深度时空Q网络  长短期记忆网络  人群疏散  机器人  深度强化学习  
收稿时间:2020-03-27
修稿时间:2020-05-12

Robot-Assisted Crowd Evacuation Algorithm Based on Deep Spatio-Temporal Q-network
TAN Mei,LIU Shihao,ZHOU Wan,CHEN Guowen,HU Xuemin.Robot-Assisted Crowd Evacuation Algorithm Based on Deep Spatio-Temporal Q-network[J].Computer Engineering,2021,47(6):305-311.
Authors:TAN Mei  LIU Shihao  ZHOU Wan  CHEN Guowen  HU Xuemin
Affiliation:School of Computer Science and Information Engineering, Hubei University, Wuhan 430062, China
Abstract:The application of robots to crowd evacuation is limited by the low flexibility, low scenario adaptability, and low evacuation efficiency of robots.To address the problem, this paper proposes an algorithm for robot-assisted crowd evacuation based on deep reinforcement learning.The human-machine social force model is used to simulate the crowd evacuation state when an emergency occurs, and the complex spatial features in crowd evacuation scenarios are extracted by a designed convolutional neural network structure.The traditional deep Q-network is combined with Long Short-Term Memory(LSTM) network to solve the problem that robots cannot remember long-term temporal information in the learning process.Experimental results show that compared with the existing robot-assisted evacuation methods based on the human-machine social force model, the proposed algorithm improves the efficiency of robot-assisted evacuation in different simulation scenarios, which verifies its validity and feasibility.
Keywords:Deep Spatio-Temporal Q-Network(DSTQN)  Long Short-Term Memory(LSTM)  crowd evacuation  robot  deep reinforcement learning  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号