首页 | 官方网站   微博 | 高级检索  
     

基于深度残差学习的自动驾驶道路场景理解
引用本文:宋锐,施智平,渠瀛,邵振洲,关永.基于深度残差学习的自动驾驶道路场景理解[J].计算机应用研究,2019,36(9).
作者姓名:宋锐  施智平  渠瀛  邵振洲  关永
作者单位:首都师范大学信息工程学院 成像技术北京市高精尖创新中心,北京,100048;田纳西大学诺克斯维尔分校工程学院,田纳西美国37996;首都师范大学信息工程学院 轻型工业机器人与安全验证北京市重点实验室,北京,100048
基金项目:国家自然科学基金资助项目(61702348,61772351,61572331,61472468,61602325);国家科技支撑计划资助项目(2015BAF13B01);国际科技合作计划项目(2011DFG13000);北京市科委项目(Z141100002014001);北京市属高等学校创新团队建设与教师职业发展计划项目(IDHT20150507)
摘    要:随着道路场景理解技术的快速发展,自主驾驶领域取得了长足的进步。在相关任务中,包括道路分割、分类和车辆检测的实时性和准确性是安全性的一个关键问题。为此,提出了一个具有编/解码器网络结构的基于深度残差学习的方法。一方面,编码器网络结构使用不同层次的残差网络来提取高维中的抽象特征,这些特征在接下来的三个任务中共享使用;另一方面,解码器网络结构采用一种子任务的并行计算机制,即道路分割、车辆检测和道路分类任务同时执行。此外,全卷积神经网络用于对提取的图像特征进行上采样以解决道路分割问题。最终,实验结果表明在保证高精度的前提下处理帧率可达到15 fps以上。

关 键 词:道路场景理解  深度残差学习  编/解码器结构  全卷积网络
收稿时间:2018/3/30 0:00:00
修稿时间:2019/8/8 0:00:00

Road scene understanding for autonomous driving via deep residual learning
Rui Song and Zhiping Shi.Road scene understanding for autonomous driving via deep residual learning[J].Application Research of Computers,2019,36(9).
Authors:Rui Song and Zhiping Shi
Affiliation:Capital Normal University,
Abstract:It is making great progress in the autonomous driving field with the rapid development of road scene understanding techniques. The safety is a concerning issue with respect to the real-time and accurate performance in the related tasks which contains the road segmentation, road classification and vehicle detection. To this end, this paper proposed an approach based on deep residual learning with an encoder-decoder network structure. On the one hand, the encoder network structure used different layers of residual networks to extract the abstract features in the high dimension, which shared in the next three tasks. On the other hand, the decoder network structure adopted a mechanism of parallel computing for sub-tasks, i. e., executed the road segmentation, vehicle detection and road classification tasks simultaneously. Additionally, it used the fully convolutional networks to upsample the extracted features to specifically solve the problem of road segmentation. At last, the experimental results show that the processing rate can effectively reach more than 15 fps with the high accuracy guaranteed.
Keywords:road scene understanding  deep residual learning  encoder-decoder structure  fully convolutional networks
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号