首页 | 官方网站   微博 | 高级检索  
     

面向语义分割模型的外接多尺度投票网络
引用本文:朱杰,龚声蓉,周立凡,徐少杰,钟珊.面向语义分割模型的外接多尺度投票网络[J].计算机工程,2022,48(10):279-287.
作者姓名:朱杰  龚声蓉  周立凡  徐少杰  钟珊
作者单位:1. 东北石油大学 计算机与信息技术学院, 黑龙江 大庆 163318;2. 常熟理工学院 计算机科学与工程学院, 江苏 苏州 215000
基金项目:国家自然科学基金(61972059,42071438);江苏省自然科学基金(BK20191474)。
摘    要:针对语义分割领域中多尺度共享网络训练复杂度高,以及网络在小目标、长条状目标、目标边缘处拟合效果不佳的问题,提出一种新型外接多尺度投票网络。通过投票网络融合各尺度分割结果,降低网络训练复杂度,并将共享网络中的分割网络与各尺度注意力头剥离开,仅训练各尺度注意力头,以便于网络收敛。在投票网络的结构设计中,使用多类别投票方法扩大投票空间,通过融入混合池化模块聚合近程与远程权值,扩大网络感受野,缓解权值图中长条状目标拟合间断与缺失的问题。在此基础上引入类内、类间投票注意力模块获取权值及类间关系,并采用不规则卷积,改善投票权值图的边缘拟合效果。在Cityscapes数据集上的实验结果表明,相比FCN、PSPNet、DeepLabv3+分割网络,该网络的平均交并比分别提升了0.92、0.88、0.80个百分点,与共享网络相比,其训练复杂度更低,精度更高。

关 键 词:语义分割  多尺度投票网络  平均交并比  不规则卷积  目标边缘  
收稿时间:2021-10-13
修稿时间:2021-12-02

External Multi-Scale Voting Network for Semantic Segmentation Model
ZHU Jie,GONG Shengrong,ZHOU Lifan,XU Shaojie,ZHONG Shan.External Multi-Scale Voting Network for Semantic Segmentation Model[J].Computer Engineering,2022,48(10):279-287.
Authors:ZHU Jie  GONG Shengrong  ZHOU Lifan  XU Shaojie  ZHONG Shan
Affiliation:1. School of Computer & Information Technology, Northeast Petroleum University, Daqing, Heilongjiang 163318, China;2. School of Computer Science and Engineering, Changshu Institute of Technology, Suzhou, Jiangsu 215000, China
Abstract:To address the high training complexity of multi-scale shared networks in semantic segmentation and the unsatisfactory fitting effect of multi-scale attention maps for small targets, long targets, and target edges, a new external multi-scale voting network is proposed.The segmentation results of each scale are combined via the voting network to reduce the training complexity.Moreover, the segmentation network in the shared network is separated from the attention heads of each scale, and only the attention heads of each scale are trained to facilitate the convergence of the network.The multi-category voting method is used to expand the voting space.By integrating the hybrid pooling module to aggregate short and long-range weights, the receptive field of the network is expanded, and the discontinuity and missed fitting of long bar-shaped targets in the weight map are alleviated.Subsequently, intra-class and inter-class voting attention modules are introduced to obtain the weights and inter-class relations, and irregular convolution is used to improve the edge-fitting effect of the voting weight map.Experimental results on the Cityscapes dataset show that the mean Intersection over Union (mIoU) of the proposed network is higher than those of FCN, PSPNet, and DeepLabv3+ segmentation networks by 0.92, 0.88, and 0.80 percentage points, respectively.Compared with a shared network, the proposed multiscale voting network offers higher accuracy and requires less complex training.
Keywords:semantic segmentaton  multi-scale voting network  mean Intersection over Union(mIoU)  irregular convolution  target edge  
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号