首页 | 官方网站   微博 | 高级检索  
     

基于自适应多尺度图卷积网络的多标签图像识别
引用本文:王雪松,荣小龙,程玉虎,陈正升. 基于自适应多尺度图卷积网络的多标签图像识别[J]. 控制与决策, 2022, 37(7): 1737-1744
作者姓名:王雪松  荣小龙  程玉虎  陈正升
作者单位:中国矿业大学地下空间智能控制教育部工程研究中心,江苏徐州221116;中国矿业大学信息与控制工程学院,江苏徐州221116
基金项目:国家自然科学基金项目(61772532,61976215).
摘    要:利用一阶谱图卷积探索类别标签间关系是目前多标签图像识别常用的手段,但是,较多的图卷积层数易出现过度平滑现象,使得该方法存在局限性.为此,提出一种基于自适应多尺度图卷积网络的多标签图像识别方法,主要思路为:采用块Krylov子空间形式的谱图卷积来挖掘类别标签间的相关性,在每个图卷积层中拼接多尺度信息并扩展到深层结构,并在自适应标签关系图模块所构建的关系图上学习分类器,从而更加有效地进行多标签图像识别.通过两个公开数据集PASCAL VOC 2007和MS-COCO 2014上的实验结果验证了所提出方法的有效性.

关 键 词:自适应关系图  多尺度图卷积网络  多标签图像识别  块Krylov子空间

Multi-label image recognition based on adaptive multi-scale graph convolutional network
WANG Xue-song,RONG Xiao-long,CHENG Yu-hu,CHEN Zheng-sheng. Multi-label image recognition based on adaptive multi-scale graph convolutional network[J]. Control and Decision, 2022, 37(7): 1737-1744
Authors:WANG Xue-song  RONG Xiao-long  CHENG Yu-hu  CHEN Zheng-sheng
Affiliation:1. Engineering Research Center of Ministry of Education for Intelligent Control of Underground Space,China University of Mining and Technology,Xuzhou 221116,China;2. School of Information and Control Engineering,China University of Mining and Technology,Xuzhou 221116,China
Abstract:Utilizing the first-order spectral graph convolution to explore the correlation between category labels is a common method for multi-label image recognition. However, more graph convolution layers are prone to over-smoothing, which causes some limitations for the method. With respect to the aforementioned problem, a multi-label image recognition method based on the adaptive multi-scale graph convolutional network is proposed. The main idea is as follows: the spectral graph convolution in the form of block Krylov subspace is employed to mine the correlation between category labels, and the multi-scale information eixsted in the convolutional layer is spliced and extended to the deep structure. At the same time, the classifier is learned on the relation graph constructed by the adaptive label relation graph module, accordingly the multi-label image recognition is performed more effectively. Experimental results on two public datasets including PASCAL VOC 2007 and MS-COCO 2014 verify the effectiveness of the proposed method.
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《控制与决策》浏览原始摘要信息
点击此处可从《控制与决策》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号