首页 | 官方网站   微博 | 高级检索  
     

基于卷积块注意力模块的图像描述生成模型
引用本文:余海波,陈金广. 基于卷积块注意力模块的图像描述生成模型[J]. 计算机系统应用, 2021, 30(8): 194-200. DOI: 10.15888/j.cnki.csa.008043
作者姓名:余海波  陈金广
作者单位:西安工程大学 计算机科学学院, 西安 710600;河南省电子商务大数据处理与分析重点实验室, 洛阳 471934
基金项目:河南省电子商务大数据处理与分析重点实验室开放课题(2020-KF-7);陕西省教育厅科研计划(21JP049)
摘    要:图像描述生成模型是使用自然语言描述图片的内容及其属性之间关系的算法模型.对现有模型描述质量不高、图片重要部分特征提取不足和模型过于复杂的问题进行了研究,提出了一种基于卷积块注意力机制模块(CBAM)的图像描述生成模型.该模型采用编码器-解码器结构,在特征提取网络Inception-v4中加入CBAM,并作为编码器提取图...

关 键 词:图像描述生成  卷积块注意力模块  卷积神经网络  长短期记忆网络
收稿时间:2020-11-24
修稿时间:2020-12-22

Image Caption Generation Model Based on Convolutional Block Attention Module
YU Hai-Bo,CHEN Jin-Guang. Image Caption Generation Model Based on Convolutional Block Attention Module[J]. Computer Systems& Applications, 2021, 30(8): 194-200. DOI: 10.15888/j.cnki.csa.008043
Authors:YU Hai-Bo  CHEN Jin-Guang
Abstract:The image caption generation model uses natural language to describe the content of images and the relationship between attributes. In the existing models, there are problems of low description quality, insufficient feature extraction of important parts of images, and high complexity. Therefore, this study proposes an image caption generation model based on a Convolutional Block Attention Module (CBAM), which has an encoder-decoder structure. CBAM is added into the feature extraction network Inception-v4 and as an encoder, extracts the important feature information of the images. The information is then sent into the Long Short-Term Memory (LSTM) of the decoder to generate the caption of the corresponding pictures. The MSCOCO2014 data set is applied to training and testing, and multiple evaluation criteria are used to evaluate the accuracy of the model. The experimental results show that the improved model has a higher evaluation criterion score than other models, and Model2 can better extract image features and generate a more accurate description.
Keywords:image caption generation  Convolutional Block Attention Module (CBAM)  Convolution Neural Network (CNN)  Long Short-Term Memory (LSTM)
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号