首页 | 官方网站   微博 | 高级检索  
     

基于迁移学习与权重支持向量机的图像多标签标注算法
引用本文:陈磊,李菲菲,陈虬.基于迁移学习与权重支持向量机的图像多标签标注算法[J].电子科技,2020,33(3):12-16.
作者姓名:陈磊  李菲菲  陈虬
作者单位:上海理工大学 光电信息与计算机工程学院,上海 200093
基金项目:上海市高校特聘教授(东方学者)岗位计划(ES2012XX);上海市高校特聘教授(东方学者)岗位计划(ES2014XX)
摘    要:为解决图像的多标签自动标注中标签不平衡性的问题,提出了一种基于迁移学习与权重支持向量机的图像自动标注方法。为了解决所选数据集规模较小无法训练出最优的卷积神经网络的问题,文中采用迁移学习的方法,将通过Imagenet数据集训练出的Alexnet的参数迁移到文中所用的卷积神经网络模型中,并对最后一层全连接层进行微调,利用多标签分类多合页损失函数构成多分类的支持向量机。最后,文中对低频标签进行权重排序以得到图像的多标签标注结果。在Corel-5k、Esp-Game和IAPR-TC12共3个数据集上进行了实验,权重支持向量机获得的平均召回率分别提升了10%、9%和6%,低频标签对其平均精确率均提升了12%。实验结果表明,基于迁移学习的权重支持向量机的图像多标签标注方法可在有效提高数据集的召回率的同时提升低频标签的平均精确度。

关 键 词:图像多标签标注  迁移学习  权重支持向量机  卷积神经网络  多合页损失函数  低频标签  
收稿时间:2019-01-24

Multi-label Image Annotation Algorithm Based on Transfer Learning and Weighted Ranking Support Vector Machine
CHEN Lei,LI Feifei,CHEN Qiu.Multi-label Image Annotation Algorithm Based on Transfer Learning and Weighted Ranking Support Vector Machine[J].Electronic Science and Technology,2020,33(3):12-16.
Authors:CHEN Lei  LI Feifei  CHEN Qiu
Affiliation:School of Optical-Electrical and Computer Engineering,University of Shanghai for Science and Technology,Shanghai 200093,China
Abstract:In order to resolve the class imbalance problem in multi-label image annotation, an improved annotation method based on transfer learning and WRSVM was proposed in this paper. As it was difficult to train a CNN from scratch by using small datasets, transfer learning was adopted to transfer the parameters of Alexnet trained by Imagenet dataset to the convolutional neural network model utilized in the study. Besides, the last fully connected layer was fine-tuned and the multi-label multi-hinge loss function was applied to constitute multi-class support vector machine. Finally, the weighted ranking was used to label the low-frequency labels to obtain the multi-label image annotation results. The experiments were performed on three datasets including Corel-5k, Esp-Game and IAPR-TC12. The experimental results showed that the average recall of the proposed method increased 10%, 9%, and 6%, respectively, and the average of precision increased 12% for the low-frequency labels, indicating the proposed CNN-WRSVM method could improve the average of recall and the average of precision for low-frequency labels.
Keywords:multi label image annotation  transfer learning  WRSVM  CNN  MHL  low frequency labels  
点击此处可从《电子科技》浏览原始摘要信息
点击此处可从《电子科技》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号