首页 | 官方网站   微博 | 高级检索  
     

N步长距离视觉语言模型的图像分类方法
引用本文:王挺进,赵永威,李弼程.N步长距离视觉语言模型的图像分类方法[J].信息工程大学学报,2014,15(4):453-458.
作者姓名:王挺进  赵永威  李弼程
作者单位:信息工程大学,河南郑州450001
基金项目:国家自然科学基金资助项目(60872142)
摘    要:传统的视觉语言模型(visual language model,VLM)只考虑了相邻视觉单词之间的空间位置关系,不考虑不相邻视觉单词之间的贡献.针对传统视觉语言模型的不足,提出了一种N步长距离视觉语言模型,并将其用于图像分类.该方法首先给出了相隔N个视觉单词的二元依赖关系,然后训练长距离视觉语言模型,最后通过不同的权重分配方式进行融合,得到3种不同的图像分类方法.实验比较了不同参数和分类方法对图像分类的影响,结果表明,文章方法能在一定程度上改善视觉语言模型对图像表达的准确度,进而提高图像分类的准确率.

关 键 词:视觉语言模型  图像分类  空间位置信息  长距离

Image Categorization of the Skip-N Long Distance Visual Language Model
WANG Ting-jin,ZHAO Yong-wei,LI Bi-cheng.Image Categorization of the Skip-N Long Distance Visual Language Model[J].Journal of Information Engineering University,2014,15(4):453-458.
Authors:WANG Ting-jin  ZHAO Yong-wei  LI Bi-cheng
Affiliation:( Information Engineering University, Zhengzhou 450001, China)
Abstract:Traditional visual language models only considers the spatial relationship between adja- cent visual words, and ignores the non-adjacent pairs. Aiming at the shortcomings of traditional visual language models, this paper presents a improved skip-N long distance visual language model which can get more spatial information between words. It also describes the relations by which the words are separated by compared words, and trains the model. Finally, three classification methods are experimentally N by different weight allocations and the effects of different parameters. The resuits show that the skip-N long distance visual language model can improve the accuracy of image classification to certain extent.
Keywords:visual language model  image classification  spatial information  long distance
本文献已被 维普 等数据库收录!
点击此处可从《信息工程大学学报》浏览原始摘要信息
点击此处可从《信息工程大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号