首页 | 官方网站   微博 | 高级检索  
     

跨层融合与多模型投票的动作识别
引用本文:罗会兰,卢飞,严源.跨层融合与多模型投票的动作识别[J].电子与信息学报,2019,41(3):649-655.
作者姓名:罗会兰  卢飞  严源
作者单位:江西理工大学信息工程学院 赣州 341000;江西理工大学信息工程学院 赣州 341000;江西理工大学信息工程学院 赣州 341000
基金项目:国家自然科学基金;国家自然科学基金;江西省青年科学家培养项目;江西省自然科学基金
摘    要:针对动作特征在卷积神经网络模型传输时的损失问题以及网络模型过拟合的问题,该文提出一种跨层融合模型和多个模型投票的动作识别方法。在预处理阶段,借助排序池化的方法聚集视频中的运动信息,生成近似动态图像。在全连接层前设置对特征信息进行水平翻转结构,构成无融合模型。在无融合模型的基础上添加第2层的输出特征与第5层的输出特征融合结构,构造成跨层融合模型。训练时,对无融合模型和跨层融合模型两种基本模型采用3种数据划分方式以及两种生成近似动态图像顺序进行训练,得到多个不同的分类器。测试时使用多个分类器进行预测,对它们得到的结果进行投票集成,作为最终分类结果。在UCF101数据集上,提出的无融合模型和跨层融合模型的识别方法与动态图像网络模型的方法相比,识别率有较大提高;多模型投票的识别方法能有效缓解模型的过拟合现象,增加算法的鲁棒性,得到更好的平均性能。

关 键 词:动作识别    跨层融合    多模型投票    近似动态图像    水平翻转
收稿时间:2018-04-24

Action Recognition Based on Multi-model Voting with Cross Layer Fusion
Huilan LUO,Fei LU,Yuan YAN.Action Recognition Based on Multi-model Voting with Cross Layer Fusion[J].Journal of Electronics & Information Technology,2019,41(3):649-655.
Authors:Huilan LUO  Fei LU  Yuan YAN
Affiliation:School of Information Engineering, Jiangxi University of Science and Technology, Ganzhou 341000, China
Abstract:To solve the problem of the loss in the motion features during the transmission of deep convolution neural networks and the overfitting of the network model, a cross layer fusion model and a multi-model voting action recognition method are proposed. In the preprocessing stage, the motion information in a video is gathered by the rank pooling method to form approximate dynamic images. Two basic models are presented. One model with two horizontally flipping layers is called " non-fusion model”, and then a fusion structure of the second layer and the fifth layer is added to form a new model named " cross layer fusion model”. The two basic models of " non-fusion model” and " cross layer fusion model” are trained respectively on three different data partitions. The positive and negative sequences of each video are used to generate two approximate dynamic images. So many different classifiers can be obtained by training the two proposed models using different training approximate dynamic images. In testing, the final classification results can be obtained by averaging the results of all these classifiers. Compared with the dynamic image network model, the recognition rate of the non-fusion model and the cross layer fusion model is greatly improved on the UCF101 dataset. The multi-model voting method can effectively alleviate the overfitting of the model, increase the robustness of the algorithm and get better average performance.
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号