首页 | 官方网站   微博 | 高级检索  
     

结合时序动态图和双流卷积网络的人体行为识别
引用本文:张文强,王增强,张良. 结合时序动态图和双流卷积网络的人体行为识别[J]. 激光与光电子学进展, 2021, 58(2): 96-104
作者姓名:张文强  王增强  张良
作者单位:中国民航大学天津市智能信号与图像处理重点实验室,天津300300;中国民航大学天津市智能信号与图像处理重点实验室,天津300300;中国民航大学天津市智能信号与图像处理重点实验室,天津300300
基金项目:国家自然科学基金(61179045)。
摘    要:为了更好地对人体动作的长时时域信息进行建模,提出了一种结合时序动态图和双流卷积网络的人体行为识别算法.首先,利用双向顺序池化算法来构建时序动态图,实现视频从三维空间到二维空间的映射,用来提取动作的表观和长时时序信息;然后提出了基于inceptionV3的双流卷积网络,包含表观及长时运动流和短时运动流,分别以时序动态图和...

关 键 词:图像处理  双流卷积网络  人体行为识别  时序动态图  数据增强

Human Action Recognition Combining Sequential Dynamic Images and Two-Stream Convolutional Network
Zhang Wenqiang,Wang Zengqiang,Zhang Liang. Human Action Recognition Combining Sequential Dynamic Images and Two-Stream Convolutional Network[J]. Laser & Optoelectronics Progress, 2021, 58(2): 96-104
Authors:Zhang Wenqiang  Wang Zengqiang  Zhang Liang
Affiliation:(Tianjin Key Laboratory of Advanced Signal and Image Processing,Civil Aviation University of China,Tianjin 300300,China)
Abstract:In order to well model the long-term time-domain information of human action,a human action recognition algorithm based on sequential dynamic images and two-stream convolution network is proposed.First of all,the sequential dynamic images are constructed by using sequential pooling algorithm to realize the mapping of video from three-dimensional space to two-dimensional space,which is used to extract the apparent and long-term sequential information of actions.Then,a two-stream convolution network based on inceptionV3 is proposed,which includes apparent and long-time motion flow and short-time motion flow.The input of the network is sequential dynamic images and stacked frame sequence of optical flow,and it combines data augmentation,pre-trained model,and sparse sampling.Finally,the classification judgment scores output by each branch is fused by average pooling.Experimental results on UCF101 and HMDB51 datasets show that,compared with the traditional two-stream convolution network,this method can effective use the temporal and spatial information of the action,and the recognition rate can be improved greatly,which shows effectiveness and robustness.
Keywords:image processing  two-stream convolutional network  human action recognition  sequential dynamic images  data augmentation
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号