基于特征关联的视频中群体人物行为语义抽取 Crowd Activity Semantic Extraction in Video Based on Feature Association期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于特征关联的视频中群体人物行为语义抽取

引用本文：	掌静,陈志,岳文静.基于特征关联的视频中群体人物行为语义抽取[J].计算机技术与发展,2020(4):26-30.

作者姓名：	掌静陈志岳文静

作者单位：	南京邮电大学计算机学院;南京邮电大学通信与信息工程学院

基金项目：	国家自然科学基金(61501253);江苏省基础研究计划(自然科学基金)项目(BK20151506);江苏省“六大人才高峰”第十一批高层次人才选拔培养资助项目(XXRJ-009);江苏省重点研发计划(社会发展)项目(BE2016778,BE2019739);南京邮电大学科研项目(NY217054);江苏省研究生科研与实践创新计划项目(KYCX17_0799,SJCX18_0296).

摘要：	为解决视频中群体人物行为语义抽取中群体人物相互遮挡、追踪困难等问题,构建一种基于特征关联的视频中群体行为人物语义抽取算法。该算法首先对视频帧提取多尺度融合特征图,通过特征图检测视频帧中可能存在的人物,利用去重算法筛除检测到的重复人物,精准定位群体人物边界框;接着预测群体人物特征掩码,通过比对相邻视频帧人物特征掩码的差异度追踪群体人物的运动轨迹;最后结合群体人物的运动轨迹推理每帧视频帧的群体人物行为语义,根据群体人物行为特点抽取视频群体人物行为语义。实验结果表明,该算法能够准确提取、定位群体人物的动态线索,解决群体人物复杂时空关系导致的语义抽取低效问题,有效地提高群体人物语义抽取的准确率和鲁棒性。
关键词：	群体人物行为语义抽取目标检测人物追踪特征掩码运动轨迹
Crowd Activity Semantic Extraction in Video Based on Feature Association

ZHANG Jing,CHEN Zhi,YUE Wen-jing.Crowd Activity Semantic Extraction in Video Based on Feature Association[J].Computer Technology and Development,2020(4):26-30.

Authors:	ZHANG Jing CHEN Zhi YUE Wen-jing

Affiliation:	(School of Computer,Nanjing University of Posts and Telecommunications,Nanjing 210023,China;School of Communication and Information Technology,Nanjing University of Posts and Telecommunications,Nanjing 210003,China)

Abstract:	In order to solve the problems of mutual occlusion and tracking of group characters for crowd activity semantic extraction in video,a crowd activity semantic extraction algorithm in video is presented based on feature association. The proposed algorithm first extracts the multi-scale fusion feature map of the video frame,detects the possible human in the video frame through the feature map,uses the deduplication algorithm to filter out the detected duplicate human,and accurately locates the target group’s bounding boxes. Then it predicts the feature masks of group characters. The motion trajectory of group characters is tracked by comparing the difference degree of the character mask of the adjacent video frames. Finally it infers crowd activity semantics of each frame according to the motion trajectory and combines the characteristics of crowd activity to exact crowd activity semantics in video. The experiment shows that the proposed algorithm can accurately extract and locate the dynamic clues of group characters,solve the inefficiency of semantic extraction caused by complex spatial-temporal relationship of group characters,thus effectively improving the accuracy and robustness of crowd activity semantic extraction.

Keywords:	crowd activity semantic extraction target detection human tracking feature mask motion trajectory
本文献已被维普等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏