首页 | 官方网站   微博 | 高级检索  
     

基于双语动态系统包的视角无关的人体行为识别
引用本文:杨顺卿,陈昌红.基于双语动态系统包的视角无关的人体行为识别[J].南京邮电大学学报(自然科学版),2014(1):103-110.
作者姓名:杨顺卿  陈昌红
作者单位:南京邮电大学江苏省图像处理与图像通信重点实验室,江苏南京210003
基金项目:国家自然科学基金(61172118,61001152)、江苏省自然科学基金(BK2010523)、江苏省属高校自然科学研究项目(11KJB510012)和南京邮电大学校科研基金(NY210073)资助项目
摘    要:视角无关的人体行为识别是计算机视觉领域研究的热点和难点之一.现有的视角无关的行为识别算法的识别率随着角度的改变差异很大,尤其与俯角相关的识别效果还不够理想.提出了一种基于双语动态系统包的视角无关的人体行为识别方法.首先结合兴趣点检测器和密集采样算法提取视频帧中的时空立方体并对每个时空立方体建立线性动态系统(LDS);其次对LDSs进行非线性降维聚类形成码本,并根据LDSs在码本中的分布及权重用一个动态系统包(bag of dynamical systems)来表示每个动作样本;最后同时对两个视角下的BoDS采用K-奇异值分解(K-SVD)算法得到一对可迁移字典对,然后根据这对字典对采用正交匹配追踪(OMP)算法得到两个视角下每个动作的稀疏表示.在IXMAS多视角数据库的实验结果表明了文中算法的稳定性和有效性.

关 键 词:视角无关动作识别  迁移学习  双语动态系统包

View-Invariant Action Recognition Based on Bilingual Bag of Dynamic Systems
YANG Shun-qing,CHEN Chang-hong.View-Invariant Action Recognition Based on Bilingual Bag of Dynamic Systems[J].Journal of Nanjing University of Posts and Telecommunications,2014(1):103-110.
Authors:YANG Shun-qing  CHEN Chang-hong
Affiliation:1.Jiangsu Provincial Key Lab of Image Processing and Image Communication, Nanjing University of Posts and Telecommunications, Nanjing 210003, China;)
Abstract:View-invariant action recognition is one of the difficult and hot spots in computer vision.The recognition rates of existing algorithms vary with the variant viewpoints,especially the recognition effect on the top view is not ideal.A new framework is proposed for cross-view action recognition.Spatio-temporal patches are extracted as a low-level feature with the combination of interest point detection and dense sampling algorithm,and each patch is represented as a linear dynamic system (LDS).The codebook is formed through nonlinear reduced dimension aggregation of LDSs and,as the middle-level representation,bag of dynamic system (BoDS) is formed based on the distribution and weight of LDSs within the codebook.Using K-singular value decomposition (K-SVD) algorithm,a BoDS pair corresponding to two viewpoints is transformed into a transferable dictionary pair.An orthogonal matching pursuit (OMP) algorithm is applied to the dictionary pair to generate the sparse representation of the action,thus it ensures that the same action from the two views with the same high-level representation.Experimental results on the Ⅸ-MAS multi-view dataset show the effectiveness and the stabilization of the proposed algorithm.
Keywords:view-invariant action recognition  transfer leaning  bilingual bag of dynamic systems
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号