首页 | 官方网站   微博 | 高级检索  
     

基于策略梯度的目标跟踪方法
引用本文:王康豪,殷海兵,黄晓峰.基于策略梯度的目标跟踪方法[J].浙江大学学报(自然科学版 ),2020,54(10):1923-1928.
作者姓名:王康豪  殷海兵  黄晓峰
作者单位:杭州电子科技大学 通信工程学院,浙江 杭州 310018
基金项目:国家自然科学基金资助项目(61572449,61972123,61901150);科技部重点研发课题资助项目(2018YFC0830106);浙江省自然科学基金资助项目(Q19F010030)
摘    要:针对目标跟踪过程中的遮挡、形变和快速运动等问题,提出基于策略梯度的目标跟踪方法. 该方法利用策略梯度算法训练策略网络. 该策略网络能够根据当前跟踪结果的可靠性进行动作决策,以避免错误的模板更新或者重新检测丢失的目标. 在决策过程中,通过计算加权置信度差值分析当前跟踪结果的鲁棒性和准确性,使得策略网络能够更准确地评估跟踪结果. 在重检测过程中,提出有效的重检测方法,对大量的搜索区域进行过滤,大大提高了搜索效率,利用决策模块检验重检测结果,确保重检测结果的准确性. 利用提出的算法在OTB数据集及LaSOT数据集上进行评估. 实验结果表明,提出的跟踪算法在原算法的基础上提高了2.5%~4.0%的性能.

关 键 词:目标跟踪  决策  策略梯度  重检测  模板更新  

Visual object tracking based on policy gradient
Kang-hao WANG,Hai-bing YIN,Xiao-feng HUANG.Visual object tracking based on policy gradient[J].Journal of Zhejiang University(Engineering Science),2020,54(10):1923-1928.
Authors:Kang-hao WANG  Hai-bing YIN  Xiao-feng HUANG
Abstract:An object tracking method based on policy gradient was proposed aiming at the problems of occlusion, deformation and fast motion in the process of object tracking. The policy gradient algorithm was used to train the policy network. The policy network can make action decisions founded on the reliability of current tracking results to avoid the incorrect template update or re-detect the missing targets. During the decision-making process, the robustness and accuracy of the current tracking result were both analyzed by calculating the weighted confidence margin, which helped the policy network to evaluate the tracking results more accurately. During the re-detection process, an efficient re-detection method was proposed to filter a large number of searching areas, which greatly improved the search efficiency. The decision-making module was utilized to examine the re-detected result, which ensured the accuracy of the re-detected results. The proposed algorithm was evaluated on OTB dataset and LaSOT dataset. The experimental results show that the proposed tracking algorithm improves performance by 2.5%-4.0% based on the original algorithm.
Keywords:visual object tracking  decision making  policy gradient  re-detection  template update  
本文献已被 CNKI 等数据库收录!
点击此处可从《浙江大学学报(自然科学版 )》浏览原始摘要信息
点击此处可从《浙江大学学报(自然科学版 )》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号