基于策略梯度的目标跟踪方法 Visual object tracking based on policy gradient期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于策略梯度的目标跟踪方法

引用本文：	王康豪,殷海兵,黄晓峰.基于策略梯度的目标跟踪方法[J].浙江大学学报(自然科学版 ),2020,54(10):1923-1928.

作者姓名：	王康豪殷海兵黄晓峰

作者单位：	杭州电子科技大学通信工程学院，浙江杭州 310018

基金项目：	国家自然科学基金资助项目（61572449，61972123，61901150）；科技部重点研发课题资助项目（2018YFC0830106）；浙江省自然科学基金资助项目（Q19F010030）

摘要：	针对目标跟踪过程中的遮挡、形变和快速运动等问题，提出基于策略梯度的目标跟踪方法. 该方法利用策略梯度算法训练策略网络. 该策略网络能够根据当前跟踪结果的可靠性进行动作决策，以避免错误的模板更新或者重新检测丢失的目标. 在决策过程中，通过计算加权置信度差值分析当前跟踪结果的鲁棒性和准确性，使得策略网络能够更准确地评估跟踪结果. 在重检测过程中，提出有效的重检测方法，对大量的搜索区域进行过滤，大大提高了搜索效率，利用决策模块检验重检测结果，确保重检测结果的准确性. 利用提出的算法在OTB数据集及LaSOT数据集上进行评估. 实验结果表明，提出的跟踪算法在原算法的基础上提高了2.5%~4.0%的性能.
关键词：	目标跟踪决策策略梯度重检测模板更新
Visual object tracking based on policy gradient

Kang-hao WANG,Hai-bing YIN,Xiao-feng HUANG.Visual object tracking based on policy gradient[J].Journal of Zhejiang University(Engineering Science),2020,54(10):1923-1928.

Authors:	Kang-hao WANG Hai-bing YIN Xiao-feng HUANG

Abstract:	An object tracking method based on policy gradient was proposed aiming at the problems of occlusion, deformation and fast motion in the process of object tracking. The policy gradient algorithm was used to train the policy network. The policy network can make action decisions founded on the reliability of current tracking results to avoid the incorrect template update or re-detect the missing targets. During the decision-making process, the robustness and accuracy of the current tracking result were both analyzed by calculating the weighted confidence margin, which helped the policy network to evaluate the tracking results more accurately. During the re-detection process, an efficient re-detection method was proposed to filter a large number of searching areas, which greatly improved the search efficiency. The decision-making module was utilized to examine the re-detected result, which ensured the accuracy of the re-detected results. The proposed algorithm was evaluated on OTB dataset and LaSOT dataset. The experimental results show that the proposed tracking algorithm improves performance by 2.5%-4.0% based on the original algorithm.

Keywords:	visual object tracking decision making policy gradient re-detection template update
本文献已被 CNKI 等数据库收录！
	点击此处可从《浙江大学学报(自然科学版 )》浏览原始摘要信息
	点击此处可从《浙江大学学报(自然科学版 )》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏