基于Monte Carlo 粒子滤波的POMDPs 在线算法 Online algorithm based on Monte Carlo particle filtering in POMDPs期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于Monte Carlo 粒子滤波的POMDPs 在线算法

引用本文：	仵博,吴敏.基于Monte Carlo 粒子滤波的POMDPs 在线算法[J].控制与决策,2013,28(6):925-929.

作者姓名：	仵博吴敏

作者单位：	1. 中南大学信息科学与工程学院,长沙 410083 2. 中南大学先进控制与智能自动化湖南省工程实验室,长沙 410083 3. 深圳职业技术学院教育技术与信息中心,广东深圳 518055

基金项目：	国家自然科学基金：基于供应链低碳化的企业行为与运营优化决策研究;教育部博士点基金项目

摘要：	针对部分可观察马尔可夫决策过程(POMDPs)的信念状态空间是一个双指数规模问题,提出一种基于 Monte Carlo 粒子滤波的 POMDPs 在线算法.首先,分别采用粒子滤波和粒子映射更新和扩展信念状态,建立可达信念状态与或树;然后,采用分支界限裁剪方法对信念状态与或树进行裁剪,降低求解规模.实验结果表明,所提出算法具有较低的误差率和较快的收敛性,能够满足系统实时性的要求.
关键词：	部分可观察马尔可夫决策过程信念状态 Monte Carlo 粒子滤波在线算法
收稿时间：	2012/2/13 0:00:00
修稿时间：	2012/4/25 0:00:00
Online algorithm based on Monte Carlo particle filtering in POMDPs

WU Bo,WU Min.Online algorithm based on Monte Carlo particle filtering in POMDPs[J].Control and Decision,2013,28(6):925-929.

Authors:	WU Bo WU Min

Abstract:	In order to solve the double exponential size problem of belief states space in partially observable Markov decision processes(POMDPs), an online algorithm based on Monte Carlo particle filtering(MCPF) is proposed. Firstly, the methods of particle filtering and particle projection are used to update and expand the belief states respectively, and the and-or tree of reachable belief states is built. Then, a branch-and-bound pruning method is proposed to prune the tree to reduce computation. Finally, the experiment and simulation results show that the proposed algorithm has the effectiveness in retaining the quality of the policies and reducing the cost of computing policies, so it can meet the requirement of a real-time system.

Keywords:	partially observable Markov decision processes belief states Monte Carlo particle filtering online algorithm
本文献已被万方数据等数据库收录！
	点击此处可从《控制与决策》浏览原始摘要信息
	点击此处可从《控制与决策》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏