一种大规模离散空间中的高斯强化学习方法 Gaussian Processes Reinforcement Learning Method in Large Discrete States Space期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

一种大规模离散空间中的高斯强化学习方法

引用本文：	周文云,刘全,李志涛.一种大规模离散空间中的高斯强化学习方法[J].计算机科学,2009,36(8):247-249.

作者姓名：	周文云刘全李志涛

作者单位：	1. 苏州大学计算机科学与技术学院,苏州,215006 2. 苏州大学计算机科学与技术学院,苏州,215006;南京大学软件新技术国家重点实验室,南京,210093

基金项目：	国家自然科学基金项目，教育部科学技术研究重点项目，中国博士后科研基金，江苏省商校自然科学基金

摘要：	针对大规模离散空间中强化学习的"维数灾"问题,即状态空间的大小随着特征的增加而发生指教级的增长,提出了一种基于高斯过程的强化学习方法.在本方法中,高斯过程模型有表示函数分布的能力,使用该模型之后,可以得到的不只是一个所需的估计值,而是关于该值的一个分布.实验结果表明,结合了高斯过程的强化学习方法在各方面性能,如收敛速度以及最终实验效果等都有所提高.使用高斯方法的回归模型可以在一定程度上解决大规模离散空间上的"维数灾"问题.
关键词：	强化学习维数灾高斯过程回归函数分布
收稿时间：	2008/9/25 0:00:00
修稿时间：	2008/12/23 0:00:00
Gaussian Processes Reinforcement Learning Method in Large Discrete States Space

ZHOU Wen-yun,LIU Quan,LI Zhi-tao.Gaussian Processes Reinforcement Learning Method in Large Discrete States Space[J].Computer Science,2009,36(8):247-249.

Authors:	ZHOU Wen-yun LIU Quan LI Zhi-tao

Affiliation:	Institute of Computer Science and Technology;Soochow University;Soochow 215006;China;State Key Laboratory for Novel Software Technology;Nanjing University;Nanjing 210093;China

Abstract:	In order to solve the problem of "curse of dimensionality",which means that the states space will grow exponentially in the number of features,in large discrete states space in reinforcement learning,a reinforcement learning method based on Gaussian processes was proposed.The Gaussian processes model can represent the distribution of functions,and it can be used to get a distribution of the expectation instead of its value.The experiment result shows that the performance such as speed of convergence and fin...

Keywords:	Reinforcement learning Curse of dimensionality Gaussian processes Regression Distribution of functions
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《计算机科学》浏览原始摘要信息
	点击此处可从《计算机科学》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏