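Multi-agent-based algorithm for automatic Option generation
Cite this article:SHEN Jing,GU Guo-chang,LIU Hai-bo.Multi-agent-based algorithm for automatic Option generation[J].CAAI Transactions on Intelligent Systems,2006,1(1):84-87.
Authors:SHEN Jing  GU Guo-chang  LIU Hai-bo
Affiliation:School of Computer Science and Technology, Harbin Engineering University, Harbin 150001, Heilongjiang, China
Funding:Supported by the Basic Research Foundation of Harbin Engineering University (HEUFT05021, HEUFT05068).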
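Abstract:Automatic task decomposition in current hierarchical reinforcement learning relies on serial learning algorithms based on a single agent. To address the slow learning speed of such serial algorithms, a multi-agent-based algorithm for automatic Option generation is proposed, with Sutton's Option hierarchical reinforcement learning method as the underlying framework. In this algorithm, multiple agents cooperate to explore the state space in parallel; aiNet is then applied centrally to perform immune clustering, which yields state subspaces; the internal policies on the subspaces are then learned in parallel, and the Options are finally generated. The algorithm is presented for the task of planning the shortest path between two points in a two-dimensional grid space with obstacles, and simulation experiments and analysis are carried out. The results show that the multi-agent-based algorithm generates Options significantly faster than single-agent-based algorithms.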

Keywords:hierarchical reinforcement learning  automatic hierarchy  multi-agent system  Option  aiNet
Article ID:1673-4785(2006)01-0084-04
Received:2005-12-28
Revised:2005-12-28

Algorithm for automatic constructing Option based on multi-agent
SHEN Jing,GU Guo-chang,LIU Hai-bo.Algorithm for automatic constructing Option based on multi-agent[J].CAAI Transactions on Intelligent Systems,2006,1(1):84-87.
Authors:SHEN Jing  GU Guo-chang  LIU Hai-bo
Affiliation:School of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
Abstract:In current hierarchical reinforcement learning, task hierarchies are constructed automatically by slow serial learning algorithms based on a single agent. To speed up learning, a multi-agent-based algorithm for constructing Options automatically is presented, built on the Option HRL framework proposed by Sutton. First, multiple agents cooperate to explore the state space in parallel. The state space is then partitioned into several subspaces by immune clustering based on aiNet. Next, the agents learn the internal policies of the different subspaces concurrently, and the Options are constructed from them. Theoretical analysis and experiments on shortest-path planning in a two-dimensional grid space with obstacles show that the multi-agent-based algorithm constructs Options markedly faster than single-agent-based algorithms.
Keywords:hierarchical reinforcement learning  automatic hierarchy  multi-agent system  Option  aiNet
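
The three stages named in the abstract (parallel exploration, centralized clustering into subspaces, parallel learning of internal policies) can be sketched roughly as follows. This is only an illustrative outline and not the authors' implementation: the grid layout, the start cells, and all function names are assumptions. A breadth-first sweep stands in for each agent's exploration, a nearest-seed partition stands in for aiNet immune clustering, and deterministic Q-value sweeps stand in for the reinforcement learning step. Threads are used only to show the parallel structure.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass

# Hypothetical two-room grid: '.' is free, '#' is an obstacle.
GRID = [
    "....#....",
    "....#....",
    ".........",
    "....#....",
]
MOVES = {"U": (-1, 0), "D": (1, 0), "L": (0, -1), "R": (0, 1)}


def step(state, action):
    """Deterministic move; hitting a wall or obstacle leaves the state unchanged."""
    r, c = state[0] + MOVES[action][0], state[1] + MOVES[action][1]
    if 0 <= r < len(GRID) and 0 <= c < len(GRID[0]) and GRID[r][c] == ".":
        return (r, c)
    return state


def explore(start, budget=200):
    """One agent's exploration: a breadth-first sweep, checked against `budget` per layer."""
    seen, frontier = {start}, [start]
    while frontier and len(seen) < budget:
        nxt = []
        for s in frontier:
            for a in MOVES:
                t = step(s, a)
                if t not in seen:
                    seen.add(t)
                    nxt.append(t)
        frontier = nxt
    return seen


def partition(states, seeds):
    """Stand-in for aiNet clustering: assign each state to its nearest seed (Manhattan)."""
    clusters = [set() for _ in seeds]
    for s in states:
        dists = [abs(s[0] - r) + abs(s[1] - c) for r, c in seeds]
        clusters[dists.index(min(dists))].add(s)
    return clusters


def border_state(cluster):
    """Subgoal (assumed rule): the smallest state with a neighbour outside the cluster."""
    return min(s for s in cluster for a in MOVES if step(s, a) not in cluster)


@dataclass
class Option:
    initiation: set   # states where the Option may be invoked
    policy: dict      # internal policy: state -> action
    subgoal: tuple    # the Option terminates on reaching this state


def learn_option(cluster, gamma=0.9, sweeps=50):
    """Learn an internal policy toward the subgoal by Q-value sweeps (stand-in learner)."""
    goal = border_state(cluster)
    q = {(s, a): 0.0 for s in cluster for a in MOVES}
    for _ in range(sweeps):
        for s in cluster - {goal}:
            for a in MOVES:
                t = step(s, a)
                if t not in cluster:  # the internal policy stays inside its subspace
                    t = s
                best = 0.0 if t == goal else max(q[(t, b)] for b in MOVES)
                q[(s, a)] = -1.0 + gamma * best  # cost of -1 per move
    policy = {s: max(MOVES, key=lambda a: q[(s, a)]) for s in cluster - {goal}}
    return Option(set(cluster), policy, goal)


def build_options(starts):
    with ThreadPoolExecutor() as pool:
        # Stage 1: agents explore in parallel; results are merged centrally.
        states = set().union(*pool.map(explore, starts))
        # Stage 2: centralized clustering into state subspaces.
        clusters = partition(states, starts)
        # Stage 3: one internal policy per subspace, learned in parallel.
        return list(pool.map(learn_option, clusters))


def run(option, state):
    """Follow the Option's internal policy to its subgoal; return the step count."""
    steps = 0
    while state != option.subgoal:
        state = step(state, option.policy[state])
        steps += 1
    return steps


options = build_options([(0, 0), (0, 8)])
print([o.subgoal for o in options], run(options[0], (0, 0)))
```

In this toy layout each room becomes one subspace and the doorway cells become the subgoals, so a higher-level planner could chain the resulting Options to cross between rooms. Swapping `partition` for an actual aiNet implementation and `learn_option` for a sampled Q-learning loop would bring the sketch closer to the method the abstract describes.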
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.