首页 | 官方网站   微博 | 高级检索  
     


An actor-critic algorithm for constrained Markov decision processes
Authors:VS Borkar  
Affiliation:School of Technology and Computer Science, Tata Institute of Fundamental Research, Homi Bhabha Road, Mumbai 400005, India
Abstract:An actor-critic type reinforcement learning algorithm is proposed and analyzed for constrained controlled Markov decision processes. The analysis uses multiscale stochastic approximation theory and the envelope theorem' of mathematical economics.
Keywords:Actor-critic algorithms  Reinforcement learning  Constrained Markov decision processes  Stochastic approximation  Envelope theorem
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号