首页 | 官方网站   微博 | 高级检索  
     

分布式Q学习多目标函数优化策略
引用本文:宋天恒,李大字,高彦臣.分布式Q学习多目标函数优化策略[J].北京化工大学学报(自然科学版),2011,38(5):125-129.
作者姓名:宋天恒  李大字  高彦臣
作者单位:北京化工大学信息科学与技术学院,北京,100029;青岛高校软控股份有限公司,山东青岛,266555
基金项目:北京市优秀人才资助项目
摘    要:将分布式Q学习算法与Pareto排序法相结合,提出了一种利用强化学习算法解决多目标优化问题的策略。该策略充分利用Q学习语句式的奖赏机制来描述问题的多重目标函数,并结合一般的Pareto排序法,在有限的迭代过程后输出可以充分接近于Pareto前沿的非支配解集。与其他智能搜索算法相比,该策略具有结构简单、无需先验知识、参数设置少的特点。测试函数优化问题验证了算法的有效性,为智能算法解决多目标优化问题提供了一种新思路。

关 键 词:Q学习算法  多目标优化  Pareto排序法
收稿时间:2011-03-10

A Q-learning based multi-objective function optimization strategy
SONG TianHeng,LI DaZi,GAO YanChen.A Q-learning based multi-objective function optimization strategy[J].Journal of Beijing University of Chemical Technology,2011,38(5):125-129.
Authors:SONG TianHeng  LI DaZi  GAO YanChen
Affiliation:1.College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029; 2.Qing Dao Mesnac Co, Ltd, Qingdao Shandong 266555, China
Abstract:In this paper,a multi-optimization strategy is proposed based on combining the Q-learning algorithm and Pareto sorting.Multiple objective functions of the problem are described with the help of a Q-learning rewards strategy.Combined with Pareto sorting,the proposed strategy generates a non-dominated solution set close enough to a real Pareto front after limited iterations.Compared with other intelligent algorithms,it offers the advantages of a simpler structure,learning without prior knowledge,and fewer parameters.The results with test functions prove the validity of the proposed strategy.This method therefore provides an alternative means of intelligent optimization in this area.
Keywords:Q-learning algorithm  multi-objective optimization  Pareto sorting
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《北京化工大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《北京化工大学学报(自然科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号