深度强化学习算法求解作业车间调度问题 Job Shop Scheduling Problem Based on Deep Reinforcement Learning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

深度强化学习算法求解作业车间调度问题

引用本文：	李宝帅,叶春明.深度强化学习算法求解作业车间调度问题[J].计算机工程与应用,2021,57(23):248-254.

作者姓名：	李宝帅叶春明

作者单位：	上海理工大学管理学院，上海 200093

摘要：	由于传统车间调度方法实时响应能力有限，难以在复杂调度环境中取得良好效果，提出一种基于深度Q网络的深度强化学习算法。该方法结合了深度神经网络的学习能力与强化学习的决策能力，将车间调度问题视作序列决策问题，用深度神经网络拟合价值函数，将调度状态表示为矩阵形式进行输入，使用多个调度规则作为动作空间，并设置基于机器利用率的奖励函数，不断与环境交互，获得每个决策点的最佳调度规则。通过与智能优化算法、调度规则在标准问题集上的测试对比证明了算法有效性。
关键词：	强化学习深度强化学习作业车间调度深度Q网络
Job Shop Scheduling Problem Based on Deep Reinforcement Learning

LI Baoshuai,YE Chunming.Job Shop Scheduling Problem Based on Deep Reinforcement Learning[J].Computer Engineering and Applications,2021,57(23):248-254.

Authors:	LI Baoshuai YE Chunming

Affiliation:	Business School, University of Shanghai for Science & Technology, Shanghai 200093, China

Abstract:	This paper proposes a method to deal with the changeable scheduling environment. This method combines the learning ability of deep neural network with the decision-making ability of reinforcement learning. The approach regards the job shop scheduling problem as a sequential decision-making problem. Deep neural network fits the value function. Scheduling state is represented as a matrix form for input. Some of scheduling rules are used as the action space to directly select the behavior strategy. It sets the reward function related to machine utilization, interacts with the environment to obtain the best scheduling rules for each decision point. The results on the OR-Library show the effectiveness of the algorithm.

Keywords:	reinforcement learning deep reinforcement learning job shop scheduling deep Q network
本文献已被万方数据等数据库收录！
	点击此处可从《计算机工程与应用》浏览原始摘要信息
	点击此处可从《计算机工程与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏