首页 | 官方网站   微博 | 高级检索  
     

基于强化学习的群体进化算法求解双边多目标同步并行拆解线平衡问题
引用本文:郭洪飞,陆鑫宇,任亚平,张超勇,李建庆.基于强化学习的群体进化算法求解双边多目标同步并行拆解线平衡问题[J].机械工程学报,2023,59(7):355-366.
作者姓名:郭洪飞  陆鑫宇  任亚平  张超勇  李建庆
作者单位:1. 暨南大学智能科学与工程学院 珠海 519070;2. 暨南大学物联网与物流工程研究院 珠海 519070;3. 暨南大学管理学院 广州 510632;4. 华中科技大学数字制造装备与技术国家重点实验室 武汉 430074;5. 澳门科技大学计算机科学与工程学院 澳门 999078
基金项目:国家自然科学基金(52205526,52205538)、科技部国家外国专家(G2021199026L)、广州市基础研究计划基础与应用基础研究(202201010284)、广东省研究生教育创新计划(82620516)、广州市创新领军团队(201909010006)和广东省“质量工程”建设(210308)资助项目
摘    要:双碳背景下,工业制造领域朝着绿色、节能的方向转型,对废旧产品进行回收拆解以及再制造有利于推动高质量发展。论文针对大型复杂产品拆解过程中难以改变拆卸方向的问题,并综合考虑实际拆卸过程中任务之间存在的多种约束类型,研究了同步并行模式下的双边拆解线平衡问题(two-sided disassembly line balancing problem, TDLBP)。首先引入双边布局的拆解线模式,定义了与优先和或优先关系,建立了TDLBP的数学模型以优化产线布置、经济效益和安全环保三个方面共六个指标。然后提出了一种基于强化学习的群体进化算法,采用Q-learing利用所学知识选择迭代中的最佳算子,通过拥挤距离筛选Pareto解集,利用精英保留策略加速算法收敛,进而高效获取近似最优的拆解方案。最后通过求解小规模算例并对比分析,验证了所提出算法的有效性和优越性,并进行大规模案例的应用。

关 键 词:双边拆解线  同步并行  强化学习  群体进化算法  Pareto
收稿时间:2022-05-23

Reinforcement Learning-based Swarm Evolutionary Algorithm To Solve Two-sided Multi-objective Synchronous Parallel Disassembly Line Balancing Problem
GUO Hongfei,LU Xinyu,REN Yaping,ZHANG Chaoyong,LI Jianqing.Reinforcement Learning-based Swarm Evolutionary Algorithm To Solve Two-sided Multi-objective Synchronous Parallel Disassembly Line Balancing Problem[J].Chinese Journal of Mechanical Engineering,2023,59(7):355-366.
Authors:GUO Hongfei  LU Xinyu  REN Yaping  ZHANG Chaoyong  LI Jianqing
Affiliation:1. School of Intelligent Systems Science and Engineering, Jinan University, Zhuhai 519070;2. Institute of Physical Internet, Jinan University, Zhuhai 519070;3. School of Management, Jinan University, Guangzhou 510632;4. State Key Lab of Digital Manufacturing Equipment and Technology, Huazhong University of Science and Technology, Wuhan 430074;5. School of Computer Science and Engineering, Macau University of Science and Technology, Macau 999078
Abstract:Under the background of dual carbon, the industrial manufacturing field is transforming in the direction of green and energy-saving. The recycling, disassembling and remanufacturing of waste products is conducive to promoting high-quality development. Aiming at the problem that it is difficult to change the disassembly direction during the disassembly process of large and complex products, and comprehensively considering the various constraints that exist between tasks in the actual disassembly process, the paper studies the two-sided disassembly line balancing problem (TDLBP) in the synchronous parallel mode. Firstly, the disassembly line mode of two-sided layout is introduced, and the relationship of and priority and or priority is defined, and the mathematical model of TDLBP is established to optimize the layout of the production line, economic benefits, safety and environmental protection with a total of six indicators. Then, a swarm evolutionary algorithm based on reinforcement learning is proposed. Q-learning is used to select the best operator in the iteration by using the knowledge learned, the Pareto solution set is screened by the crowding distance, and the elite retention strategy is used to accelerate the convergence of the algorithm, so as to efficiently obtain the approximate optimal disassembly scheme. Finally, the effectiveness and superiority of the proposed algorithm are verified by solving small-scale cases and comparative analysis, and the application of large-scale case is carried out.
Keywords:two-sided disassembly line  synchronous parallel  reinforcement learning  swarm evolutionary algorithm  Pareto  
点击此处可从《机械工程学报》浏览原始摘要信息
点击此处可从《机械工程学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号