Deep reinforcement learning for semiconductor production scheduling

2018 29th Annual SEMI Advanced Semiconductor Manufacturing Conference (ASMC) Pub Date : 2018-04-01 DOI:10.1109/ASMC.2018.8373191

Bernd Waschneck, André Reichstaller, Lenz Belzner, Thomas Altenmüller, T. Bauernhansl, Alexander Knapp, A. Kyek

引用次数: 67

Abstract

Despite producing tremendous success stories by identifying cat videos [1] or solving computer as well as board games [2], [3], the adoption of deep learning in the semiconductor industry is moderatre. In this paper, we apply Google DeepMind's Deep Q Network (DQN) agent algorithm for Reinforcement Learning (RL) to semiconductor production scheduling. In an RL environment several cooperative DQN agents, which utilize deep neural networks, are trained with flexible user-defined objectives. We show benchmarks comparing standard dispatching heuristics with the DQN agents in an abstract frontend-of-line semiconductor production facility. Results are promising and show that DQN agents optimize production autonomously for different targets.

查看原文本刊更多论文

半导体生产调度的深度强化学习

尽管通过识别猫视频[1]或解决计算机和棋盘游戏[2]，[3]产生了巨大的成功故事，但半导体行业对深度学习的采用是温和的。在本文中，我们将Google DeepMind的Deep Q Network (DQN)用于强化学习(RL)的代理算法应用于半导体生产调度。在RL环境中，几个利用深度神经网络的协作DQN代理使用灵活的用户定义目标进行训练。我们展示了在一个抽象的前端半导体生产设施中比较标准调度启发式与DQN代理的基准测试。结果表明，DQN代理可以针对不同的目标自主优化生产。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 29th Annual SEMI Advanced Semiconductor Manufacturing Conference (ASMC)

自引率

0.00%

发文量