Reinforcement learning for cooling rate control during quenching

IF 5.1 3区工程技术 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

International Journal of Numerical Methods for Heat & Fluid Flow Pub Date : 2024-08-26 DOI:10.1108/hff-11-2023-0713

Elie Hachem, Abhijeet Vishwasrao, Maxime Renault, Jonathan Viquerat, P. Meliga

{"title":"Reinforcement learning for cooling rate control during quenching","authors":"Elie Hachem, Abhijeet Vishwasrao, Maxime Renault, Jonathan Viquerat, P. Meliga","doi":"10.1108/hff-11-2023-0713","DOIUrl":null,"url":null,"abstract":"<h3>Purpose</h3>\n<p>The premise of this research is that the coupling of reinforcement learning algorithms and computational dynamics can be used to design efficient control strategies and to improve the cooling of hot components by quenching, a process that is classically carried out based on professional experience and trial-error methods. Feasibility and relevance are assessed on various 2-D numerical experiments involving boiling problems simulated by a phase change model. The purpose of this study is then to integrate reinforcement learning with boiling modeling involving phase change to optimize the cooling process during quenching.</p>\n<h3>Design/methodology/approach</h3>\n<p>The proposed approach couples two state-of-the-art in-house models: a single-step proximal policy optimization (PPO) deep reinforcement learning (DRL) algorithm (for data-driven selection of control parameters) and an in-house stabilized finite elements environment combining variational multi-scale (VMS) modeling of the governing equations, immerse volume method and multi-component anisotropic mesh adaptation (to compute the numerical reward used by the DRL agent to learn), that simulates boiling after a phase change model formulated after pseudo-compressible Navier–Stokes and heat equations.</p>\n<h3>Findings</h3>\n<p>Relevance of the proposed methodology is illustrated by controlling natural convection in a closed cavity with aspect ratio 4:1, for which DRL alleviates the flow-induced enhancement of heat transfer by approximately 20%. Regarding quenching applications, the DRL algorithm finds optimal insertion angles that adequately homogenize the temperature distribution in both simple and complex 2-D workpiece geometries, and improve over simpler trial-and-error strategies classically used in the quenching industry.</p>\n<h3>Originality/value</h3>\n<p>To the best of the authors’ knowledge, this constitutes the first attempt to achieve DRL-based control of complex heat and mass transfer processes involving boiling. The obtained results have important implications for the quenching cooling flows widely used to achieve the desired microstructure and material properties of steel, and for which differential cooling in various zones of the quenched component will yield irregular residual stresses that can affect the serviceability of critical machinery in sensitive industries.</p>","PeriodicalId":14263,"journal":{"name":"International Journal of Numerical Methods for Heat & Fluid Flow","volume":"17 1","pages":""},"PeriodicalIF":5.1000,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Numerical Methods for Heat & Fluid Flow","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1108/hff-11-2023-0713","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose

The premise of this research is that the coupling of reinforcement learning algorithms and computational dynamics can be used to design efficient control strategies and to improve the cooling of hot components by quenching, a process that is classically carried out based on professional experience and trial-error methods. Feasibility and relevance are assessed on various 2-D numerical experiments involving boiling problems simulated by a phase change model. The purpose of this study is then to integrate reinforcement learning with boiling modeling involving phase change to optimize the cooling process during quenching.

Design/methodology/approach

The proposed approach couples two state-of-the-art in-house models: a single-step proximal policy optimization (PPO) deep reinforcement learning (DRL) algorithm (for data-driven selection of control parameters) and an in-house stabilized finite elements environment combining variational multi-scale (VMS) modeling of the governing equations, immerse volume method and multi-component anisotropic mesh adaptation (to compute the numerical reward used by the DRL agent to learn), that simulates boiling after a phase change model formulated after pseudo-compressible Navier–Stokes and heat equations.

Findings

Relevance of the proposed methodology is illustrated by controlling natural convection in a closed cavity with aspect ratio 4:1, for which DRL alleviates the flow-induced enhancement of heat transfer by approximately 20%. Regarding quenching applications, the DRL algorithm finds optimal insertion angles that adequately homogenize the temperature distribution in both simple and complex 2-D workpiece geometries, and improve over simpler trial-and-error strategies classically used in the quenching industry.

Originality/value

To the best of the authors’ knowledge, this constitutes the first attempt to achieve DRL-based control of complex heat and mass transfer processes involving boiling. The obtained results have important implications for the quenching cooling flows widely used to achieve the desired microstructure and material properties of steel, and for which differential cooling in various zones of the quenched component will yield irregular residual stresses that can affect the serviceability of critical machinery in sensitive industries.

查看原文本刊更多论文

淬火过程中冷却速率控制的强化学习

目的本研究的前提是，强化学习算法与计算动力学的耦合可用于设计高效的控制策略，并通过淬火改进热部件的冷却，而淬火过程通常是基于专业经验和试错方法进行的。通过相变模型模拟沸腾问题的各种二维数值实验，对其可行性和相关性进行了评估。本研究的目的是将强化学习与涉及相变的沸腾模型相结合，以优化淬火过程中的冷却过程。设计/方法/途径所提出的方法结合了两种最先进的内部模型：单步近端策略优化（PPO）深度强化学习（DRL）算法（用于数据驱动的控制参数选择）和内部稳定有限元环境，结合了管理方程的变分多尺度（VMS）建模、浸没体积法和多组分各向异性网格适应（用于计算 DRL 代理学习所使用的数值奖励），模拟了根据伪可压缩 Navier-Stokes 和热方程制定的相变模型后的沸腾。研究结果通过控制长宽比为 4:1 的封闭空腔中的自然对流，说明了所提方法的相关性。在淬火应用方面，DRL 算法找到了最佳插入角，可充分均匀简单和复杂的二维工件几何形状中的温度分布，并改进了淬火行业通常使用的简单试错策略。所获得的结果对广泛用于实现钢材所需微观结构和材料性能的淬火冷却流具有重要意义，因为淬火部件不同区域的冷却差异会产生不规则的残余应力，从而影响敏感行业关键设备的适用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Numerical Methods for Heat & Fluid Flow 工程技术-力学

CiteScore

9.50

自引率

11.90%

发文量

100

审稿时长

6-12 weeks

期刊介绍： The main objective of this international journal is to provide applied mathematicians, engineers and scientists engaged in computer-aided design and research in computational heat transfer and fluid dynamics, whether in academic institutions of industry, with timely and accessible information on the development, refinement and application of computer-based numerical techniques for solving problems in heat and fluid flow. - See more at: http://emeraldgrouppublishing.com/products/journals/journals.htm?id=hff#sthash.Kf80GRt8.dpuf