{"title":"Generation and optimization of gold nanoclusters via reinforcement learning","authors":"Muhammad Usman, Fuyi Chen","doi":"10.1140/epjd/s10053-025-01006-w","DOIUrl":null,"url":null,"abstract":"<div><p>The identification and prediction of atomic cluster structures are crucial in nanocluster and materials research, as the molecular structure significantly influences the properties of nanoclusters. This study presents an innovative approach for generating and optimizing gold Au<sub>13</sub>, Au<sub>7</sub>, Au<sub>6,</sub> and Au<sub>5</sub> nanoclusters using reinforcement learning (RL). Conventional techniques for optimizing nanoparticle structures are significantly expensive in computation and have some restrictions when exploring a broad range of design possibilities. To overcome these challenges, we used a policy-based RL model that learns how to arrange atoms on a canvas to minimize the potential energy of the nanocluster, like an actor–critic model. The agent works under a reward function based on the molecule’s energy, systematically positioning atoms on a canvas until it reaches convergence. The performance and evaluation of our RL model are assessed by local optimization techniques, specifically the BFGS optimization algorithm and simulated annealing. We conclude that the RL method is effective for identifying the configuration of Au<sub>13</sub> nanoparticles and achieving a stable and low-energy icosahedral structure. The complexity of the energy landscape of nanoalloys renders the determination of their structure a complicated task. This study points out the potential of reinforcement learning in materials science for designing and optimizing nanoparticles with stability characteristics.</p><h3>Graphic abstract</h3><div><figure><div><div><picture><source><img></source></picture></div><div><p>A schematic representation of the actor-critic reinforcement learning model. The input data is processed into a state, which the critic evaluates to estimate the value function. The actor uses the state to determine the action parameters, influencing the next state. The process continues as the agent learns to maximize the reward</p></div></div></figure></div></div>","PeriodicalId":789,"journal":{"name":"The European Physical Journal D","volume":"79 5","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The European Physical Journal D","FirstCategoryId":"4","ListUrlMain":"https://link.springer.com/article/10.1140/epjd/s10053-025-01006-w","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0
Abstract
The identification and prediction of atomic cluster structures are crucial in nanocluster and materials research, as the molecular structure significantly influences the properties of nanoclusters. This study presents an innovative approach for generating and optimizing gold Au13, Au7, Au6, and Au5 nanoclusters using reinforcement learning (RL). Conventional techniques for optimizing nanoparticle structures are significantly expensive in computation and have some restrictions when exploring a broad range of design possibilities. To overcome these challenges, we used a policy-based RL model that learns how to arrange atoms on a canvas to minimize the potential energy of the nanocluster, like an actor–critic model. The agent works under a reward function based on the molecule’s energy, systematically positioning atoms on a canvas until it reaches convergence. The performance and evaluation of our RL model are assessed by local optimization techniques, specifically the BFGS optimization algorithm and simulated annealing. We conclude that the RL method is effective for identifying the configuration of Au13 nanoparticles and achieving a stable and low-energy icosahedral structure. The complexity of the energy landscape of nanoalloys renders the determination of their structure a complicated task. This study points out the potential of reinforcement learning in materials science for designing and optimizing nanoparticles with stability characteristics.
Graphic abstract
A schematic representation of the actor-critic reinforcement learning model. The input data is processed into a state, which the critic evaluates to estimate the value function. The actor uses the state to determine the action parameters, influencing the next state. The process continues as the agent learns to maximize the reward
期刊介绍:
The European Physical Journal D (EPJ D) presents new and original research results in:
Atomic Physics;
Molecular Physics and Chemical Physics;
Atomic and Molecular Collisions;
Clusters and Nanostructures;
Plasma Physics;
Laser Cooling and Quantum Gas;
Nonlinear Dynamics;
Optical Physics;
Quantum Optics and Quantum Information;
Ultraintense and Ultrashort Laser Fields.
The range of topics covered in these areas is extensive, from Molecular Interaction and Reactivity to Spectroscopy and Thermodynamics of Clusters, from Atomic Optics to Bose-Einstein Condensation to Femtochemistry.