David van der Spoel, Julián Marrades, Kristian Kříž, A. Najla Hosseini, Alfred T. Nordman, João Paulo, Marie-Madeleine Walz, Paul J. van Maaren and Mohammad M. Ghahremanpour
{"title":"高维参数空间中基于物理力场的进化机器学习","authors":"David van der Spoel, Julián Marrades, Kristian Kříž, A. Najla Hosseini, Alfred T. Nordman, João Paulo, Marie-Madeleine Walz, Paul J. van Maaren and Mohammad M. Ghahremanpour","doi":"10.1039/D5DD00178A","DOIUrl":null,"url":null,"abstract":"<p >This work presents the Alexandria Chemistry Toolkit (ACT), an open-source software for machine learning of physics-based force fields (FFs) from scratch, based on user-specified potential functions. In this approach, a set of FF parameters for molecular simulation is described as a chromosome consisting of atom and bond genes. The accuracy of a FF, that is how well quantum chemical training data are reproduced, determines the fitness of the chromosome. The ACT implements a hierarchical parallel scheme that iterates between a genetic algorithm and Monte-Carlo steps for global and local search, to find “genomes” with high fitness. As a sample application, genome evolution is performed to create physical models that allow the prediction of properties of organic molecules in the gas and liquid phases. Evaluation of the prediction accuracy of different models showcases how force field science can contribute to systematically improve prediction accuracy of physicochemical observables.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 7","pages":" 1925-1935"},"PeriodicalIF":6.2000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00178a?page=search","citationCount":"0","resultStr":"{\"title\":\"Evolutionary machine learning of physics-based force fields in high-dimensional parameter-space†\",\"authors\":\"David van der Spoel, Julián Marrades, Kristian Kříž, A. Najla Hosseini, Alfred T. Nordman, João Paulo, Marie-Madeleine Walz, Paul J. van Maaren and Mohammad M. Ghahremanpour\",\"doi\":\"10.1039/D5DD00178A\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >This work presents the Alexandria Chemistry Toolkit (ACT), an open-source software for machine learning of physics-based force fields (FFs) from scratch, based on user-specified potential functions. In this approach, a set of FF parameters for molecular simulation is described as a chromosome consisting of atom and bond genes. The accuracy of a FF, that is how well quantum chemical training data are reproduced, determines the fitness of the chromosome. The ACT implements a hierarchical parallel scheme that iterates between a genetic algorithm and Monte-Carlo steps for global and local search, to find “genomes” with high fitness. As a sample application, genome evolution is performed to create physical models that allow the prediction of properties of organic molecules in the gas and liquid phases. Evaluation of the prediction accuracy of different models showcases how force field science can contribute to systematically improve prediction accuracy of physicochemical observables.</p>\",\"PeriodicalId\":72816,\"journal\":{\"name\":\"Digital discovery\",\"volume\":\" 7\",\"pages\":\" 1925-1935\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2025-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d5dd00178a?page=search\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital discovery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://pubs.rsc.org/en/content/articlelanding/2025/dd/d5dd00178a\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital discovery","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/dd/d5dd00178a","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Evolutionary machine learning of physics-based force fields in high-dimensional parameter-space†
This work presents the Alexandria Chemistry Toolkit (ACT), an open-source software for machine learning of physics-based force fields (FFs) from scratch, based on user-specified potential functions. In this approach, a set of FF parameters for molecular simulation is described as a chromosome consisting of atom and bond genes. The accuracy of a FF, that is how well quantum chemical training data are reproduced, determines the fitness of the chromosome. The ACT implements a hierarchical parallel scheme that iterates between a genetic algorithm and Monte-Carlo steps for global and local search, to find “genomes” with high fitness. As a sample application, genome evolution is performed to create physical models that allow the prediction of properties of organic molecules in the gas and liquid phases. Evaluation of the prediction accuracy of different models showcases how force field science can contribute to systematically improve prediction accuracy of physicochemical observables.