{"title":"基于质心距离评价的强化RNN的支架驱动分子生成","authors":"Xingzheng Zhu , Zhihong Zhao , Fei Zhu","doi":"10.1016/j.eswa.2025.128606","DOIUrl":null,"url":null,"abstract":"<div><div>De novo molecular design is learning from existing data to propose a new chemical structure that meets the desired properties. Still, it is difficult to de novo design a variety of novel molecules with desirable properties because the different properties of molecules cannot be balanced using a simple generative method. To solve this problem, this study proposes a reinforced Pareto-optimized molecular scaffold clustering generation method, ScaRL-P. Molecular scaffold information can help us identify molecules of the same pattern, cluster according to the core characteristics of the scaffold, and screen out the molecules with ideal properties. In addition, this study uses Pareto optimization to construct a three-dimensional Pareto frontier - biological activity, diversity, and in-cluster reward value, and cooperate with molecular scaffold clustering to obtain the dominant molecules. The multi-dimensional frontier obtained by adjusting the Pareto frontier in reinforcement learning modeling is transformed into the final reward return, which is provided to the agent in reinforcement learning to learn a molecular generation strategy that is close to the optimal attribute distribution.ScaRL-P demonstrated superior performance in binding affinity and optimization for three protein objectives (KOR, PIK3CA, and JAK2), outperforming several GPC benchmark methods.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"292 ","pages":"Article 128606"},"PeriodicalIF":7.5000,"publicationDate":"2025-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Scaffold-driven molecular generation via reinforced RNN with centroid distance evaluation\",\"authors\":\"Xingzheng Zhu , Zhihong Zhao , Fei Zhu\",\"doi\":\"10.1016/j.eswa.2025.128606\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>De novo molecular design is learning from existing data to propose a new chemical structure that meets the desired properties. Still, it is difficult to de novo design a variety of novel molecules with desirable properties because the different properties of molecules cannot be balanced using a simple generative method. To solve this problem, this study proposes a reinforced Pareto-optimized molecular scaffold clustering generation method, ScaRL-P. Molecular scaffold information can help us identify molecules of the same pattern, cluster according to the core characteristics of the scaffold, and screen out the molecules with ideal properties. In addition, this study uses Pareto optimization to construct a three-dimensional Pareto frontier - biological activity, diversity, and in-cluster reward value, and cooperate with molecular scaffold clustering to obtain the dominant molecules. The multi-dimensional frontier obtained by adjusting the Pareto frontier in reinforcement learning modeling is transformed into the final reward return, which is provided to the agent in reinforcement learning to learn a molecular generation strategy that is close to the optimal attribute distribution.ScaRL-P demonstrated superior performance in binding affinity and optimization for three protein objectives (KOR, PIK3CA, and JAK2), outperforming several GPC benchmark methods.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"292 \",\"pages\":\"Article 128606\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-06-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425022250\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425022250","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Scaffold-driven molecular generation via reinforced RNN with centroid distance evaluation
De novo molecular design is learning from existing data to propose a new chemical structure that meets the desired properties. Still, it is difficult to de novo design a variety of novel molecules with desirable properties because the different properties of molecules cannot be balanced using a simple generative method. To solve this problem, this study proposes a reinforced Pareto-optimized molecular scaffold clustering generation method, ScaRL-P. Molecular scaffold information can help us identify molecules of the same pattern, cluster according to the core characteristics of the scaffold, and screen out the molecules with ideal properties. In addition, this study uses Pareto optimization to construct a three-dimensional Pareto frontier - biological activity, diversity, and in-cluster reward value, and cooperate with molecular scaffold clustering to obtain the dominant molecules. The multi-dimensional frontier obtained by adjusting the Pareto frontier in reinforcement learning modeling is transformed into the final reward return, which is provided to the agent in reinforcement learning to learn a molecular generation strategy that is close to the optimal attribute distribution.ScaRL-P demonstrated superior performance in binding affinity and optimization for three protein objectives (KOR, PIK3CA, and JAK2), outperforming several GPC benchmark methods.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.