{"title":"复合优化的迭代DeepSARM建模","authors":"Atsushi Yoshimori , Jürgen Bajorath","doi":"10.1016/j.ailsci.2021.100015","DOIUrl":null,"url":null,"abstract":"<div><p>The Structure-Activity Relationship (SAR) Matrix (SARM) method systematically extracts structurally related compound series from any source and organizes these series in a unique data structure formed by matrices similar to R-group tables from medicinal chemistry. In addition, the SARM method generates virtual analogues for structurally organized series that consist of new combinations of existing core structures and R-groups. For active compounds, SARMs visualize SAR patterns and aid in compound design. The SARM methodology and data structure was integrated with a recurrent neural network architecture to further expand the compound design capacity with deep generative models, leading to the DeepSARM approach. Herein, we present an extension of the DeepSARM framework for compound optimization termed iterative DeepSARM (iDeepSARM), which involves multiple iterations of deep generative modeling and fine-tuning to obtain increasingly likely active compounds for targets of interest. Hence, iDeepSARM adds computational hit-to-lead and lead optimization capability to the DeepSARM framework. In addition to detailing methodological features and calculation protocols, an exemplary compound design application is reported to illustrate the iDeepSARM approach.</p></div>","PeriodicalId":72304,"journal":{"name":"Artificial intelligence in the life sciences","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667318521000155/pdfft?md5=64c96435c7527c83c4f92d37a0c7edc8&pid=1-s2.0-S2667318521000155-main.pdf","citationCount":"2","resultStr":"{\"title\":\"Iterative DeepSARM modeling for compound optimization\",\"authors\":\"Atsushi Yoshimori , Jürgen Bajorath\",\"doi\":\"10.1016/j.ailsci.2021.100015\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The Structure-Activity Relationship (SAR) Matrix (SARM) method systematically extracts structurally related compound series from any source and organizes these series in a unique data structure formed by matrices similar to R-group tables from medicinal chemistry. In addition, the SARM method generates virtual analogues for structurally organized series that consist of new combinations of existing core structures and R-groups. For active compounds, SARMs visualize SAR patterns and aid in compound design. The SARM methodology and data structure was integrated with a recurrent neural network architecture to further expand the compound design capacity with deep generative models, leading to the DeepSARM approach. Herein, we present an extension of the DeepSARM framework for compound optimization termed iterative DeepSARM (iDeepSARM), which involves multiple iterations of deep generative modeling and fine-tuning to obtain increasingly likely active compounds for targets of interest. Hence, iDeepSARM adds computational hit-to-lead and lead optimization capability to the DeepSARM framework. In addition to detailing methodological features and calculation protocols, an exemplary compound design application is reported to illustrate the iDeepSARM approach.</p></div>\",\"PeriodicalId\":72304,\"journal\":{\"name\":\"Artificial intelligence in the life sciences\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2667318521000155/pdfft?md5=64c96435c7527c83c4f92d37a0c7edc8&pid=1-s2.0-S2667318521000155-main.pdf\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial intelligence in the life sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2667318521000155\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence in the life sciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667318521000155","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Iterative DeepSARM modeling for compound optimization
The Structure-Activity Relationship (SAR) Matrix (SARM) method systematically extracts structurally related compound series from any source and organizes these series in a unique data structure formed by matrices similar to R-group tables from medicinal chemistry. In addition, the SARM method generates virtual analogues for structurally organized series that consist of new combinations of existing core structures and R-groups. For active compounds, SARMs visualize SAR patterns and aid in compound design. The SARM methodology and data structure was integrated with a recurrent neural network architecture to further expand the compound design capacity with deep generative models, leading to the DeepSARM approach. Herein, we present an extension of the DeepSARM framework for compound optimization termed iterative DeepSARM (iDeepSARM), which involves multiple iterations of deep generative modeling and fine-tuning to obtain increasingly likely active compounds for targets of interest. Hence, iDeepSARM adds computational hit-to-lead and lead optimization capability to the DeepSARM framework. In addition to detailing methodological features and calculation protocols, an exemplary compound design application is reported to illustrate the iDeepSARM approach.
Artificial intelligence in the life sciencesPharmacology, Biochemistry, Genetics and Molecular Biology (General), Computer Science Applications, Health Informatics, Drug Discovery, Veterinary Science and Veterinary Medicine (General)