SMARTpy: a Python package for the generation of cavity steric molecular descriptors and applications to diverse systems†

IF 6.2 Q1 CHEMISTRY, MULTIDISCIPLINARY
Beck R. Miller, Ryan C. Cammarota and Matthew S. Sigman
{"title":"SMARTpy: a Python package for the generation of cavity steric molecular descriptors and applications to diverse systems†","authors":"Beck R. Miller, Ryan C. Cammarota and Matthew S. Sigman","doi":"10.1039/D4DD00329B","DOIUrl":null,"url":null,"abstract":"<p >Steric molecular descriptors designed for machine learning (ML) applications are critical for connecting structure–function relationships to mechanistic insight. However, many of these descriptors are not suitable for application to complex systems, such as catalyst reactive site pockets. In this context, we recently disclosed a new set of 3D steric molecular descriptors that were originally designed for dirhodium(<small>II</small>) tetra-carboxylate catalysts. Herein, we expand the spatial molding for rigid targets (SMART) descriptor toolkit by releasing SMARTpy; an automated, open-source Python API package for computational workflow integration of SMART descriptors. The impact of the structure of the molecular probe for generation of SMART descriptors was analyzed. Resultant SMART descriptors and pocket features were found to be highly dependent upon probe selection, and do not scale linearly. Flexible probes with smaller substituents can explore narrow pocket regions resulting in a higher resolution pocket imprint. Macrocyclic probes with larger substituents are more applicable to larger cavities with smooth boundaries, such as dirhodium paddlewheel complexes. In these cases, SMARTpy provides comparable descriptors to the original calculation method using UCSF Chimera. Finally, we analyzed a series of case studies demonstrating how SMART descriptors can impact other areas of catalysis, such as organocatalysis, biocatalysis, and protein pocket analysis.</p>","PeriodicalId":72816,"journal":{"name":"Digital discovery","volume":" 2","pages":" 451-463"},"PeriodicalIF":6.2000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/dd/d4dd00329b?page=search","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital discovery","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/dd/d4dd00329b","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Steric molecular descriptors designed for machine learning (ML) applications are critical for connecting structure–function relationships to mechanistic insight. However, many of these descriptors are not suitable for application to complex systems, such as catalyst reactive site pockets. In this context, we recently disclosed a new set of 3D steric molecular descriptors that were originally designed for dirhodium(II) tetra-carboxylate catalysts. Herein, we expand the spatial molding for rigid targets (SMART) descriptor toolkit by releasing SMARTpy; an automated, open-source Python API package for computational workflow integration of SMART descriptors. The impact of the structure of the molecular probe for generation of SMART descriptors was analyzed. Resultant SMART descriptors and pocket features were found to be highly dependent upon probe selection, and do not scale linearly. Flexible probes with smaller substituents can explore narrow pocket regions resulting in a higher resolution pocket imprint. Macrocyclic probes with larger substituents are more applicable to larger cavities with smooth boundaries, such as dirhodium paddlewheel complexes. In these cases, SMARTpy provides comparable descriptors to the original calculation method using UCSF Chimera. Finally, we analyzed a series of case studies demonstrating how SMART descriptors can impact other areas of catalysis, such as organocatalysis, biocatalysis, and protein pocket analysis.

Abstract Image

一个Python包,用于生成腔空间分子描述符和应用于不同的系统†
为机器学习(ML)应用设计的立体分子描述符对于将结构-功能关系与机制洞察力联系起来至关重要。然而,这些描述符中的许多不适合应用于复杂的系统,如催化剂活性位点口袋。在这种情况下,我们最近披露了一组新的三维立体分子描述符,最初是为四羧酸二钠催化剂设计的。在此,我们通过释放SMARTpy扩展了刚性目标空间成型描述符工具包;一个自动化的,开源的Python API包,用于SMART描述符的计算工作流集成。分析了分子探针结构对SMART描述符生成的影响。由此产生的SMART描述符和口袋特征被发现高度依赖于探针选择,并且不线性缩放。具有较小取代基的柔性探针可以探测狭窄的口袋区域,从而产生更高分辨率的口袋印记。具有较大取代基的大环探针更适用于具有光滑边界的较大腔体,如镝桨轮配合物。在这些情况下,SMARTpy提供了与使用UCSF Chimera的原始计算方法类似的描述符。最后,我们分析了一系列案例研究,展示了SMART描述符如何影响其他催化领域,如有机催化、生物催化和蛋白质口袋分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
2.80
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信