Torsion angular bin strings: algorithmic update and additional validation.

IF 5.7 2区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Jessica Braun, Djahan Lamei, Philippe H Hünenberger, Gregory A Landrum, Sereina Riniker
{"title":"Torsion angular bin strings: algorithmic update and additional validation.","authors":"Jessica Braun, Djahan Lamei, Philippe H Hünenberger, Gregory A Landrum, Sereina Riniker","doi":"10.1186/s13321-026-01194-6","DOIUrl":null,"url":null,"abstract":"<p><p>In our previous work, we introduced the concept of torsion angular bin strings (TABS), which is a discrete vector representation of a conformer's torsional angles. Through this discretization, conformational states can be counted, yielding an estimate of the upper limit of the expected conformational ensemble size (nTABS). Besides nTABS being used as a quantitative measure of molecular flexibility, TABS itself is a way of grouping the conformers of a molecule without picking thresholds. This feature of TABS is especially valuable, as selecting suitable thresholds for metrics such as heavy-atom root-mean-square deviation (RMSD) or shape Tanimoto is highly system-dependent and can thus be challenging when working with large sets of molecules. Here, we describe the update to the nTABS algorithm of the TABS package since the last release. In addition, we present a classification study of conformer ensembles by TABS and compare it to classifications by a shape Tanimoto metric. Scientific contribution In contrast to our previous implementation, which handled molecular topological symmetry by enumerating all possible combinations that were simply permutations of one another, the new implementation treats TABS as mathematical objects governed by group theory, specifically Burnside's Lemma. This approach requires substantially less code and delivers a notable improvement in computational speed. The study also builds upon our previously developed framework for categorization comparisons between TABS and heavy-atom RMSD. Here, we show the results of a similar comparison with a shape Tanimoto metric, which further support the hypothesis that TABS encode the shape of conformers in a meaningful way.</p>","PeriodicalId":617,"journal":{"name":"Journal of Cheminformatics","volume":" ","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2026-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cheminformatics","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1186/s13321-026-01194-6","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

In our previous work, we introduced the concept of torsion angular bin strings (TABS), which is a discrete vector representation of a conformer's torsional angles. Through this discretization, conformational states can be counted, yielding an estimate of the upper limit of the expected conformational ensemble size (nTABS). Besides nTABS being used as a quantitative measure of molecular flexibility, TABS itself is a way of grouping the conformers of a molecule without picking thresholds. This feature of TABS is especially valuable, as selecting suitable thresholds for metrics such as heavy-atom root-mean-square deviation (RMSD) or shape Tanimoto is highly system-dependent and can thus be challenging when working with large sets of molecules. Here, we describe the update to the nTABS algorithm of the TABS package since the last release. In addition, we present a classification study of conformer ensembles by TABS and compare it to classifications by a shape Tanimoto metric. Scientific contribution In contrast to our previous implementation, which handled molecular topological symmetry by enumerating all possible combinations that were simply permutations of one another, the new implementation treats TABS as mathematical objects governed by group theory, specifically Burnside's Lemma. This approach requires substantially less code and delivers a notable improvement in computational speed. The study also builds upon our previously developed framework for categorization comparisons between TABS and heavy-atom RMSD. Here, we show the results of a similar comparison with a shape Tanimoto metric, which further support the hypothesis that TABS encode the shape of conformers in a meaningful way.

扭转角bin字符串:算法更新和额外的验证。
在我们之前的工作中,我们介绍了扭转角线束(TABS)的概念,这是一个离散矢量表示的一个共形器的扭转角。通过这种离散化,可以计算构象状态,从而估计出预期构象系综尺寸(nTABS)的上限。除了nTABS被用作分子灵活性的定量测量之外,TABS本身是一种不需要选择阈值就可以对分子的构象进行分组的方法。TABS的这个特性特别有价值,因为为重原子均方根偏差(RMSD)或谷本形状等指标选择合适的阈值是高度依赖系统的,因此在处理大量分子时可能具有挑战性。这里,我们描述了自上次发布以来TABS包的nTABS算法的更新。此外,我们提出了一个分类研究的共形集成的标签,并比较其分类的形状谷本度量。我们之前的实现通过列举所有可能的组合来处理分子拓扑对称,这些组合只是彼此的简单排列,与此相反,新的实现将tab视为受群论(特别是Burnside引理)支配的数学对象。这种方法需要更少的代码,并显著提高了计算速度。该研究还建立在我们之前开发的tab和重原子RMSD之间分类比较的框架之上。在这里,我们展示了与形状谷本度量的类似比较的结果,这进一步支持了tab以有意义的方式编码构象形状的假设。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Cheminformatics
Journal of Cheminformatics CHEMISTRY, MULTIDISCIPLINARY-COMPUTER SCIENCE, INFORMATION SYSTEMS
CiteScore
14.10
自引率
7.00%
发文量
82
审稿时长
3 months
期刊介绍: Journal of Cheminformatics is an open access journal publishing original peer-reviewed research in all aspects of cheminformatics and molecular modelling. Coverage includes, but is not limited to: chemical information systems, software and databases, and molecular modelling, chemical structure representations and their use in structure, substructure, and similarity searching of chemical substance and chemical reaction databases, computer and molecular graphics, computer-aided molecular design, expert systems, QSAR, and data mining techniques.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书