SMILES alignment: a dynamic programming approach for the alignment of metabolites and other small organic molecules.

IF 3.3 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS
Alexis L Tang, David A Liberles
{"title":"SMILES alignment: a dynamic programming approach for the alignment of metabolites and other small organic molecules.","authors":"Alexis L Tang, David A Liberles","doi":"10.1186/s12859-025-06278-y","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>There is a need for computational approaches to compare small organic molecules based on chemical similarity or for evaluating biochemical transformations. No tool currently exists to generate global molecular alignments for small organic molecules. The study introduces a new approach to molecular alignment in the Simplified Molecular Input Line Entry System (SMILES) format. This method leverages programming and scoring alignments to minimize differences in electronegativity, here using a measure of atomic partial charges to address the challenge of understanding structural transformations in reaction pathways. This can be applied to study transitions from linear to cyclical pathways.</p><p><strong>Results: </strong>The proposed method is based on the Needleman-Wunsch algorithm for sequence alignment, but it uses a modified scoring function for different input data. Validation against a benchmarked dataset from the Krebs cycle, based on the known chemical transformations in the pathway, confirmed the efficacy of the approach in aligning atoms that are known to be the same across the transformation. The algorithm also quantified each transformation of metabolites in the Pentose Phosphate Pathway and in Glycolysis. The method was used to study the difference in chemical similarity over transformations between linear and cyclical pathways. The study found a midpoint dissimilarity peak in cyclical pathways (particularly the Krebs Cycle) and a progressive decrease in molecular similarity in linear pathways, consistent with expectations.</p><p><strong>Conclusions: </strong>The study introduces an algorithm that quantifies molecular transformations in metabolic pathways. The algorithm effectively highlights structural changes and was applied to a hypothesis about the transition from linear to cyclical structures. The software, which provides valuable insights into molecular transformations, is available at: https://github.com/24atang/SMILES-Alignment.git.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"251"},"PeriodicalIF":3.3000,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12534939/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12859-025-06278-y","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: There is a need for computational approaches to compare small organic molecules based on chemical similarity or for evaluating biochemical transformations. No tool currently exists to generate global molecular alignments for small organic molecules. The study introduces a new approach to molecular alignment in the Simplified Molecular Input Line Entry System (SMILES) format. This method leverages programming and scoring alignments to minimize differences in electronegativity, here using a measure of atomic partial charges to address the challenge of understanding structural transformations in reaction pathways. This can be applied to study transitions from linear to cyclical pathways.

Results: The proposed method is based on the Needleman-Wunsch algorithm for sequence alignment, but it uses a modified scoring function for different input data. Validation against a benchmarked dataset from the Krebs cycle, based on the known chemical transformations in the pathway, confirmed the efficacy of the approach in aligning atoms that are known to be the same across the transformation. The algorithm also quantified each transformation of metabolites in the Pentose Phosphate Pathway and in Glycolysis. The method was used to study the difference in chemical similarity over transformations between linear and cyclical pathways. The study found a midpoint dissimilarity peak in cyclical pathways (particularly the Krebs Cycle) and a progressive decrease in molecular similarity in linear pathways, consistent with expectations.

Conclusions: The study introduces an algorithm that quantifies molecular transformations in metabolic pathways. The algorithm effectively highlights structural changes and was applied to a hypothesis about the transition from linear to cyclical structures. The software, which provides valuable insights into molecular transformations, is available at: https://github.com/24atang/SMILES-Alignment.git.

SMILES校准:用于代谢物和其他小有机分子校准的动态规划方法。
背景:需要基于化学相似性比较小有机分子或评估生化转化的计算方法。目前还没有工具可以生成小有机分子的全局分子比对。该研究介绍了一种简化分子输入线输入系统(SMILES)格式的分子定位新方法。该方法利用编程和评分对齐来最小化电负性差异,这里使用原子部分电荷的测量来解决理解反应途径中的结构转变的挑战。这可以应用于研究从线性路径到周期性路径的转变。结果:该方法基于Needleman-Wunsch算法进行序列比对,但对不同的输入数据使用了改进的评分函数。基于该途径中已知的化学转化,对来自克雷布斯循环的基准数据集进行验证,证实了该方法在对齐已知在整个转化过程中相同的原子方面的有效性。该算法还量化了戊糖磷酸途径和糖酵解过程中代谢物的每一次转化。该方法被用来研究在线性和循环途径之间的转换的化学相似性的差异。研究发现,在周期性途径(尤其是克雷布斯循环)中存在一个中点差异峰值,而在线性途径中,分子相似性逐渐降低,这与预期一致。结论:该研究引入了一种量化代谢途径中分子转化的算法。该算法有效地突出了结构变化,并应用于线性结构向周期性结构过渡的假设。该软件为分子转化提供了有价值的见解,可在https://github.com/24atang/SMILES-Alignment.git上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Bioinformatics
BMC Bioinformatics 生物-生化研究方法
CiteScore
5.70
自引率
3.30%
发文量
506
审稿时长
4.3 months
期刊介绍: BMC Bioinformatics is an open access, peer-reviewed journal that considers articles on all aspects of the development, testing and novel application of computational and statistical methods for the modeling and analysis of all kinds of biological data, as well as other areas of computational biology. BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信