ASMC: investigating the amino acid diversity of enzyme active sites.

Thomas Bailly, Eddy Elisée, David Vallenet
{"title":"ASMC: investigating the amino acid diversity of enzyme active sites.","authors":"Thomas Bailly, Eddy Elisée, David Vallenet","doi":"10.1093/bioinformatics/btaf307","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>The analysis of enzyme active sites is essential for understanding their activity in terms of catalyzed reaction and substrate specificity, providing insights for engineering to obtain targeted properties or modify the substrate scope. In 2010, a first version of the Active Site Modeling and Clustering (ASMC) workflow was published. ASMC predicts isofunctional clusters from enzyme families, based on structural modeling and clustering of active sites. Since then, structure- and sequence-based methods have developed considerably.</p><p><strong>Results: </strong>We present here a redesign of the ASMC workflow. This new major version includes recent pocket prediction, structural alignment and clustering methods, as well as a refined amino acid distance matrix, thereby improving the relevance of results and reducing the need for laborious manual analysis to obtain relevant clusters. In addition, we have implemented multiple sequence alignment (MSA) as a possible input for the clustering step, along with an additional script to compare 2D and 3D active sites. Finally, the code has been unified from three to one programming language (Python) to facilitate its installation and maintenance. This new version of ASMC was evaluated on a set of protein families, resulting in overall better performances compared to its original version.</p><p><strong>Availability and implementation: </strong>ASMC is supported on Linux operating system and freely available at https://github.com/labgem/ASMC, along with a complete documentation (wiki, tutorial).</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf307","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: The analysis of enzyme active sites is essential for understanding their activity in terms of catalyzed reaction and substrate specificity, providing insights for engineering to obtain targeted properties or modify the substrate scope. In 2010, a first version of the Active Site Modeling and Clustering (ASMC) workflow was published. ASMC predicts isofunctional clusters from enzyme families, based on structural modeling and clustering of active sites. Since then, structure- and sequence-based methods have developed considerably.

Results: We present here a redesign of the ASMC workflow. This new major version includes recent pocket prediction, structural alignment and clustering methods, as well as a refined amino acid distance matrix, thereby improving the relevance of results and reducing the need for laborious manual analysis to obtain relevant clusters. In addition, we have implemented multiple sequence alignment (MSA) as a possible input for the clustering step, along with an additional script to compare 2D and 3D active sites. Finally, the code has been unified from three to one programming language (Python) to facilitate its installation and maintenance. This new version of ASMC was evaluated on a set of protein families, resulting in overall better performances compared to its original version.

Availability and implementation: ASMC is supported on Linux operating system and freely available at https://github.com/labgem/ASMC, along with a complete documentation (wiki, tutorial).

Supplementary information: Supplementary data are available at Bioinformatics online.

ASMC:研究酶活性位点的氨基酸多样性。
动机:酶活性位点的分析对于了解它们在催化反应和底物特异性方面的活性是必不可少的,为工程获得目标性质或修改底物范围提供见解。2010年,发布了第一个版本的活动站点建模和集群(ASMC)工作流。ASMC预测来自酶家族的同功能簇,基于结构建模和活性位点的聚类。从那时起,基于结构和序列的方法有了很大的发展。结果:我们在这里提出了一个重新设计的ASMC工作流程。这个新的主要版本包括最近的口袋预测,结构对齐和聚类方法,以及精炼的氨基酸距离矩阵,从而提高了结果的相关性,减少了费力的人工分析以获得相关聚类的需要。此外,我们还实现了多序列比对(MSA)作为聚类步骤的可能输入,以及用于比较2D和3D活性位点的附加脚本。最后,将代码从三种编程语言统一为一种编程语言(Python),以方便其安装和维护。这个新版本的ASMC在一组蛋白质家族上进行了评估,结果与原始版本相比,整体性能更好。可用性和实现:ASMC在Linux操作系统上得到支持,可以在https://github.com/labgem/ASMC上免费获得,并提供完整的文档(wiki、教程)。补充信息:补充数据可在生物信息学在线获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信