Modeling interactions between Heparan sulfate and proteins based on the Heparan sulfate microarray analysis.

IF 3.4 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
Cleber C Melo-Filho, Guowei Su, Kevin Liu, Eugene N Muratov, Alexander Tropsha, Jian Liu
{"title":"Modeling interactions between Heparan sulfate and proteins based on the Heparan sulfate microarray analysis.","authors":"Cleber C Melo-Filho, Guowei Su, Kevin Liu, Eugene N Muratov, Alexander Tropsha, Jian Liu","doi":"10.1093/glycob/cwae039","DOIUrl":null,"url":null,"abstract":"<p><p>Heparan sulfate (HS), a sulfated polysaccharide abundant in the extracellular matrix, plays pivotal roles in various physiological and pathological processes by interacting with proteins. Investigating the binding selectivity of HS oligosaccharides to target proteins is essential, but the exhaustive inclusion of all possible oligosaccharides in microarray experiments is impractical. To address this challenge, we present a hybrid pipeline that integrates microarray and in silico techniques to design oligosaccharides with desired protein affinity. Using fibroblast growth factor 2 (FGF2) as a model protein, we assembled an in-house dataset of HS oligosaccharides on microarrays and developed two structural representations: a standard representation with all atoms explicit and a simplified representation with disaccharide units as \"quasi-atoms.\" Predictive Quantitative Structure-Activity Relationship (QSAR) models for FGF2 affinity were developed using the Random Forest (RF) algorithm. The resulting models, considering the applicability domain, demonstrated high predictivity, with a correct classification rate of 0.81-0.80 and improved positive predictive values (PPV) up to 0.95. Virtual screening of 40 new oligosaccharides using the simplified model identified 15 computational hits, 11 of which were experimentally validated for high FGF2 affinity. This hybrid approach marks a significant step toward the targeted design of oligosaccharides with desired protein interactions, providing a foundation for broader applications in glycobiology.</p>","PeriodicalId":12766,"journal":{"name":"Glycobiology","volume":" ","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2024-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11180703/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Glycobiology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/glycob/cwae039","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Heparan sulfate (HS), a sulfated polysaccharide abundant in the extracellular matrix, plays pivotal roles in various physiological and pathological processes by interacting with proteins. Investigating the binding selectivity of HS oligosaccharides to target proteins is essential, but the exhaustive inclusion of all possible oligosaccharides in microarray experiments is impractical. To address this challenge, we present a hybrid pipeline that integrates microarray and in silico techniques to design oligosaccharides with desired protein affinity. Using fibroblast growth factor 2 (FGF2) as a model protein, we assembled an in-house dataset of HS oligosaccharides on microarrays and developed two structural representations: a standard representation with all atoms explicit and a simplified representation with disaccharide units as "quasi-atoms." Predictive Quantitative Structure-Activity Relationship (QSAR) models for FGF2 affinity were developed using the Random Forest (RF) algorithm. The resulting models, considering the applicability domain, demonstrated high predictivity, with a correct classification rate of 0.81-0.80 and improved positive predictive values (PPV) up to 0.95. Virtual screening of 40 new oligosaccharides using the simplified model identified 15 computational hits, 11 of which were experimentally validated for high FGF2 affinity. This hybrid approach marks a significant step toward the targeted design of oligosaccharides with desired protein interactions, providing a foundation for broader applications in glycobiology.

根据硫酸头孢菌素芯片分析,建立硫酸头孢菌素与蛋白质之间的相互作用模型。
硫酸肝素(HS)是一种硫酸化多糖,在细胞外基质中含量丰富,通过与蛋白质相互作用在各种生理和病理过程中发挥着关键作用。研究 HS 寡糖与目标蛋白的结合选择性至关重要,但在芯片实验中穷尽所有可能的寡糖是不切实际的。为了应对这一挑战,我们提出了一种混合管道,它整合了微阵列和硅学技术来设计具有所需蛋白质亲和力的寡糖。我们以成纤维细胞生长因子 2(FGF2)为模型蛋白质,在微阵列上建立了一个 HS 寡糖的内部数据集,并开发了两种结构表示法:一种是所有原子都明确的标准表示法,另一种是以双糖单位作为 "准原子 "的简化表示法。使用随机森林(RF)算法建立了 FGF2 亲和力的定量结构-活性关系(QSAR)预测模型。考虑到适用领域,所建立的模型具有很高的预测性,正确分类率为 0.81-0.80,阳性预测值(PPV)提高到 0.95。利用简化模型对 40 种新寡糖进行了虚拟筛选,发现了 15 个计算结果,其中 11 个经实验验证具有较高的 FGF2 亲和力。这种混合方法标志着我们在有针对性地设计具有所需蛋白质相互作用的寡糖方面迈出了重要一步,为糖生物学的更广泛应用奠定了基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Glycobiology
Glycobiology 生物-生化与分子生物学
CiteScore
7.50
自引率
4.70%
发文量
73
审稿时长
3 months
期刊介绍: Established as the leading journal in the field, Glycobiology provides a unique forum dedicated to research into the biological functions of glycans, including glycoproteins, glycolipids, proteoglycans and free oligosaccharides, and on proteins that specifically interact with glycans (including lectins, glycosyltransferases, and glycosidases). Glycobiology is essential reading for researchers in biomedicine, basic science, and the biotechnology industries. By providing a single forum, the journal aims to improve communication between glycobiologists working in different disciplines and to increase the overall visibility of the field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信