IF 7.6 1区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Michael Russelle S. Alvarez, Xavier A. Holmes, Armin Oloumi, Sheryl Joyce Grijaldo-Alvarez, Ryan Schindler, Qingwen Zhou, Anirudh Yadlapati, Atit Silsirivanit, Carlito B. Lebrilla
{"title":"Integration of RNAseq transcriptomics and N-glycomics reveal biosynthetic pathways and predict structure-specific N-glycan expression","authors":"Michael Russelle S. Alvarez, Xavier A. Holmes, Armin Oloumi, Sheryl Joyce Grijaldo-Alvarez, Ryan Schindler, Qingwen Zhou, Anirudh Yadlapati, Atit Silsirivanit, Carlito B. Lebrilla","doi":"10.1039/d5sc00467e","DOIUrl":null,"url":null,"abstract":"The processes involved in protein <em>N</em>-glycosylation represent new therapeutic targets for diseases but their stepwise and overlapping biosynthetic processes make it challenging to identify the specific glycogenes involved. In this work, we aimed to elucidate the interactions between glycogene expression and <em>N</em>-glycan abundance by constructing supervised machine-learning models for each <em>N</em>-glycan composition. Regression models were trained to predict <em>N</em>-glycan abundance (response variable) from glycogene expression (predictors) using paired LC-MS/MS <em>N</em>-glycomic and 3′-TagSeq transcriptomic datasets from cells derived from multiple tissue origins and treatment conditions. The datasets include cells from several tissue origins – B cell, brain, colon, lung, muscle, prostate – encompassing nearly 400 <em>N</em>-glycan compounds and over 160 glycogenes filtered from an 18 000-gene transcriptome. Accurate models (validation <em>R</em><small><sup>2</sup></small> &gt; 0.8) predicted <em>N</em>-glycan abundance across cell types, including GLC01 (lung cancer), CCD19-Lu (lung fibroblast), and Tib-190 (B cell). Model importance scores ranked glycogene contributions to <em>N</em>-glycan predictions, revealing significant glycogene associations with specific <em>N</em>-glycan types. The predictions were consistent across input cell quantities, unlike LC-MS/MS glycomics which showed inconsistent results. This suggests that the models can reliably predict <em>N</em>-glycosylation even in samples with low cell amounts and by extension, single-cell samples. These findings can provide insights into cellular <em>N</em>-glycosylation machinery, offering potential therapeutic strategies for diseases linked to aberrant glycosylation, such as cancer, and neurodegenerative and autoimmune disorders.","PeriodicalId":9909,"journal":{"name":"Chemical Science","volume":"183 1","pages":""},"PeriodicalIF":7.6000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemical Science","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1039/d5sc00467e","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

蛋白质 N-糖基化过程是治疗疾病的新靶点,但其逐步和重叠的生物合成过程使得确定所涉及的特定糖基具有挑战性。在这项工作中,我们旨在通过为每种 N-糖组成构建有监督的机器学习模型来阐明糖基因表达与 N-糖丰度之间的相互作用。我们使用成对的 LC-MS/MS N-glycomic 和 3′-TagSeq 转录组数据集对回归模型进行了训练,以便根据糖基因表达(预测因子)预测 N-糖丰度(响应变量)。这些数据集包括来自多个组织来源的细胞--B 细胞、脑、结肠、肺、肌肉和前列腺--涵盖了从 18000 个基因转录组中筛选出的近 400 种 N-糖化合物和 160 多种糖元。准确的模型(验证 R2 > 0.8)预测了各种细胞类型的 N-糖丰度,包括 GLC01(肺癌)、CCD19-Lu(肺成纤维细胞)和 Tib-190(B 细胞)。模型重要性评分对糖基因对 N-聚糖预测的贡献进行了排序,揭示了糖基因与特定 N-聚糖类型的重要关联。不同输入细胞数量的预测结果是一致的,不像 LC-MS/MS 糖化学分析结果不一致。这表明,即使是细胞数量较少的样本,甚至是单细胞样本,这些模型也能可靠地预测 N-糖基化。这些发现有助于深入了解细胞 N-糖基化机制,为癌症、神经退行性疾病和自身免疫性疾病等与糖基化异常有关的疾病提供潜在的治疗策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Integration of RNAseq transcriptomics and N-glycomics reveal biosynthetic pathways and predict structure-specific N-glycan expression

Integration of RNAseq transcriptomics and N-glycomics reveal biosynthetic pathways and predict structure-specific N-glycan expression
The processes involved in protein N-glycosylation represent new therapeutic targets for diseases but their stepwise and overlapping biosynthetic processes make it challenging to identify the specific glycogenes involved. In this work, we aimed to elucidate the interactions between glycogene expression and N-glycan abundance by constructing supervised machine-learning models for each N-glycan composition. Regression models were trained to predict N-glycan abundance (response variable) from glycogene expression (predictors) using paired LC-MS/MS N-glycomic and 3′-TagSeq transcriptomic datasets from cells derived from multiple tissue origins and treatment conditions. The datasets include cells from several tissue origins – B cell, brain, colon, lung, muscle, prostate – encompassing nearly 400 N-glycan compounds and over 160 glycogenes filtered from an 18 000-gene transcriptome. Accurate models (validation R2 > 0.8) predicted N-glycan abundance across cell types, including GLC01 (lung cancer), CCD19-Lu (lung fibroblast), and Tib-190 (B cell). Model importance scores ranked glycogene contributions to N-glycan predictions, revealing significant glycogene associations with specific N-glycan types. The predictions were consistent across input cell quantities, unlike LC-MS/MS glycomics which showed inconsistent results. This suggests that the models can reliably predict N-glycosylation even in samples with low cell amounts and by extension, single-cell samples. These findings can provide insights into cellular N-glycosylation machinery, offering potential therapeutic strategies for diseases linked to aberrant glycosylation, such as cancer, and neurodegenerative and autoimmune disorders.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Chemical Science
Chemical Science CHEMISTRY, MULTIDISCIPLINARY-
CiteScore
14.40
自引率
4.80%
发文量
1352
审稿时长
2.1 months
期刊介绍: Chemical Science is a journal that encompasses various disciplines within the chemical sciences. Its scope includes publishing ground-breaking research with significant implications for its respective field, as well as appealing to a wider audience in related areas. To be considered for publication, articles must showcase innovative and original advances in their field of study and be presented in a manner that is understandable to scientists from diverse backgrounds. However, the journal generally does not publish highly specialized research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信