De Novo Glycan Annotation of Mass Spectrometry Data
Margot Bligh, Sebastian Silva-Solar, Linda Biehler, Christopher C J Fitzgerald, Conor J Crawford, Mikkel Schultz-Johansen, Sofie Niggemeier, Peter H Seeberger, Manuel Liebeke, Jan-Hendrik Hehemann
Journal of the American Society for Mass Spectrometry (Journal Article), published 2025-07-02. DOI: 10.1021/jasms.5c00093 (https://doi.org/10.1021/jasms.5c00093)
Citation count: 0
Abstract
Carbohydrates are fundamental molecules of life that are involved in virtually all biological processes. The chemical diversity of glycans (carbohydrate chains) enables diverse functions but also challenges analytics. Annotation of glycans in mass spectrometry (MS) data relies heavily on experimental databases or manual calculations, hindering the discovery of novel glycan compositions and structures. Here, we introduce GlycoAnnotateR, a package in the open-source programming language R, for de novo annotation of glycan compositions in MS data. GlycoAnnotateR calculates all possible monomer and modification combinations, which are then filtered against a defined set of chemical rules to provide biologically relevant compositions. The "glycoPredict" function can return compositions for oligosaccharides ranging from 1 to 22 monomers in length while accounting for four different modifications in under 10 min with less than 4 GB of random-access memory (RAM). Three case studies demonstrate the efficacy and versatility of GlycoAnnotateR: (1) accurate identification of mono- and oligosaccharide standards, (2) characterization of sulfated fucan oligosaccharides obtained by enzymatic digestion of fucoidan, a complex algal glycan, and (3) reproduction and expansion of glycan annotations for a published mouse lung MALDI-MS imaging data set previously annotated by NGlycDB. GlycoAnnotateR rapidly provides accurate annotations and complements existing R packages for MS data processing, enabling metabolomic and glycomic data integration. This combinatorial, rule-based approach enhances glycan annotation capabilities and supports hypothesis generation in glycoscience, expanding our ability to explore the chemical space of glycan diversity.
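As a rough illustration of the enumerate-then-filter idea described in the abstract, the Python sketch below lists monomer and modification combinations, discards those that fail a simple rule, and matches the survivors to an observed m/z. It is not GlycoAnnotateR's code or API (the package is written in R). The function names, the restriction to hexose and pentose monomers with sulfate as the only modification, the one-sulfate-per-monomer rule, and the [M-H]- ion form are all assumptions made for this example.

```python
from itertools import product

# Monoisotopic masses in Da (standard values, rounded to 5 decimals).
HEXOSE = 180.06339   # C6H12O6
PENTOSE = 150.05282  # C5H10O5
WATER = 18.01056     # H2O lost per glycosidic bond
SULFATE = 79.95682   # SO3 gained per sulfation
PROTON = 1.00728     # removed to form [M-H]-


def passes_rules(hexose, pentose, sulfate, max_sulfate_per_monomer=1):
    """Hypothetical chemical rule: at least one monomer, limited sulfation."""
    dp = hexose + pentose  # degree of polymerisation
    return dp >= 1 and sulfate <= dp * max_sulfate_per_monomer


def enumerate_compositions(max_dp):
    """Return (hexose, pentose, sulfate, neutral_mass) for every valid composition."""
    counts = range(max_dp + 1)
    compositions = []
    for hexose, pentose, sulfate in product(counts, counts, counts):
        dp = hexose + pentose
        if dp > max_dp or not passes_rules(hexose, pentose, sulfate):
            continue
        mass = (hexose * HEXOSE + pentose * PENTOSE
                - (dp - 1) * WATER + sulfate * SULFATE)
        compositions.append((hexose, pentose, sulfate, mass))
    return compositions


def annotate(mz, max_dp=4, ppm=5.0):
    """Return compositions whose [M-H]- ion lies within `ppm` of the observed m/z."""
    hits = []
    for hexose, pentose, sulfate, mass in enumerate_compositions(max_dp):
        ion = mass - PROTON
        if abs(ion - mz) / ion * 1e6 <= ppm:
            hits.append((hexose, pentose, sulfate))
    return hits


print(annotate(341.1089))  # unsulfated dihexose: [(2, 0, 0)]
```

The number of candidates grows quickly with chain length and with the number of modifications considered, which is why a rule filter of this kind matters for keeping the output to plausible compositions.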
About the journal:
The Journal of the American Society for Mass Spectrometry presents research papers covering all aspects of mass spectrometry, incorporating coverage of fields of scientific inquiry in which mass spectrometry can play a role.
Comprehensive in scope, the journal publishes papers on both fundamentals and applications of mass spectrometry. Fundamental subjects include instrumentation principles, design, and demonstration; structures and chemical properties of gas-phase ions; studies of thermodynamic properties; ion spectroscopy; chemical kinetics; mechanisms of ionization; theories of ion fragmentation; cluster ions; and potential energy surfaces. In addition to full papers, the journal offers Communications, Application Notes, and Accounts and Perspectives.