Sanjyot Vinayak Shenoy, Deeptarup Biswas, Arthur Zalevsky, Audrey Kishishita, Ayushi Verma, Ishan Upadhay, Yi He, Andrej Sali, Rosa Viner, Kamal Mandal, Sanjeeva Srivastava, Arun P Wiita
{"title":"AlphaCross-XL: A Seamless Tool for Automated and Proteome-Scale Mapping of Cross-Linked Peptides Onto 3D Protein Structures.","authors":"Sanjyot Vinayak Shenoy, Deeptarup Biswas, Arthur Zalevsky, Audrey Kishishita, Ayushi Verma, Ishan Upadhay, Yi He, Andrej Sali, Rosa Viner, Kamal Mandal, Sanjeeva Srivastava, Arun P Wiita","doi":"10.1016/j.mcpro.2025.101057","DOIUrl":null,"url":null,"abstract":"<p><p>Crosslinking mass spectrometry (XL-MS) is an exciting proteomics technology to capture native protein conformations in real time within biological systems. Historically, however, implementation of this technology has typically been limited to single purified recombinant proteins or in vitro-assembled protein complexes. These limitations are associated with inherent challenges in XL-MS analysis, including extremely low abundance of crosslinked (XL) peptides and complex deconvolution of XL peptide-derived spectral data. However, impressive recent developments in computation and instrumentation have now made it feasible to address biological questions using proteome-wide XL-MS analysis. Although some XL mapping software tools exist, these require manual input of specific Protein Data Bank structures at the single protein level and do not function at the high-throughput scale required to analyze datasets derived from thousands of proteins. To address this need, we therefore sought to develop a strategy enabling automated mapping of XL peptides onto the 3D structures of proteins, at a proteome-wide scale. Herein, we describe AlphaCross-XL, a first-in-class seamless computational tool for automated mapping of XL peptides onto the protein structures for intraprotein crosslinks and loop links. The AlphaCross-XL software first retrieves protein structures from the AlphaFold Protein Structure Database and maps all the identified crosslinks onto the 3D structure. It also calculates the Euclidian distance between the crosslinked residues and reports the violated and satisfied crosslink distances based on a user-defined distance threshold, which is visually discriminated by color in PyMOL. Last, the tool also supports further validation of user-submitted protein structures, which can include any computer-predicted protein structure and experimentally derived protein structures (i.e., from Protein Data Bank). AlphaCross-XL is available at https://github.com/sanjyotshenoy/alphacross-xl.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"101057"},"PeriodicalIF":5.5000,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular & Cellular Proteomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.mcpro.2025.101057","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Crosslinking mass spectrometry (XL-MS) is an exciting proteomics technology to capture native protein conformations in real time within biological systems. Historically, however, implementation of this technology has typically been limited to single purified recombinant proteins or in vitro-assembled protein complexes. These limitations are associated with inherent challenges in XL-MS analysis, including extremely low abundance of crosslinked (XL) peptides and complex deconvolution of XL peptide-derived spectral data. However, impressive recent developments in computation and instrumentation have now made it feasible to address biological questions using proteome-wide XL-MS analysis. Although some XL mapping software tools exist, these require manual input of specific Protein Data Bank structures at the single protein level and do not function at the high-throughput scale required to analyze datasets derived from thousands of proteins. To address this need, we therefore sought to develop a strategy enabling automated mapping of XL peptides onto the 3D structures of proteins, at a proteome-wide scale. Herein, we describe AlphaCross-XL, a first-in-class seamless computational tool for automated mapping of XL peptides onto the protein structures for intraprotein crosslinks and loop links. The AlphaCross-XL software first retrieves protein structures from the AlphaFold Protein Structure Database and maps all the identified crosslinks onto the 3D structure. It also calculates the Euclidian distance between the crosslinked residues and reports the violated and satisfied crosslink distances based on a user-defined distance threshold, which is visually discriminated by color in PyMOL. Last, the tool also supports further validation of user-submitted protein structures, which can include any computer-predicted protein structure and experimentally derived protein structures (i.e., from Protein Data Bank). AlphaCross-XL is available at https://github.com/sanjyotshenoy/alphacross-xl.
期刊介绍:
The mission of MCP is to foster the development and applications of proteomics in both basic and translational research. MCP will publish manuscripts that report significant new biological or clinical discoveries underpinned by proteomic observations across all kingdoms of life. Manuscripts must define the biological roles played by the proteins investigated or their mechanisms of action.
The journal also emphasizes articles that describe innovative new computational methods and technological advancements that will enable future discoveries. Manuscripts describing such approaches do not have to include a solution to a biological problem, but must demonstrate that the technology works as described, is reproducible and is appropriate to uncover yet unknown protein/proteome function or properties using relevant model systems or publicly available data.
Scope:
-Fundamental studies in biology, including integrative "omics" studies, that provide mechanistic insights
-Novel experimental and computational technologies
-Proteogenomic data integration and analysis that enable greater understanding of physiology and disease processes
-Pathway and network analyses of signaling that focus on the roles of post-translational modifications
-Studies of proteome dynamics and quality controls, and their roles in disease
-Studies of evolutionary processes effecting proteome dynamics, quality and regulation
-Chemical proteomics, including mechanisms of drug action
-Proteomics of the immune system and antigen presentation/recognition
-Microbiome proteomics, host-microbe and host-pathogen interactions, and their roles in health and disease
-Clinical and translational studies of human diseases
-Metabolomics to understand functional connections between genes, proteins and phenotypes