Yuxuan Song, Chengxin Zhang, Gilbert S. Omenn, Matthew J. O’Meara, Joshua D. Welch
{"title":"Predicting the structural impact of human alternative splicing","authors":"Yuxuan Song, Chengxin Zhang, Gilbert S. Omenn, Matthew J. O’Meara, Joshua D. Welch","doi":"10.1186/s13059-025-03744-x","DOIUrl":null,"url":null,"abstract":"Protein structure prediction with neural networks is a powerful new method for linking protein sequence, structure, and function, but structures have generally been predicted for only a single isoform of each gene, neglecting splice variants. To investigate the structural implications of alternative splicing, we use AlphaFold2 to predict the structures of more than 11,000 human isoforms. We employ multiple metrics to identify splicing-induced structural alterations, including template matching score, secondary structure composition, surface charge distribution, radius of gyration, accessibility of post-translational modification sites, and structure-based function prediction. We identify examples of how alternative splicing induces clear changes in each of these properties. Structural similarity between isoforms largely correlates with degree of sequence identity, but we identify a subset of isoforms with low structural similarity despite high sequence similarity. Exon skipping and alternative last exons tend to increase the surface charge and radius of gyration. Splicing also buries or exposes numerous post-translational modification sites, most notably among the isoforms of BAX. Functional prediction identifies numerous functional differences between isoforms of the same gene, with loss of function compared to the reference predominating. Finally, we use single-cell RNA-seq data from the Tabula Sapiens to determine the cell types in which each structure is expressed. Our work represents an important resource for studying the structure and function of splice isoforms across the cell types of the human body.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":"17 1","pages":""},"PeriodicalIF":10.1000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13059-025-03744-x","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Protein structure prediction with neural networks is a powerful new method for linking protein sequence, structure, and function, but structures have generally been predicted for only a single isoform of each gene, neglecting splice variants. To investigate the structural implications of alternative splicing, we use AlphaFold2 to predict the structures of more than 11,000 human isoforms. We employ multiple metrics to identify splicing-induced structural alterations, including template matching score, secondary structure composition, surface charge distribution, radius of gyration, accessibility of post-translational modification sites, and structure-based function prediction. We identify examples of how alternative splicing induces clear changes in each of these properties. Structural similarity between isoforms largely correlates with degree of sequence identity, but we identify a subset of isoforms with low structural similarity despite high sequence similarity. Exon skipping and alternative last exons tend to increase the surface charge and radius of gyration. Splicing also buries or exposes numerous post-translational modification sites, most notably among the isoforms of BAX. Functional prediction identifies numerous functional differences between isoforms of the same gene, with loss of function compared to the reference predominating. Finally, we use single-cell RNA-seq data from the Tabula Sapiens to determine the cell types in which each structure is expressed. Our work represents an important resource for studying the structure and function of splice isoforms across the cell types of the human body.
Genome BiologyBiochemistry, Genetics and Molecular Biology-Genetics
CiteScore
21.00
自引率
3.30%
发文量
241
审稿时长
2 months
期刊介绍:
Genome Biology stands as a premier platform for exceptional research across all domains of biology and biomedicine, explored through a genomic and post-genomic lens.
With an impressive impact factor of 12.3 (2022),* the journal secures its position as the 3rd-ranked research journal in the Genetics and Heredity category and the 2nd-ranked research journal in the Biotechnology and Applied Microbiology category by Thomson Reuters. Notably, Genome Biology holds the distinction of being the highest-ranked open-access journal in this category.
Our dedicated team of highly trained in-house Editors collaborates closely with our esteemed Editorial Board of international experts, ensuring the journal remains on the forefront of scientific advances and community standards. Regular engagement with researchers at conferences and institute visits underscores our commitment to staying abreast of the latest developments in the field.