{"title":"Combining full-length gene assay and SpliceAI to interpret the splicing impact of all possible<i>SPINK1</i>coding variants","authors":"Hao Wu, Jin-Huan Lin, Xin-Ying Tang, Wen-Bin Zou, Sacha Schutz, Emmanuelle Masson, Yann Fichou, Gerald Le Gac, Claude Ferec, Zhuan Liao, Jian-Min Chen","doi":"10.1101/2023.11.14.23298498","DOIUrl":null,"url":null,"abstract":"Background: Single-nucleotide variants (SNVs) within gene coding sequences can significantly impact pre-mRNA splicing, bearing profound implications for pathogenic mechanisms and precision medicine. However, reliable splicing analysis often faces practical limitations, especially when the relevant tissues are challenging to access. While in silico predictions are valuable, they alone do not meet clinical classification standards. In this study, we aim to harness the well-established full-length gene splicing assay (FLGSA) in conjunction with SpliceAI to prospectively interpret the splicing effects of all potential coding SNVs within the four-exon SPINK1 gene, a gene associated with chronic pancreatitis. Results: We initiated the study with a retrospective correlation analysis (involving 27 previously FLGSA-analyzed SPINK1 coding SNVs), progressed to a prospective correlation analysis (incorporating 35 newly FLGSA-tested SPINK1 coding SNVs), followed by data extrapolation, and ended with further validation. In total, we analyzed 67 SPINK1 coding SNVs, representing 9.3% of all 720 possible coding SNVs and affecting 19.2% of the 240 coding nucleotides. Among these 67 FLGSA-analyzed SNVs, 12 were found to impact splicing. Through extensive cross-correlation of the FLGSA-obtained and SpliceAI-predicted data, we reasonably extrapolated that none of the unanalyzed 653 coding SNVs in the SPINK1 gene are likely to exert a significant effect on splicing. Out of these 12 splice-altering events, nine produced both wild-type and aberrant transcripts, while the remaining three exclusively generated aberrant transcripts. These splice-altering SNVs were predominantly concentrated in exons 1 and 2, particularly affecting the first and/or last coding nucleotide of each exon. Among the 12 splice-altering events, 11 were missense variants, constituting 2.17% of the 506 potential missense variants, while one was synonymous, accounting for 0.61% of the 164 potential synonymous variants. Conclusions: Integrating FLGSA with SpliceAI, we conclude that less than 2% (1.67%) of all possible SPINK1 coding SNVs have a discernible influence on splicing outcomes. Our findings underscore the importance of performing splicing analysis in the broader genomic sequence context of the study gene, highlight the inherent uncertainties associated with intermediate SpliceAI scores (i.e., those ranging from 0.20 to 0.80), and have general implications for the shift from \"retrospective\" to \"prospective\" analysis in terms of variant classification.","PeriodicalId":478577,"journal":{"name":"medRxiv (Cold Spring Harbor Laboratory)","volume":"8 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv (Cold Spring Harbor Laboratory)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2023.11.14.23298498","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Single-nucleotide variants (SNVs) within gene coding sequences can significantly impact pre-mRNA splicing, bearing profound implications for pathogenic mechanisms and precision medicine. However, reliable splicing analysis often faces practical limitations, especially when the relevant tissues are challenging to access. While in silico predictions are valuable, they alone do not meet clinical classification standards. In this study, we aim to harness the well-established full-length gene splicing assay (FLGSA) in conjunction with SpliceAI to prospectively interpret the splicing effects of all potential coding SNVs within the four-exon SPINK1 gene, a gene associated with chronic pancreatitis. Results: We initiated the study with a retrospective correlation analysis (involving 27 previously FLGSA-analyzed SPINK1 coding SNVs), progressed to a prospective correlation analysis (incorporating 35 newly FLGSA-tested SPINK1 coding SNVs), followed by data extrapolation, and ended with further validation. In total, we analyzed 67 SPINK1 coding SNVs, representing 9.3% of all 720 possible coding SNVs and affecting 19.2% of the 240 coding nucleotides. Among these 67 FLGSA-analyzed SNVs, 12 were found to impact splicing. Through extensive cross-correlation of the FLGSA-obtained and SpliceAI-predicted data, we reasonably extrapolated that none of the unanalyzed 653 coding SNVs in the SPINK1 gene are likely to exert a significant effect on splicing. Out of these 12 splice-altering events, nine produced both wild-type and aberrant transcripts, while the remaining three exclusively generated aberrant transcripts. These splice-altering SNVs were predominantly concentrated in exons 1 and 2, particularly affecting the first and/or last coding nucleotide of each exon. Among the 12 splice-altering events, 11 were missense variants, constituting 2.17% of the 506 potential missense variants, while one was synonymous, accounting for 0.61% of the 164 potential synonymous variants. Conclusions: Integrating FLGSA with SpliceAI, we conclude that less than 2% (1.67%) of all possible SPINK1 coding SNVs have a discernible influence on splicing outcomes. Our findings underscore the importance of performing splicing analysis in the broader genomic sequence context of the study gene, highlight the inherent uncertainties associated with intermediate SpliceAI scores (i.e., those ranging from 0.20 to 0.80), and have general implications for the shift from "retrospective" to "prospective" analysis in terms of variant classification.