Cassandra R Woolley, Julia H Chariker, Eric C Rouchka, Easton E Ford, Elizabeth Hudson, Kamille M Rasche, Caleb S Whitley, Zachary Vanwinkle, Carolyn R Casella, Melissa L Smith, Thomas C Mitchell
{"title":"Full-length mRNA sequencing resolves novel variation in 5' UTR length for genes expressed during human CD4 T-cell activation.","authors":"Cassandra R Woolley, Julia H Chariker, Eric C Rouchka, Easton E Ford, Elizabeth Hudson, Kamille M Rasche, Caleb S Whitley, Zachary Vanwinkle, Carolyn R Casella, Melissa L Smith, Thomas C Mitchell","doi":"10.1007/s00251-025-01371-1","DOIUrl":null,"url":null,"abstract":"<p><p>Isoform sequencing (Iso-Seq) uses long-read technology to produce highly accurate full-length reads of mRNA transcripts. Visualization of individual mRNA molecules can reveal new details of transcript variation within understudied portions of mRNA, such as the 5' untranslated region (UTR). Differential 5' UTRs may contain motifs, upstream open reading frames (uORFs), and secondary structures that can serve to regulate translation or further indicate changes in promoter usage, where transcriptional control may impact protein expression levels. To begin to explore isoform variation during T-cell activation, we generated the first Iso-Seq reference transcriptome of activated human CD4 T cells. Within this dataset, we discovered many novel splice- and end-variant transcripts. Remarkably, one in every eight genes expressed in our dataset was found to have a notable proportion of transcripts with 5' UTR lengthened by over 100 bp compared to the longest corresponding UTR within the Gencode dataset. Among these end-variant transcripts, two novel isoforms were identified for CXCR5, a chemokine receptor associated with T follicular helper cell (Tfh) function and differentiation. When investigated in a model cell system, these lengthened UTR conferred reduced transcript stability and, for one of these isoforms, short uORFs introduced by the added length altered protein expression kinetics. This study highlights instances in which current reference databases are incomplete relative to the information obtained by long-read sequencing of intact mRNA. Iso-Seq is thus a promising approach to better understanding the plasticity of promoter usage, alternative splicing, and UTR sequences that influence RNA stability and translation efficiency.</p>","PeriodicalId":13446,"journal":{"name":"Immunogenetics","volume":"77 1","pages":"14"},"PeriodicalIF":2.9000,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11794378/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Immunogenetics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00251-025-01371-1","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Isoform sequencing (Iso-Seq) uses long-read technology to produce highly accurate full-length reads of mRNA transcripts. Visualization of individual mRNA molecules can reveal new details of transcript variation within understudied portions of mRNA, such as the 5' untranslated region (UTR). Differential 5' UTRs may contain motifs, upstream open reading frames (uORFs), and secondary structures that can serve to regulate translation or further indicate changes in promoter usage, where transcriptional control may impact protein expression levels. To begin to explore isoform variation during T-cell activation, we generated the first Iso-Seq reference transcriptome of activated human CD4 T cells. Within this dataset, we discovered many novel splice- and end-variant transcripts. Remarkably, one in every eight genes expressed in our dataset was found to have a notable proportion of transcripts with 5' UTR lengthened by over 100 bp compared to the longest corresponding UTR within the Gencode dataset. Among these end-variant transcripts, two novel isoforms were identified for CXCR5, a chemokine receptor associated with T follicular helper cell (Tfh) function and differentiation. When investigated in a model cell system, these lengthened UTR conferred reduced transcript stability and, for one of these isoforms, short uORFs introduced by the added length altered protein expression kinetics. This study highlights instances in which current reference databases are incomplete relative to the information obtained by long-read sequencing of intact mRNA. Iso-Seq is thus a promising approach to better understanding the plasticity of promoter usage, alternative splicing, and UTR sequences that influence RNA stability and translation efficiency.
期刊介绍:
Immunogenetics publishes original papers, brief communications, and reviews on research in the following areas: genetics and evolution of the immune system; genetic control of immune response and disease susceptibility; bioinformatics of the immune system; structure of immunologically important molecules; and immunogenetics of reproductive biology, tissue differentiation, and development.