{"title":"RNA-seq based T cell repertoire extraction compared with TCR-seq.","authors":"Linoy Menda Dabran, Alona Zilberberg, Sol Efroni","doi":"10.1093/oxfimm/iqaf001","DOIUrl":null,"url":null,"abstract":"<p><p>The purpose of this study is to evaluate the feasibility of using RNA sequencing data as substrate for the computational extraction of T cell receptor sequences. Data from hundreds of thousands of samples is available as RNA sequencing. However, the use of these data for repertoires has not been contrasted against a gold standard. We conducted a benchmarking analysis, comparing T cell receptor data extracted from RNA sequencing to those obtained from T cell receptor sequencing (as gold standard) of the same tissue samples. The focus was on the extraction of Complementarity-Determining Region 3 (CDR3) sequences. To evaluate the influence of sequencing read lengths, samples were analyzed using both 75 base pair single-end and 150 base pair paired-end sequencing methods. In addition we calculated T cell abundance in these samples to test for any correlation between reads and abundance. The findings reveal a significant, perhaps too great, discrepancy between the ability to extract Complementarity-Determining Region 3 sequences from RNA sequencing data and the results obtained from TCR sequencing. The lack of significant improvement with longer read lengths, combined with the absence of correlation to T cell abundance, emphasize the necessity of using T cell receptor sequencing methodologies.</p>","PeriodicalId":74384,"journal":{"name":"Oxford open immunology","volume":"6 1","pages":"iqaf001"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11972113/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Oxford open immunology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/oxfimm/iqaf001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The purpose of this study is to evaluate the feasibility of using RNA sequencing data as substrate for the computational extraction of T cell receptor sequences. Data from hundreds of thousands of samples is available as RNA sequencing. However, the use of these data for repertoires has not been contrasted against a gold standard. We conducted a benchmarking analysis, comparing T cell receptor data extracted from RNA sequencing to those obtained from T cell receptor sequencing (as gold standard) of the same tissue samples. The focus was on the extraction of Complementarity-Determining Region 3 (CDR3) sequences. To evaluate the influence of sequencing read lengths, samples were analyzed using both 75 base pair single-end and 150 base pair paired-end sequencing methods. In addition we calculated T cell abundance in these samples to test for any correlation between reads and abundance. The findings reveal a significant, perhaps too great, discrepancy between the ability to extract Complementarity-Determining Region 3 sequences from RNA sequencing data and the results obtained from TCR sequencing. The lack of significant improvement with longer read lengths, combined with the absence of correlation to T cell abundance, emphasize the necessity of using T cell receptor sequencing methodologies.