{"title":"An automatic annotation tool and reference database for T cell subtypes and states at single-cell resolution.","authors":"Wen-Kang Shen, Chu-Yu Zhang, Yi-Min Gu, Tao Luo, Si-Yi Chen, Tao Yue, Gui-Yan Xie, Yu Liao, Yong Yuan, Qian Lei, An-Yuan Guo","doi":"10.1016/j.scib.2025.02.043","DOIUrl":null,"url":null,"abstract":"<p><p>T cells have various subtypes and states with different functions. However, a reference list and automated annotation tool for T cell subtypes and states are lacking, which is critical for analyzing and comparing T cells under various conditions. We constructed the largest human T cell reference, containing 1,348,268 T cells from 35 conditions and 16 tissues. We classified T cells into 33 subtypes and further stratified them into 68 categories according to subtype and state. Based on this reference, we developed a tool named STCAT to automatically annotate T cells from scRNA-seq data by hierarchical models and marker correction. The accuracy of STCAT was 28% higher than that of existing tools validated on six independent datasets, including cancer and healthy samples. Using STCAT, we consistently discovered that CD4<sup>+</sup> Th17 cells were enriched in late-stage lung cancer patients in multiple datasets, whereas MAIT cells were prevalent in milder-stage COVID-19 patients. We also confirmed a decrease in Treg cytotoxicity in post-treatment ovarian cancer. Systematic landscape analyses of CD4<sup>+</sup> and CD8<sup>+</sup> T cell references revealed that CD4<sup>+</sup> Treg cells were enriched in tumor samples and that CD8<sup>+</sup> naive-related cells were abundant in healthy individuals. Finally, we deposited all the T cell references and annotations into a TCellAtlas (https://guolab.wchscu.cn/TCellAtlas) database, which allows users to browse T cell expression profiles and analyze customized scRNA-seq data by STCAT. In conclusion, comprehensive human T cell subtypes and states reference, automated annotation tool, and database will greatly facilitate research on T cell immunity and tumor immunology.</p>","PeriodicalId":421,"journal":{"name":"Science Bulletin","volume":" ","pages":""},"PeriodicalIF":18.8000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science Bulletin","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1016/j.scib.2025.02.043","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
T cells have various subtypes and states with different functions. However, a reference list and automated annotation tool for T cell subtypes and states are lacking, which is critical for analyzing and comparing T cells under various conditions. We constructed the largest human T cell reference, containing 1,348,268 T cells from 35 conditions and 16 tissues. We classified T cells into 33 subtypes and further stratified them into 68 categories according to subtype and state. Based on this reference, we developed a tool named STCAT to automatically annotate T cells from scRNA-seq data by hierarchical models and marker correction. The accuracy of STCAT was 28% higher than that of existing tools validated on six independent datasets, including cancer and healthy samples. Using STCAT, we consistently discovered that CD4+ Th17 cells were enriched in late-stage lung cancer patients in multiple datasets, whereas MAIT cells were prevalent in milder-stage COVID-19 patients. We also confirmed a decrease in Treg cytotoxicity in post-treatment ovarian cancer. Systematic landscape analyses of CD4+ and CD8+ T cell references revealed that CD4+ Treg cells were enriched in tumor samples and that CD8+ naive-related cells were abundant in healthy individuals. Finally, we deposited all the T cell references and annotations into a TCellAtlas (https://guolab.wchscu.cn/TCellAtlas) database, which allows users to browse T cell expression profiles and analyze customized scRNA-seq data by STCAT. In conclusion, comprehensive human T cell subtypes and states reference, automated annotation tool, and database will greatly facilitate research on T cell immunity and tumor immunology.
期刊介绍:
Science Bulletin (Sci. Bull., formerly known as Chinese Science Bulletin) is a multidisciplinary academic journal supervised by the Chinese Academy of Sciences (CAS) and co-sponsored by the CAS and the National Natural Science Foundation of China (NSFC). Sci. Bull. is a semi-monthly international journal publishing high-caliber peer-reviewed research on a broad range of natural sciences and high-tech fields on the basis of its originality, scientific significance and whether it is of general interest. In addition, we are committed to serving the scientific community with immediate, authoritative news and valuable insights into upcoming trends around the globe.