{"title":"基于大型语言模型标识符的单元格类型标注可靠性评估。","authors":"Wenjin Ye, Yuanchen Ma, Junkai Xiang, Hongjie Liang, Jintian Luo, Yuantao Li, Tao Wang, Qiuling Xiang, Wu Song, Weiqiang Li, Weijun Huang","doi":"10.1038/s42003-025-08745-x","DOIUrl":null,"url":null,"abstract":"<p><p>Ensuring accurate cell type annotation in single-cell RNA sequencing data is a significant challenge, as both expert and automated methods can be biased or constrained by their training data, leading to errors and time-consuming revisions. To address this, we developed LICT (Large Language Model-based Identifier for Cell Types), a tool that leverages multi-model integration and a \"talk-to-machine\" approach. Validated across diverse datasets, LICT consistently aligns with expert annotations. With its objective framework for assessing annotation reliability, LICT can interpret cases where a single cell population exhibits multifaceted traits, allowing researchers to focus on the underlying biological insights. Comparisons with existing tools highlight LICT's superiority in efficiency, consistency, accuracy, and reliability, establishing it as a powerful tool for single-cell RNA sequencing analysis. Furthermore, its independence from reference data emphasizes LICT's generalizability, enhancing reproducibility and ensuring more reliable results in cellular research.</p>","PeriodicalId":10552,"journal":{"name":"Communications Biology","volume":"8 1","pages":"1360"},"PeriodicalIF":5.1000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12462508/pdf/","citationCount":"0","resultStr":"{\"title\":\"Evaluation of cell type annotation reliability using a large language model-based identifier.\",\"authors\":\"Wenjin Ye, Yuanchen Ma, Junkai Xiang, Hongjie Liang, Jintian Luo, Yuantao Li, Tao Wang, Qiuling Xiang, Wu Song, Weiqiang Li, Weijun Huang\",\"doi\":\"10.1038/s42003-025-08745-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Ensuring accurate cell type annotation in single-cell RNA sequencing data is a significant challenge, as both expert and automated methods can be biased or constrained by their training data, leading to errors and time-consuming revisions. To address this, we developed LICT (Large Language Model-based Identifier for Cell Types), a tool that leverages multi-model integration and a \\\"talk-to-machine\\\" approach. Validated across diverse datasets, LICT consistently aligns with expert annotations. With its objective framework for assessing annotation reliability, LICT can interpret cases where a single cell population exhibits multifaceted traits, allowing researchers to focus on the underlying biological insights. Comparisons with existing tools highlight LICT's superiority in efficiency, consistency, accuracy, and reliability, establishing it as a powerful tool for single-cell RNA sequencing analysis. Furthermore, its independence from reference data emphasizes LICT's generalizability, enhancing reproducibility and ensuring more reliable results in cellular research.</p>\",\"PeriodicalId\":10552,\"journal\":{\"name\":\"Communications Biology\",\"volume\":\"8 1\",\"pages\":\"1360\"},\"PeriodicalIF\":5.1000,\"publicationDate\":\"2025-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12462508/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1038/s42003-025-08745-x\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s42003-025-08745-x","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
Evaluation of cell type annotation reliability using a large language model-based identifier.
Ensuring accurate cell type annotation in single-cell RNA sequencing data is a significant challenge, as both expert and automated methods can be biased or constrained by their training data, leading to errors and time-consuming revisions. To address this, we developed LICT (Large Language Model-based Identifier for Cell Types), a tool that leverages multi-model integration and a "talk-to-machine" approach. Validated across diverse datasets, LICT consistently aligns with expert annotations. With its objective framework for assessing annotation reliability, LICT can interpret cases where a single cell population exhibits multifaceted traits, allowing researchers to focus on the underlying biological insights. Comparisons with existing tools highlight LICT's superiority in efficiency, consistency, accuracy, and reliability, establishing it as a powerful tool for single-cell RNA sequencing analysis. Furthermore, its independence from reference data emphasizes LICT's generalizability, enhancing reproducibility and ensuring more reliable results in cellular research.
期刊介绍:
Communications Biology is an open access journal from Nature Research publishing high-quality research, reviews and commentary in all areas of the biological sciences. Research papers published by the journal represent significant advances bringing new biological insight to a specialized area of research.