Watshara Shoombuatong , Saeed Ahmed , SM Hasan Mahmud , Nalini Schaduangrat
{"title":"基于机器学习的肿瘤T细胞抗原识别方法综述与评价","authors":"Watshara Shoombuatong , Saeed Ahmed , SM Hasan Mahmud , Nalini Schaduangrat","doi":"10.1016/j.compbiolchem.2025.108440","DOIUrl":null,"url":null,"abstract":"<div><div>The precise identification of tumor T-cell antigens (TTCAs) is crucial for advancements in cancer immunotherapy and other clinical uses. In contrast to the labor-intensive and time-consuming process of experimentally identifying TTCAs, computational prediction offers a complementary approach by providing a shortlist of probable TTCA candidates for further experimental validation. Currently, several computational approaches, primarily based on machine learning (ML) methods, have garnered considerable attention for the <em>in silico</em> identification of tumor T-cell antigens (TTCAs). Therefore, this study presents a comprehensive survey on the existing state-of-the-art TTCA predictors. Based on our research, this is the first comprehensive review focused on both traditional ML and ensemble learning methods for TTCA identification. Specifically, we examine critical aspects of TTCA predictor development, including core algorithms, methodologies, benchmark datasets, feature encoding methods, feature selection approaches, and web server usability. We then analyze and compare the effectiveness and robustness of existing predictors across well-known benchmark datasets and case studies. Finally, we provide a detailed summary of the advantages and disadvantages of current TTCA predictors, along with essential insights and suggestions for developing novel computational approaches to accurately identify TTCAs. The insights gained from this review and benchmarking survey are expected to offer valuable guidance to researchers, aiding in the development of high-accuracy TTCA predictors for improved antigen identification in the future.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"118 ","pages":"Article 108440"},"PeriodicalIF":2.6000,"publicationDate":"2025-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A comprehensive review and evaluation of machine learning-based approaches for identifying tumor T cell antigens\",\"authors\":\"Watshara Shoombuatong , Saeed Ahmed , SM Hasan Mahmud , Nalini Schaduangrat\",\"doi\":\"10.1016/j.compbiolchem.2025.108440\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The precise identification of tumor T-cell antigens (TTCAs) is crucial for advancements in cancer immunotherapy and other clinical uses. In contrast to the labor-intensive and time-consuming process of experimentally identifying TTCAs, computational prediction offers a complementary approach by providing a shortlist of probable TTCA candidates for further experimental validation. Currently, several computational approaches, primarily based on machine learning (ML) methods, have garnered considerable attention for the <em>in silico</em> identification of tumor T-cell antigens (TTCAs). Therefore, this study presents a comprehensive survey on the existing state-of-the-art TTCA predictors. Based on our research, this is the first comprehensive review focused on both traditional ML and ensemble learning methods for TTCA identification. Specifically, we examine critical aspects of TTCA predictor development, including core algorithms, methodologies, benchmark datasets, feature encoding methods, feature selection approaches, and web server usability. We then analyze and compare the effectiveness and robustness of existing predictors across well-known benchmark datasets and case studies. Finally, we provide a detailed summary of the advantages and disadvantages of current TTCA predictors, along with essential insights and suggestions for developing novel computational approaches to accurately identify TTCAs. The insights gained from this review and benchmarking survey are expected to offer valuable guidance to researchers, aiding in the development of high-accuracy TTCA predictors for improved antigen identification in the future.</div></div>\",\"PeriodicalId\":10616,\"journal\":{\"name\":\"Computational Biology and Chemistry\",\"volume\":\"118 \",\"pages\":\"Article 108440\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-04-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Biology and Chemistry\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1476927125001008\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Biology and Chemistry","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1476927125001008","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
A comprehensive review and evaluation of machine learning-based approaches for identifying tumor T cell antigens
The precise identification of tumor T-cell antigens (TTCAs) is crucial for advancements in cancer immunotherapy and other clinical uses. In contrast to the labor-intensive and time-consuming process of experimentally identifying TTCAs, computational prediction offers a complementary approach by providing a shortlist of probable TTCA candidates for further experimental validation. Currently, several computational approaches, primarily based on machine learning (ML) methods, have garnered considerable attention for the in silico identification of tumor T-cell antigens (TTCAs). Therefore, this study presents a comprehensive survey on the existing state-of-the-art TTCA predictors. Based on our research, this is the first comprehensive review focused on both traditional ML and ensemble learning methods for TTCA identification. Specifically, we examine critical aspects of TTCA predictor development, including core algorithms, methodologies, benchmark datasets, feature encoding methods, feature selection approaches, and web server usability. We then analyze and compare the effectiveness and robustness of existing predictors across well-known benchmark datasets and case studies. Finally, we provide a detailed summary of the advantages and disadvantages of current TTCA predictors, along with essential insights and suggestions for developing novel computational approaches to accurately identify TTCAs. The insights gained from this review and benchmarking survey are expected to offer valuable guidance to researchers, aiding in the development of high-accuracy TTCA predictors for improved antigen identification in the future.
期刊介绍:
Computational Biology and Chemistry publishes original research papers and review articles in all areas of computational life sciences. High quality research contributions with a major computational component in the areas of nucleic acid and protein sequence research, molecular evolution, molecular genetics (functional genomics and proteomics), theory and practice of either biology-specific or chemical-biology-specific modeling, and structural biology of nucleic acids and proteins are particularly welcome. Exceptionally high quality research work in bioinformatics, systems biology, ecology, computational pharmacology, metabolism, biomedical engineering, epidemiology, and statistical genetics will also be considered.
Given their inherent uncertainty, protein modeling and molecular docking studies should be thoroughly validated. In the absence of experimental results for validation, the use of molecular dynamics simulations along with detailed free energy calculations, for example, should be used as complementary techniques to support the major conclusions. Submissions of premature modeling exercises without additional biological insights will not be considered.
Review articles will generally be commissioned by the editors and should not be submitted to the journal without explicit invitation. However prospective authors are welcome to send a brief (one to three pages) synopsis, which will be evaluated by the editors.