Yang Lv , Ting Liu , Chang Liu , Yuchen Ma , Yunfei Liu , Ze Liu , Yin Li
{"title":"LYnet:利用卷积和循环神经网络计算识别肿瘤T细胞抗原","authors":"Yang Lv , Ting Liu , Chang Liu , Yuchen Ma , Yunfei Liu , Ze Liu , Yin Li","doi":"10.1016/j.compbiolchem.2025.108630","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Immunotherapy represents a paradigm shift in oncology, offering advantages in efficacy and specificity over traditional therapies. Key to its success is the identification of T-cell antigens, which are essential for triggering an effective antitumor immune response. Current methodologies for antigen prediction, however, lack the precision required for optimal vaccine development.</div></div><div><h3>Purpose</h3><div>This study aims to address this gap by introducing a novel deep learning model for the accurate prediction of tumor T-cell antigens. It seeks to improve the identification process, thereby facilitating the creation of more effective therapeutic cancer vaccines.</div></div><div><h3>Methods</h3><div>A hybrid architecture, designated LYnet, was constructed by integrating one-dimensional Convolutional Neural Networks with bidirectional Long Short-Term Memory layers, thereby capturing both local motif patterns and long-range sequence dependencies. Nineteen complementary sequence-derived descriptors—including AAindex, AAK-mer, CKSAAP/CKSAAGP, and physicochemical composition vectors—were concatenated to form the input feature space. Class imbalance in the training set was mitigated with the SMOTE-Tomek resampling strategy. Model robustness was quantified by stratified 10-fold cross-validation, and generalisation was verified on two independent benchmark collections (TAP 1.0 and iTTCA-RF).</div></div><div><h3>Results</h3><div>Across 10-fold validation on the LYnet benchmark, the proposed model achieved an AUC of 0.992, together with a sensitivity of 0.863, specificity of 0.925 and MCC of 0.776. Independent evaluation confirmed the advantage: LYnet yielded AUCs of 0.949 on the TAP 1.0 set and 0.836 on the iTTCA-RF set, surpassing the strongest competing method by 2.4–6.9 percentage points in AUC and up to 10.6 percentage points in MCC. These results demonstrate that LYnet attains state-of-the-art accuracy and balanced prediction for tumour T-cell antigen identification.</div></div><div><h3>Conclusions</h3><div>The introduction of this deep learning model represents a significant advancement in the prediction of tumor T-cell antigens. Its superior accuracy and robustness offer substantial potential to enhance the development and efficacy of cancer immunotherapies. This work not only underscores the importance of precise antigen identification in immunotherapy but also provides a powerful computational tool to aid in the design of next-generation cancer vaccines.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"120 ","pages":"Article 108630"},"PeriodicalIF":3.1000,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LYnet: Computational identification of tumor T cell antigens using convolutional and recurrent neural networks\",\"authors\":\"Yang Lv , Ting Liu , Chang Liu , Yuchen Ma , Yunfei Liu , Ze Liu , Yin Li\",\"doi\":\"10.1016/j.compbiolchem.2025.108630\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Immunotherapy represents a paradigm shift in oncology, offering advantages in efficacy and specificity over traditional therapies. Key to its success is the identification of T-cell antigens, which are essential for triggering an effective antitumor immune response. Current methodologies for antigen prediction, however, lack the precision required for optimal vaccine development.</div></div><div><h3>Purpose</h3><div>This study aims to address this gap by introducing a novel deep learning model for the accurate prediction of tumor T-cell antigens. It seeks to improve the identification process, thereby facilitating the creation of more effective therapeutic cancer vaccines.</div></div><div><h3>Methods</h3><div>A hybrid architecture, designated LYnet, was constructed by integrating one-dimensional Convolutional Neural Networks with bidirectional Long Short-Term Memory layers, thereby capturing both local motif patterns and long-range sequence dependencies. Nineteen complementary sequence-derived descriptors—including AAindex, AAK-mer, CKSAAP/CKSAAGP, and physicochemical composition vectors—were concatenated to form the input feature space. Class imbalance in the training set was mitigated with the SMOTE-Tomek resampling strategy. Model robustness was quantified by stratified 10-fold cross-validation, and generalisation was verified on two independent benchmark collections (TAP 1.0 and iTTCA-RF).</div></div><div><h3>Results</h3><div>Across 10-fold validation on the LYnet benchmark, the proposed model achieved an AUC of 0.992, together with a sensitivity of 0.863, specificity of 0.925 and MCC of 0.776. Independent evaluation confirmed the advantage: LYnet yielded AUCs of 0.949 on the TAP 1.0 set and 0.836 on the iTTCA-RF set, surpassing the strongest competing method by 2.4–6.9 percentage points in AUC and up to 10.6 percentage points in MCC. These results demonstrate that LYnet attains state-of-the-art accuracy and balanced prediction for tumour T-cell antigen identification.</div></div><div><h3>Conclusions</h3><div>The introduction of this deep learning model represents a significant advancement in the prediction of tumor T-cell antigens. Its superior accuracy and robustness offer substantial potential to enhance the development and efficacy of cancer immunotherapies. This work not only underscores the importance of precise antigen identification in immunotherapy but also provides a powerful computational tool to aid in the design of next-generation cancer vaccines.</div></div>\",\"PeriodicalId\":10616,\"journal\":{\"name\":\"Computational Biology and Chemistry\",\"volume\":\"120 \",\"pages\":\"Article 108630\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Biology and Chemistry\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1476927125002919\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Biology and Chemistry","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1476927125002919","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
LYnet: Computational identification of tumor T cell antigens using convolutional and recurrent neural networks
Background
Immunotherapy represents a paradigm shift in oncology, offering advantages in efficacy and specificity over traditional therapies. Key to its success is the identification of T-cell antigens, which are essential for triggering an effective antitumor immune response. Current methodologies for antigen prediction, however, lack the precision required for optimal vaccine development.
Purpose
This study aims to address this gap by introducing a novel deep learning model for the accurate prediction of tumor T-cell antigens. It seeks to improve the identification process, thereby facilitating the creation of more effective therapeutic cancer vaccines.
Methods
A hybrid architecture, designated LYnet, was constructed by integrating one-dimensional Convolutional Neural Networks with bidirectional Long Short-Term Memory layers, thereby capturing both local motif patterns and long-range sequence dependencies. Nineteen complementary sequence-derived descriptors—including AAindex, AAK-mer, CKSAAP/CKSAAGP, and physicochemical composition vectors—were concatenated to form the input feature space. Class imbalance in the training set was mitigated with the SMOTE-Tomek resampling strategy. Model robustness was quantified by stratified 10-fold cross-validation, and generalisation was verified on two independent benchmark collections (TAP 1.0 and iTTCA-RF).
Results
Across 10-fold validation on the LYnet benchmark, the proposed model achieved an AUC of 0.992, together with a sensitivity of 0.863, specificity of 0.925 and MCC of 0.776. Independent evaluation confirmed the advantage: LYnet yielded AUCs of 0.949 on the TAP 1.0 set and 0.836 on the iTTCA-RF set, surpassing the strongest competing method by 2.4–6.9 percentage points in AUC and up to 10.6 percentage points in MCC. These results demonstrate that LYnet attains state-of-the-art accuracy and balanced prediction for tumour T-cell antigen identification.
Conclusions
The introduction of this deep learning model represents a significant advancement in the prediction of tumor T-cell antigens. Its superior accuracy and robustness offer substantial potential to enhance the development and efficacy of cancer immunotherapies. This work not only underscores the importance of precise antigen identification in immunotherapy but also provides a powerful computational tool to aid in the design of next-generation cancer vaccines.
期刊介绍:
Computational Biology and Chemistry publishes original research papers and review articles in all areas of computational life sciences. High quality research contributions with a major computational component in the areas of nucleic acid and protein sequence research, molecular evolution, molecular genetics (functional genomics and proteomics), theory and practice of either biology-specific or chemical-biology-specific modeling, and structural biology of nucleic acids and proteins are particularly welcome. Exceptionally high quality research work in bioinformatics, systems biology, ecology, computational pharmacology, metabolism, biomedical engineering, epidemiology, and statistical genetics will also be considered.
Given their inherent uncertainty, protein modeling and molecular docking studies should be thoroughly validated. In the absence of experimental results for validation, the use of molecular dynamics simulations along with detailed free energy calculations, for example, should be used as complementary techniques to support the major conclusions. Submissions of premature modeling exercises without additional biological insights will not be considered.
Review articles will generally be commissioned by the editors and should not be submitted to the journal without explicit invitation. However prospective authors are welcome to send a brief (one to three pages) synopsis, which will be evaluated by the editors.