Leveraging attention-based deep multiple instance and multiple task learning for improved neoepitope identification.

IF 7.7

Cell systems Pub Date : 2025-10-06 DOI:10.1016/j.cels.2025.101404

Wei Qu, Shanfeng Zhu

引用次数: 0

Abstract

Accurate prediction of major histocompatibility complex class I (MHC class I) neoepitopes is crucial for personalized cancer immunotherapy. Current methods struggle with predicting ligand presentation for multiple alleles and identifying neoepitopes. We introduce NeoMHCI, a deep learning model that combines attention-based multiple instance learning (MIL) and multi-task learning for precise MHC class I neoepitope identification. NeoMHCI uses MIL to generate high-quality peptide embeddings with multiple MHC class I molecules and enhances immunogenicity prioritization through fine-tuning. Analyses on benchmark datasets show NeoMHCI outperforms existing methods, achieving an area under the receiver operating characteristic curve of 0.948 and an area under the precision-recall curve of 0.496 on unobserved multi-allele ligand presentation prediction and the highest top-5 accuracy (42.3%) for neoepitope recognition, indicating potential for personalized vaccines and therapies. A record of this paper's transparent peer review process is included in the supplemental information.

查看原文本刊更多论文

利用基于注意力的深度多实例和多任务学习改进新表位识别。

准确预测主要组织相容性复合体I类（MHC I类）新表位对于个性化癌症免疫治疗至关重要。目前的方法难以预测多个等位基因的配体呈现和识别新表位。我们介绍了NeoMHCI，这是一种深度学习模型，结合了基于注意力的多实例学习（MIL）和多任务学习，用于MHCI类新表位的精确识别。NeoMHCI使用MIL生成含有多个MHCI类分子的高质量肽包埋，并通过微调提高免疫原性优先级。对基准数据集的分析表明，NeoMHCI优于现有方法，对未观察到的多等位基因配体呈现预测的受试者工作特征曲线下面积为0.948，精度-召回曲线下面积为0.496，新表位识别的前5名准确率最高（42.3%），表明了个性化疫苗和治疗的潜力。本文的透明同行评议过程记录包含在补充信息中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Cell systems

自引率

0.00%

发文量