A deep learning model for predicting systemic lupus erythematosus-associated epitopes.

IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS
Jiale He, Zixia Liu, Xiaopo Tang
{"title":"A deep learning model for predicting systemic lupus erythematosus-associated epitopes.","authors":"Jiale He, Zixia Liu, Xiaopo Tang","doi":"10.1186/s12911-025-03056-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The accurate prediction of epitopes associated with Systemic Lupus Erythematosus (SLE) plays a vital role in advancing our understanding of autoimmune pathogenesis and in designing effective immunotherapeutics. Traditional bioinformatics methods often struggle to capture the intricate sequence patterns and high-dimensional signals characteristic of epitope data. Deep learning presents a compelling alternative, with its ability to perform automatic feature learning and model complex dependencies inherent in biological sequences. This study proposes a hybrid deep learning architecture that synergistically integrates handcrafted biochemical features with data-driven deep sequence modeling to improve the identification of SLE-associated epitopes.</p><p><strong>Methods: </strong>The framework comprises six interconnected components: (1) handcrafted feature extraction encoding biochemical and physicochemical attributes; (2) an embedding layer for dense sequence representation; (3) a Convolutional Neural Network (CNN) branch that captures local patterns from handcrafted features; (4) a Long Short-Term Memory branch for learning temporal dependencies in sequence data; (5) a scaled dot-product attention-based fusion module that integrates complementary information from both branches; and (6) a Multi-Layer Perceptron for final classification. Model evaluation employed metrics such as Accuracy, Precision, Recall, F1-score, and the area under the receiver operating characteristic curve (ROCAUC).</p><p><strong>Results: </strong>The hybrid model outperformed both baseline machine learning algorithms and ablated versions of itself. It achieved a ROCAUC of 0.9506 and an F1-score of 0.8333 on the SLE epitope prediction task. Notably, ablation studies revealed that the CNN component had the most substantial influence on performance, while the custom fusion mechanism yielded better integration of features than conventional strategies. These findings underscore the model's robustness and capacity to generalize across complex epitope prediction tasks.</p><p><strong>Conclusion: </strong>This work presents an interpretable, biologically informed deep learning approach for predicting SLE-associated epitopes. By merging domain-specific handcrafted features with dynamic deep learning representations, the model not only enhances predictive accuracy but also provides meaningful biological insights. The framework holds promise for broader applications in immunoinformatics and autoimmune disease research.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"230"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12220259/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03056-x","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The accurate prediction of epitopes associated with Systemic Lupus Erythematosus (SLE) plays a vital role in advancing our understanding of autoimmune pathogenesis and in designing effective immunotherapeutics. Traditional bioinformatics methods often struggle to capture the intricate sequence patterns and high-dimensional signals characteristic of epitope data. Deep learning presents a compelling alternative, with its ability to perform automatic feature learning and model complex dependencies inherent in biological sequences. This study proposes a hybrid deep learning architecture that synergistically integrates handcrafted biochemical features with data-driven deep sequence modeling to improve the identification of SLE-associated epitopes.

Methods: The framework comprises six interconnected components: (1) handcrafted feature extraction encoding biochemical and physicochemical attributes; (2) an embedding layer for dense sequence representation; (3) a Convolutional Neural Network (CNN) branch that captures local patterns from handcrafted features; (4) a Long Short-Term Memory branch for learning temporal dependencies in sequence data; (5) a scaled dot-product attention-based fusion module that integrates complementary information from both branches; and (6) a Multi-Layer Perceptron for final classification. Model evaluation employed metrics such as Accuracy, Precision, Recall, F1-score, and the area under the receiver operating characteristic curve (ROCAUC).

Results: The hybrid model outperformed both baseline machine learning algorithms and ablated versions of itself. It achieved a ROCAUC of 0.9506 and an F1-score of 0.8333 on the SLE epitope prediction task. Notably, ablation studies revealed that the CNN component had the most substantial influence on performance, while the custom fusion mechanism yielded better integration of features than conventional strategies. These findings underscore the model's robustness and capacity to generalize across complex epitope prediction tasks.

Conclusion: This work presents an interpretable, biologically informed deep learning approach for predicting SLE-associated epitopes. By merging domain-specific handcrafted features with dynamic deep learning representations, the model not only enhances predictive accuracy but also provides meaningful biological insights. The framework holds promise for broader applications in immunoinformatics and autoimmune disease research.

预测系统性红斑狼疮相关表位的深度学习模型。
背景:准确预测与系统性红斑狼疮(SLE)相关的表位对于提高我们对自身免疫发病机制的理解和设计有效的免疫治疗方法具有至关重要的作用。传统的生物信息学方法往往难以捕捉复杂的序列模式和表位数据的高维信号特征。深度学习提供了一个令人信服的替代方案,它能够执行自动特征学习和模拟生物序列中固有的复杂依赖关系。本研究提出了一种混合深度学习架构,将手工制作的生化特征与数据驱动的深度序列建模协同集成,以提高slel相关表位的识别。方法:该框架由6个相互关联的部分组成:(1)编码生化和物理化学属性的手工特征提取;(2)用于密集序列表示的嵌入层;(3)卷积神经网络(CNN)分支,从手工制作的特征中捕获局部模式;(4)长短期记忆分支用于序列数据的时间依赖性学习;(5)基于尺度点积注意力的融合模块,该模块集成了两个分支的互补信息;(6)用于最终分类的多层感知器。模型评估采用的指标包括准确率、精密度、召回率、f1评分和受试者工作特征曲线下面积(ROCAUC)。结果:混合模型优于基线机器学习算法和自身的消融版本。在SLE表位预测任务中,ROCAUC为0.9506,f1评分为0.8333。值得注意的是,消融研究表明,CNN组件对性能的影响最大,而自定义融合机制比传统策略能够更好地整合特征。这些发现强调了该模型的稳健性和泛化复杂表位预测任务的能力。结论:这项工作提出了一种可解释的、生物学信息丰富的深度学习方法来预测slel相关的表位。通过将特定领域的手工特征与动态深度学习表示相结合,该模型不仅提高了预测准确性,还提供了有意义的生物学见解。该框架有望在免疫信息学和自身免疫性疾病研究中得到更广泛的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信