基于机器学习的血液转录组特征鉴别系统自身免疫和感染。

IF 11.8 Q1 MEDICINE, RESEARCH & EXPERIMENTAL
Med Pub Date : 2025-08-29 DOI:10.1016/j.medj.2025.100840
Kleio-Maria Verrou, Nikolaos I Vlachogiannis, Argyrios N Theofilopoulos, Maria Tektonidou, Georgios Kollias, Christoforos Nikolaou, Petros P Sfikakis
{"title":"基于机器学习的血液转录组特征鉴别系统自身免疫和感染。","authors":"Kleio-Maria Verrou, Nikolaos I Vlachogiannis, Argyrios N Theofilopoulos, Maria Tektonidou, Georgios Kollias, Christoforos Nikolaou, Petros P Sfikakis","doi":"10.1016/j.medj.2025.100840","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Pathogenic responses against self and foreign antigens in systemic autoimmunity and infection, respectively, engage similar immunologic components, thus lacking distinguishing diagnostic biomarkers. Herein, we tested whether whole-blood transcriptome analysis discriminates autoimmune from infectious diseases.</p><p><strong>Methods: </strong>We applied nested cross-validation methodology to tune and validate random forests, k-nearest neighbors, and support vector machines, using a new preprocessing method on 22 publicly available datasets, including 594 patients with a broad spectrum of systemic autoimmune diseases and 615 patients with diverse viral, bacterial, and parasitic infections.</p><p><strong>Findings: </strong>Our preprocessing method tackled RNA sequencing batch effects by sorting the genes within each sample according to individual relative expression values and discriminated between the corresponding pathologies with 98% accuracy versus 63% when using raw values. This model was further tested in external datasets comprising various autoimmune diseases and infections new to its training process, yielding accuracies ranging between 80% and 96%. Enrichment analyses of 457 of the most informative genes identified SAP1, ELF1/4, and FLI1 transcription factors among the significant upstream regulators and revealed several key processes and pathways, such as autophagy, DNA damage response, and NOTCH signaling. A subset of 24 genes, including the inflammation-related genes RPL7, TLK2, and ANK2, distinguished between autoimmune and infectious diseases with 89% accuracy.</p><p><strong>Conclusions: </strong>Using a novel batch-correction algorithm, this analysis may provide a new mechanistic understanding of the pathogenic autoimmune response, as well as biomarkers for differential diagnoses of the corresponding pathologies in patients presenting with inflammatory disorders.</p><p><strong>Funding: </strong>This work was funded by the European Regional Development Fund NSRF 2014-2020, no. MIS5002802.</p>","PeriodicalId":29964,"journal":{"name":"Med","volume":" ","pages":"100840"},"PeriodicalIF":11.8000,"publicationDate":"2025-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning-based identification of a transcriptomic blood signature discriminating between systemic autoimmunity and infection.\",\"authors\":\"Kleio-Maria Verrou, Nikolaos I Vlachogiannis, Argyrios N Theofilopoulos, Maria Tektonidou, Georgios Kollias, Christoforos Nikolaou, Petros P Sfikakis\",\"doi\":\"10.1016/j.medj.2025.100840\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Pathogenic responses against self and foreign antigens in systemic autoimmunity and infection, respectively, engage similar immunologic components, thus lacking distinguishing diagnostic biomarkers. Herein, we tested whether whole-blood transcriptome analysis discriminates autoimmune from infectious diseases.</p><p><strong>Methods: </strong>We applied nested cross-validation methodology to tune and validate random forests, k-nearest neighbors, and support vector machines, using a new preprocessing method on 22 publicly available datasets, including 594 patients with a broad spectrum of systemic autoimmune diseases and 615 patients with diverse viral, bacterial, and parasitic infections.</p><p><strong>Findings: </strong>Our preprocessing method tackled RNA sequencing batch effects by sorting the genes within each sample according to individual relative expression values and discriminated between the corresponding pathologies with 98% accuracy versus 63% when using raw values. This model was further tested in external datasets comprising various autoimmune diseases and infections new to its training process, yielding accuracies ranging between 80% and 96%. Enrichment analyses of 457 of the most informative genes identified SAP1, ELF1/4, and FLI1 transcription factors among the significant upstream regulators and revealed several key processes and pathways, such as autophagy, DNA damage response, and NOTCH signaling. A subset of 24 genes, including the inflammation-related genes RPL7, TLK2, and ANK2, distinguished between autoimmune and infectious diseases with 89% accuracy.</p><p><strong>Conclusions: </strong>Using a novel batch-correction algorithm, this analysis may provide a new mechanistic understanding of the pathogenic autoimmune response, as well as biomarkers for differential diagnoses of the corresponding pathologies in patients presenting with inflammatory disorders.</p><p><strong>Funding: </strong>This work was funded by the European Regional Development Fund NSRF 2014-2020, no. MIS5002802.</p>\",\"PeriodicalId\":29964,\"journal\":{\"name\":\"Med\",\"volume\":\" \",\"pages\":\"100840\"},\"PeriodicalIF\":11.8000,\"publicationDate\":\"2025-08-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Med\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1016/j.medj.2025.100840\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Med","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.medj.2025.100840","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

背景:在系统性自身免疫和感染中,针对自身抗原和外源抗原的致病性反应分别涉及相似的免疫成分,因此缺乏区分性的诊断性生物标志物。在这里,我们测试了全血转录组分析是否能区分自身免疫性疾病和感染性疾病。方法:我们采用嵌套交叉验证方法,对22个公开可用的数据集进行调整和验证随机森林、k近邻和支持向量机,使用一种新的预处理方法,包括594例广谱系统性自身免疫性疾病患者和615例各种病毒、细菌和寄生虫感染患者。结果:我们的预处理方法通过根据个体相对表达值对每个样本中的基因进行分类来解决RNA测序批次效应,并以98%的准确率区分相应的病理,而使用原始值时的准确率为63%。该模型在外部数据集中进行了进一步测试,这些数据集包括各种自身免疫性疾病和感染,这是其训练过程中新的数据集,准确度在80%到96%之间。对457个最具信息性的基因进行富集分析,鉴定出SAP1、ELF1/4和FLI1转录因子是重要的上游调控因子,并揭示了自噬、DNA损伤反应和NOTCH信号传导等几个关键过程和途径。包括炎症相关基因RPL7、TLK2和ANK2在内的24个基因组成的子集能够以89%的准确率区分自身免疫性疾病和感染性疾病。结论:使用一种新的批量校正算法,该分析可能为病原性自身免疫反应提供新的机制理解,并为炎症性疾病患者相应病理的鉴别诊断提供生物标志物。本研究由欧洲区域发展基金NSRF 2014-2020资助。MIS5002802。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine learning-based identification of a transcriptomic blood signature discriminating between systemic autoimmunity and infection.

Background: Pathogenic responses against self and foreign antigens in systemic autoimmunity and infection, respectively, engage similar immunologic components, thus lacking distinguishing diagnostic biomarkers. Herein, we tested whether whole-blood transcriptome analysis discriminates autoimmune from infectious diseases.

Methods: We applied nested cross-validation methodology to tune and validate random forests, k-nearest neighbors, and support vector machines, using a new preprocessing method on 22 publicly available datasets, including 594 patients with a broad spectrum of systemic autoimmune diseases and 615 patients with diverse viral, bacterial, and parasitic infections.

Findings: Our preprocessing method tackled RNA sequencing batch effects by sorting the genes within each sample according to individual relative expression values and discriminated between the corresponding pathologies with 98% accuracy versus 63% when using raw values. This model was further tested in external datasets comprising various autoimmune diseases and infections new to its training process, yielding accuracies ranging between 80% and 96%. Enrichment analyses of 457 of the most informative genes identified SAP1, ELF1/4, and FLI1 transcription factors among the significant upstream regulators and revealed several key processes and pathways, such as autophagy, DNA damage response, and NOTCH signaling. A subset of 24 genes, including the inflammation-related genes RPL7, TLK2, and ANK2, distinguished between autoimmune and infectious diseases with 89% accuracy.

Conclusions: Using a novel batch-correction algorithm, this analysis may provide a new mechanistic understanding of the pathogenic autoimmune response, as well as biomarkers for differential diagnoses of the corresponding pathologies in patients presenting with inflammatory disorders.

Funding: This work was funded by the European Regional Development Fund NSRF 2014-2020, no. MIS5002802.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Med
Med MEDICINE, RESEARCH & EXPERIMENTAL-
CiteScore
17.70
自引率
0.60%
发文量
102
期刊介绍: Med is a flagship medical journal published monthly by Cell Press, the global publisher of trusted and authoritative science journals including Cell, Cancer Cell, and Cell Reports Medicine. Our mission is to advance clinical research and practice by providing a communication forum for the publication of clinical trial results, innovative observations from longitudinal cohorts, and pioneering discoveries about disease mechanisms. The journal also encourages thought-leadership discussions among biomedical researchers, physicians, and other health scientists and stakeholders. Our goal is to improve health worldwide sustainably and ethically. Med publishes rigorously vetted original research and cutting-edge review and perspective articles on critical health issues globally and regionally. Our research section covers clinical case reports, first-in-human studies, large-scale clinical trials, population-based studies, as well as translational research work with the potential to change the course of medical research and improve clinical practice.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信