Zichang Xu, Hendra S Ismanto, Dianita S Saputri, Soichiro Haruna, Guanqun Sun, Jan Wilamowski, Shunsuke Teraguchi, Ayan Sengupta, Songling Li, Daron M Standley
{"title":"从适应性免疫受体的旁位网络中稳健地检测传染病、自身免疫和癌症。","authors":"Zichang Xu, Hendra S Ismanto, Dianita S Saputri, Soichiro Haruna, Guanqun Sun, Jan Wilamowski, Shunsuke Teraguchi, Ayan Sengupta, Songling Li, Daron M Standley","doi":"10.1093/bib/bbae431","DOIUrl":null,"url":null,"abstract":"<p><p>Liquid biopsies based on peripheral blood offer a minimally invasive alternative to solid tissue biopsies for the detection of diseases, primarily cancers. However, such tests currently consider only the serum component of blood, overlooking a potentially rich source of biomarkers: adaptive immune receptors (AIRs) expressed on circulating B and T cells. Machine learning-based classifiers trained on AIRs have been reported to accurately identify not only cancers but also autoimmune and infectious diseases as well. However, when using the conventional \"clonotype cluster\" representation of AIRs, individuals within a disease or healthy cohort exhibit vastly different features, limiting the generalizability of these classifiers. This study aimed to address the challenge of classifying specific diseases from circulating B or T cells by developing a novel representation of AIRs based on similarity networks constructed from their antigen-binding regions (paratopes). Features based on this novel representation, paratope cluster occupancies (PCOs), significantly improved disease classification performance for infectious disease, autoimmune disease, and cancer. Under identical methodological conditions, classifiers trained on PCOs achieved a mean AUC of 0.893 when applied to new individuals, outperforming clonotype cluster-based classifiers (AUC 0.714) and the best-performing published classifier (AUC 0.777). Surprisingly, for cancer patients, we observed that \"healthy-biased\" AIRs were predicted to target known cancer-associated antigens at dramatically higher rates than healthy AIRs as a whole (Z scores >75), suggesting an overlooked reservoir of cancer-targeting immune cells that could be identified by PCOs.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11370640/pdf/","citationCount":"0","resultStr":"{\"title\":\"Robust detection of infectious disease, autoimmunity, and cancer from the paratope networks of adaptive immune receptors.\",\"authors\":\"Zichang Xu, Hendra S Ismanto, Dianita S Saputri, Soichiro Haruna, Guanqun Sun, Jan Wilamowski, Shunsuke Teraguchi, Ayan Sengupta, Songling Li, Daron M Standley\",\"doi\":\"10.1093/bib/bbae431\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Liquid biopsies based on peripheral blood offer a minimally invasive alternative to solid tissue biopsies for the detection of diseases, primarily cancers. However, such tests currently consider only the serum component of blood, overlooking a potentially rich source of biomarkers: adaptive immune receptors (AIRs) expressed on circulating B and T cells. Machine learning-based classifiers trained on AIRs have been reported to accurately identify not only cancers but also autoimmune and infectious diseases as well. However, when using the conventional \\\"clonotype cluster\\\" representation of AIRs, individuals within a disease or healthy cohort exhibit vastly different features, limiting the generalizability of these classifiers. This study aimed to address the challenge of classifying specific diseases from circulating B or T cells by developing a novel representation of AIRs based on similarity networks constructed from their antigen-binding regions (paratopes). Features based on this novel representation, paratope cluster occupancies (PCOs), significantly improved disease classification performance for infectious disease, autoimmune disease, and cancer. Under identical methodological conditions, classifiers trained on PCOs achieved a mean AUC of 0.893 when applied to new individuals, outperforming clonotype cluster-based classifiers (AUC 0.714) and the best-performing published classifier (AUC 0.777). Surprisingly, for cancer patients, we observed that \\\"healthy-biased\\\" AIRs were predicted to target known cancer-associated antigens at dramatically higher rates than healthy AIRs as a whole (Z scores >75), suggesting an overlooked reservoir of cancer-targeting immune cells that could be identified by PCOs.</p>\",\"PeriodicalId\":9209,\"journal\":{\"name\":\"Briefings in bioinformatics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2024-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11370640/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Briefings in bioinformatics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/bib/bbae431\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbae431","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
摘要
基于外周血的液体活检为检测疾病(主要是癌症)提供了实体组织活检的微创替代方法。然而,这类检测目前只考虑血液中的血清成分,忽略了潜在的丰富生物标志物来源:在循环 B 细胞和 T 细胞上表达的适应性免疫受体(AIRs)。据报道,基于 AIRs 训练的机器学习分类器不仅能准确识别癌症,还能识别自身免疫性疾病和传染性疾病。然而,当使用传统的 AIRs "克隆型集群 "表示法时,疾病或健康队列中的个体会表现出截然不同的特征,从而限制了这些分类器的普适性。本研究旨在通过开发一种基于抗原结合区(旁位点)构建的相似性网络的新型 AIR 表示方法,解决从循环 B 细胞或 T 细胞中对特定疾病进行分类的难题。基于这种新表征的特征--旁位群占位(PCOs)--显著提高了传染病、自身免疫性疾病和癌症的疾病分类性能。在相同的方法条件下,基于 PCOs 训练的分类器应用于新个体时,平均 AUC 为 0.893,优于基于克隆型聚类的分类器(AUC 0.714)和已发表的最佳分类器(AUC 0.777)。令人惊讶的是,对于癌症患者,我们观察到 "健康偏倚 "AIRs靶向已知癌症相关抗原的预测率大大高于健康AIRs整体(Z分数大于75),这表明PCOs可以识别出一个被忽视的癌症靶向免疫细胞库。
Robust detection of infectious disease, autoimmunity, and cancer from the paratope networks of adaptive immune receptors.
Liquid biopsies based on peripheral blood offer a minimally invasive alternative to solid tissue biopsies for the detection of diseases, primarily cancers. However, such tests currently consider only the serum component of blood, overlooking a potentially rich source of biomarkers: adaptive immune receptors (AIRs) expressed on circulating B and T cells. Machine learning-based classifiers trained on AIRs have been reported to accurately identify not only cancers but also autoimmune and infectious diseases as well. However, when using the conventional "clonotype cluster" representation of AIRs, individuals within a disease or healthy cohort exhibit vastly different features, limiting the generalizability of these classifiers. This study aimed to address the challenge of classifying specific diseases from circulating B or T cells by developing a novel representation of AIRs based on similarity networks constructed from their antigen-binding regions (paratopes). Features based on this novel representation, paratope cluster occupancies (PCOs), significantly improved disease classification performance for infectious disease, autoimmune disease, and cancer. Under identical methodological conditions, classifiers trained on PCOs achieved a mean AUC of 0.893 when applied to new individuals, outperforming clonotype cluster-based classifiers (AUC 0.714) and the best-performing published classifier (AUC 0.777). Surprisingly, for cancer patients, we observed that "healthy-biased" AIRs were predicted to target known cancer-associated antigens at dramatically higher rates than healthy AIRs as a whole (Z scores >75), suggesting an overlooked reservoir of cancer-targeting immune cells that could be identified by PCOs.
期刊介绍:
Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data.
The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.