Machine learning using genotype and gene-expression data identifies alterations of genes involved in infection susceptibility, antigen presentation and cytokine signalling as key contributors to JIA risk prediction.

IF 5.1 2区 医学 Q1 RHEUMATOLOGY
Nicholas Pudjihartono, Daniel Ho, Justin Martin O'Sullivan
{"title":"Machine learning using genotype and gene-expression data identifies alterations of genes involved in infection susceptibility, antigen presentation and cytokine signalling as key contributors to JIA risk prediction.","authors":"Nicholas Pudjihartono, Daniel Ho, Justin Martin O'Sullivan","doi":"10.1136/rmdopen-2025-005737","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Previous genome-wide association studies (GWAS) have identified numerous genetic loci associated with juvenile idiopathic arthritis (JIA). However, the functional impact of these variants-particularly on tissue-specific gene expression-and which regulatory interactions make the greatest relative contribution to JIA risk remain unclear. Identifying these key single-nucleotide polymorphism (SNP)-gene-tissue combinations can help prioritise targets for future functional studies and therapeutic interventions.</p><p><strong>Method: </strong>We performed two-sample Mendelian randomisation (2SMR) using spatial expression quantitative trait loci (eQTLs) from nine tissue-specific gene-regulatory networks as instrumental variables (IVs). We also identified JIA-associated SNPs from previous GWAS and mapped their spatial eQTL effects across 49 human tissues. These SNP sets were then used as features in a Lasso-regularised logistic regression model to predict JIA disease status. The model weight magnitudes served as proxies for each SNP's contribution to JIA risk. We evaluated the robustness of our model's feature ranking across 50 cross-validation runs.</p><p><strong>Results: </strong>The top-ranked SNPs included rs7775055, which tags the human leukocyte antigen (HLA) class II haplotype <i>DRB1*0801-DQA1*0401-DQB1*0402</i>, and rs6679677, a non-coding variant that is in 100% linkage with with a coding variant in <i>PTPN22</i>. IVs for genes implicated in infection-related immune processes (eg, <i>MSH5</i>, <i>MICA</i> and <i>LINC01149</i>) also made significant contributions to JIA risk. We additionally identified a spatial eQTL (rs10849448) that upregulated the cytokine signalling gene <i>LTBR</i> across all 49 tissues. Overall, our model highlighted the roles of genes involved in antigen presentation, infection susceptibility and cytokine signalling.</p><p><strong>Conclusion: </strong>By applying a machine learning approach to rank SNP-gene-tissue contributions to JIA risk, our findings offer insights into the genetic mechanisms underlying JIA pathogenesis. Future experimental validation could facilitate new therapeutic targets for the treatment or prevention of JIA.</p>","PeriodicalId":21396,"journal":{"name":"RMD Open","volume":"11 3","pages":""},"PeriodicalIF":5.1000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12243633/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"RMD Open","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1136/rmdopen-2025-005737","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RHEUMATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Previous genome-wide association studies (GWAS) have identified numerous genetic loci associated with juvenile idiopathic arthritis (JIA). However, the functional impact of these variants-particularly on tissue-specific gene expression-and which regulatory interactions make the greatest relative contribution to JIA risk remain unclear. Identifying these key single-nucleotide polymorphism (SNP)-gene-tissue combinations can help prioritise targets for future functional studies and therapeutic interventions.

Method: We performed two-sample Mendelian randomisation (2SMR) using spatial expression quantitative trait loci (eQTLs) from nine tissue-specific gene-regulatory networks as instrumental variables (IVs). We also identified JIA-associated SNPs from previous GWAS and mapped their spatial eQTL effects across 49 human tissues. These SNP sets were then used as features in a Lasso-regularised logistic regression model to predict JIA disease status. The model weight magnitudes served as proxies for each SNP's contribution to JIA risk. We evaluated the robustness of our model's feature ranking across 50 cross-validation runs.

Results: The top-ranked SNPs included rs7775055, which tags the human leukocyte antigen (HLA) class II haplotype DRB1*0801-DQA1*0401-DQB1*0402, and rs6679677, a non-coding variant that is in 100% linkage with with a coding variant in PTPN22. IVs for genes implicated in infection-related immune processes (eg, MSH5, MICA and LINC01149) also made significant contributions to JIA risk. We additionally identified a spatial eQTL (rs10849448) that upregulated the cytokine signalling gene LTBR across all 49 tissues. Overall, our model highlighted the roles of genes involved in antigen presentation, infection susceptibility and cytokine signalling.

Conclusion: By applying a machine learning approach to rank SNP-gene-tissue contributions to JIA risk, our findings offer insights into the genetic mechanisms underlying JIA pathogenesis. Future experimental validation could facilitate new therapeutic targets for the treatment or prevention of JIA.

利用基因型和基因表达数据的机器学习识别出与感染易感性、抗原呈递和细胞因子信号传导相关的基因改变,这是JIA风险预测的关键因素。
背景:以前的全基因组关联研究(GWAS)已经确定了许多与青少年特发性关节炎(JIA)相关的遗传位点。然而,这些变异的功能影响,特别是对组织特异性基因表达的影响,以及哪些调控相互作用对JIA风险的相对贡献最大,仍不清楚。确定这些关键的单核苷酸多态性(SNP)-基因组织组合可以帮助确定未来功能研究和治疗干预的优先目标。方法:我们使用来自9个组织特异性基因调控网络的空间表达数量性状位点(eqtl)作为工具变量(IVs)进行两样本孟德尔随机化(2SMR)。我们还从以前的GWAS中发现了jia相关的snp,并绘制了它们在49个人体组织中的空间eQTL效应。然后将这些SNP集作为lasso正则化逻辑回归模型的特征来预测JIA疾病状态。模型权重值作为每个SNP对JIA风险贡献的代理。我们在50次交叉验证中评估了模型特征排名的稳健性。结果:排名靠前的snp包括人类白细胞抗原(HLA) II类单倍型DRB1*0801-DQA1*0401-DQB1*0402标记的rs7775055和与PTPN22编码变异100%连锁的非编码变异rs6679677。参与感染相关免疫过程的基因(如MSH5、MICA和LINC01149)的IVs也对JIA风险有重要贡献。我们还发现了一个空间eQTL (rs10849448),它上调了所有49个组织的细胞因子信号基因LTBR。总的来说,我们的模型强调了参与抗原呈递、感染易感性和细胞因子信号传导的基因的作用。结论:通过应用机器学习方法对snp基因组织对JIA风险的贡献进行排序,我们的研究结果为JIA发病机制的遗传机制提供了见解。未来的实验验证可以为JIA的治疗或预防提供新的治疗靶点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
RMD Open
RMD Open RHEUMATOLOGY-
CiteScore
7.30
自引率
6.50%
发文量
205
审稿时长
14 weeks
期刊介绍: RMD Open publishes high quality peer-reviewed original research covering the full spectrum of musculoskeletal disorders, rheumatism and connective tissue diseases, including osteoporosis, spine and rehabilitation. Clinical and epidemiological research, basic and translational medicine, interesting clinical cases, and smaller studies that add to the literature are all considered.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信