Nicholas Pudjihartono, Daniel Ho, Justin Martin O'Sullivan
{"title":"利用基因型和基因表达数据的机器学习识别出与感染易感性、抗原呈递和细胞因子信号传导相关的基因改变,这是JIA风险预测的关键因素。","authors":"Nicholas Pudjihartono, Daniel Ho, Justin Martin O'Sullivan","doi":"10.1136/rmdopen-2025-005737","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Previous genome-wide association studies (GWAS) have identified numerous genetic loci associated with juvenile idiopathic arthritis (JIA). However, the functional impact of these variants-particularly on tissue-specific gene expression-and which regulatory interactions make the greatest relative contribution to JIA risk remain unclear. Identifying these key single-nucleotide polymorphism (SNP)-gene-tissue combinations can help prioritise targets for future functional studies and therapeutic interventions.</p><p><strong>Method: </strong>We performed two-sample Mendelian randomisation (2SMR) using spatial expression quantitative trait loci (eQTLs) from nine tissue-specific gene-regulatory networks as instrumental variables (IVs). We also identified JIA-associated SNPs from previous GWAS and mapped their spatial eQTL effects across 49 human tissues. These SNP sets were then used as features in a Lasso-regularised logistic regression model to predict JIA disease status. The model weight magnitudes served as proxies for each SNP's contribution to JIA risk. We evaluated the robustness of our model's feature ranking across 50 cross-validation runs.</p><p><strong>Results: </strong>The top-ranked SNPs included rs7775055, which tags the human leukocyte antigen (HLA) class II haplotype <i>DRB1*0801-DQA1*0401-DQB1*0402</i>, and rs6679677, a non-coding variant that is in 100% linkage with with a coding variant in <i>PTPN22</i>. IVs for genes implicated in infection-related immune processes (eg, <i>MSH5</i>, <i>MICA</i> and <i>LINC01149</i>) also made significant contributions to JIA risk. We additionally identified a spatial eQTL (rs10849448) that upregulated the cytokine signalling gene <i>LTBR</i> across all 49 tissues. Overall, our model highlighted the roles of genes involved in antigen presentation, infection susceptibility and cytokine signalling.</p><p><strong>Conclusion: </strong>By applying a machine learning approach to rank SNP-gene-tissue contributions to JIA risk, our findings offer insights into the genetic mechanisms underlying JIA pathogenesis. Future experimental validation could facilitate new therapeutic targets for the treatment or prevention of JIA.</p>","PeriodicalId":21396,"journal":{"name":"RMD Open","volume":"11 3","pages":""},"PeriodicalIF":5.1000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12243633/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine learning using genotype and gene-expression data identifies alterations of genes involved in infection susceptibility, antigen presentation and cytokine signalling as key contributors to JIA risk prediction.\",\"authors\":\"Nicholas Pudjihartono, Daniel Ho, Justin Martin O'Sullivan\",\"doi\":\"10.1136/rmdopen-2025-005737\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Previous genome-wide association studies (GWAS) have identified numerous genetic loci associated with juvenile idiopathic arthritis (JIA). However, the functional impact of these variants-particularly on tissue-specific gene expression-and which regulatory interactions make the greatest relative contribution to JIA risk remain unclear. Identifying these key single-nucleotide polymorphism (SNP)-gene-tissue combinations can help prioritise targets for future functional studies and therapeutic interventions.</p><p><strong>Method: </strong>We performed two-sample Mendelian randomisation (2SMR) using spatial expression quantitative trait loci (eQTLs) from nine tissue-specific gene-regulatory networks as instrumental variables (IVs). We also identified JIA-associated SNPs from previous GWAS and mapped their spatial eQTL effects across 49 human tissues. These SNP sets were then used as features in a Lasso-regularised logistic regression model to predict JIA disease status. The model weight magnitudes served as proxies for each SNP's contribution to JIA risk. We evaluated the robustness of our model's feature ranking across 50 cross-validation runs.</p><p><strong>Results: </strong>The top-ranked SNPs included rs7775055, which tags the human leukocyte antigen (HLA) class II haplotype <i>DRB1*0801-DQA1*0401-DQB1*0402</i>, and rs6679677, a non-coding variant that is in 100% linkage with with a coding variant in <i>PTPN22</i>. IVs for genes implicated in infection-related immune processes (eg, <i>MSH5</i>, <i>MICA</i> and <i>LINC01149</i>) also made significant contributions to JIA risk. We additionally identified a spatial eQTL (rs10849448) that upregulated the cytokine signalling gene <i>LTBR</i> across all 49 tissues. Overall, our model highlighted the roles of genes involved in antigen presentation, infection susceptibility and cytokine signalling.</p><p><strong>Conclusion: </strong>By applying a machine learning approach to rank SNP-gene-tissue contributions to JIA risk, our findings offer insights into the genetic mechanisms underlying JIA pathogenesis. Future experimental validation could facilitate new therapeutic targets for the treatment or prevention of JIA.</p>\",\"PeriodicalId\":21396,\"journal\":{\"name\":\"RMD Open\",\"volume\":\"11 3\",\"pages\":\"\"},\"PeriodicalIF\":5.1000,\"publicationDate\":\"2025-07-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12243633/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"RMD Open\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1136/rmdopen-2025-005737\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RHEUMATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"RMD Open","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1136/rmdopen-2025-005737","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RHEUMATOLOGY","Score":null,"Total":0}
Machine learning using genotype and gene-expression data identifies alterations of genes involved in infection susceptibility, antigen presentation and cytokine signalling as key contributors to JIA risk prediction.
Background: Previous genome-wide association studies (GWAS) have identified numerous genetic loci associated with juvenile idiopathic arthritis (JIA). However, the functional impact of these variants-particularly on tissue-specific gene expression-and which regulatory interactions make the greatest relative contribution to JIA risk remain unclear. Identifying these key single-nucleotide polymorphism (SNP)-gene-tissue combinations can help prioritise targets for future functional studies and therapeutic interventions.
Method: We performed two-sample Mendelian randomisation (2SMR) using spatial expression quantitative trait loci (eQTLs) from nine tissue-specific gene-regulatory networks as instrumental variables (IVs). We also identified JIA-associated SNPs from previous GWAS and mapped their spatial eQTL effects across 49 human tissues. These SNP sets were then used as features in a Lasso-regularised logistic regression model to predict JIA disease status. The model weight magnitudes served as proxies for each SNP's contribution to JIA risk. We evaluated the robustness of our model's feature ranking across 50 cross-validation runs.
Results: The top-ranked SNPs included rs7775055, which tags the human leukocyte antigen (HLA) class II haplotype DRB1*0801-DQA1*0401-DQB1*0402, and rs6679677, a non-coding variant that is in 100% linkage with with a coding variant in PTPN22. IVs for genes implicated in infection-related immune processes (eg, MSH5, MICA and LINC01149) also made significant contributions to JIA risk. We additionally identified a spatial eQTL (rs10849448) that upregulated the cytokine signalling gene LTBR across all 49 tissues. Overall, our model highlighted the roles of genes involved in antigen presentation, infection susceptibility and cytokine signalling.
Conclusion: By applying a machine learning approach to rank SNP-gene-tissue contributions to JIA risk, our findings offer insights into the genetic mechanisms underlying JIA pathogenesis. Future experimental validation could facilitate new therapeutic targets for the treatment or prevention of JIA.
期刊介绍:
RMD Open publishes high quality peer-reviewed original research covering the full spectrum of musculoskeletal disorders, rheumatism and connective tissue diseases, including osteoporosis, spine and rehabilitation. Clinical and epidemiological research, basic and translational medicine, interesting clinical cases, and smaller studies that add to the literature are all considered.