Deciphering Shared Gene Signatures and Immune Infiltration Characteristics Between Gestational Diabetes Mellitus and Preeclampsia by Integrated Bioinformatics Analysis and Machine Learning.

IF 2.6 3区 医学 Q2 OBSTETRICS & GYNECOLOGY
Yaqian Li, Xueqi Li, Tingting Xu, Daijuan Chen, Fan Zhou, Xiaodong Wang
{"title":"Deciphering Shared Gene Signatures and Immune Infiltration Characteristics Between Gestational Diabetes Mellitus and Preeclampsia by Integrated Bioinformatics Analysis and Machine Learning.","authors":"Yaqian Li, Xueqi Li, Tingting Xu, Daijuan Chen, Fan Zhou, Xiaodong Wang","doi":"10.1007/s43032-025-01847-1","DOIUrl":null,"url":null,"abstract":"<p><p>Gestational diabetes mellitus (GDM) and preeclampsia (PE) are common and serious disorders of pregnancy that threaten maternal safety and perinatal outcomes. Generally, GDM is recognized as an independent risk factor for the development of preeclampsia, while a history of preeclampsia in primiparous women is also a risk factor for GDM in subsequent pregnancies. However, the intricate underlying mechanisms of GDM and PE remain elusive. This study developed a diagnostic prediction model for GDM and PE. It investigated the correlation between shared signature genes and immune infiltration characteristics, by employing bioinformatic analysis combined with a machine learning strategy. The microarray datasets GSE103552 and GSE74341 from the Gene Expression Omnibus (GEO) database were used to obtain differentially expressed genes (DEGs). Then, signature genes were identified from the common DEGs via the methods of random forest (RF) algorithms, and artificial neural network (ANN) models. Furthermore, the immune infiltration patterns associated with GDM and PE were explored and validated in the training and testing sets. Moreover, to uncover the molecular mechanisms involved, an mRNA-miRNA network of target genes was constructed, and potential therapeutic drugs for GDM and PE were explored by querying the Connectivity Map (CMap) database. We obtained 45 DEGs by intersecting upregulated and downregulated DEGs from the GSE103552 and GSE74341 datasets. The results of GO annotation indicated that these 45 DEGs were mainly enriched in the process of cell cycle, and KEGG enrichment analysis indicated significant associations with immune signal transduction pathways and immune-related infectious disease. Six signature genes, namely TRA2A, NPM3, PHF5A, SNORD1C, PLXNA3, and C14orf142, were determined by machine learning models, and a diagnostic prediction model for GDM and PE was constructed based on these key genes, validating the highest prediction in the testing set. Moreover, we found increased infiltration of iDCs and T cell co-inhibition in the GDM group, while neutrophil, Th2 cell, and HLA levels were found to have decreased significantly. The PE group showed a significant increase in mast cells. In addition, the identified key genes were found to have potential associations with various immunocytes, immune functions, and checkpoints in the training and testing sets. Then, a miRNA-gene network analysis predicted several key miRNAs-miR-204, miR-23abc, miR-9, miR-205, and miR-455-5p-that might play significant roles in regulating these DEGs. In addition, the research also identified four potential therapeutic compounds for GDM (prima-1-met, geranylgeraniol, MLN-8054, and LY-364947), along with other drugs (deferiprone, peucedanin, MPEP, and IWR-1-endo) that could be targeted for treating PE. In summary, this work identified six signature genes (TRA2A, NPM3, PHF5A, SNORD1C, PLXNA3, and C14orf142) as potential genetic biomarkers for the diagnostic prediction of GDM and PE. A diagnostic prediction model was constructed based on these key genes, demonstrating strong performance when validated with an independent dataset. Moreover, we investigated the similarities and differences between the two diseases in terms of immune infiltration landscape and analyzed the correlations between key genes and the immune infiltration landscape, which provided insights into the molecular mechanisms underlying the development of GDM and PE. This understanding could pave the way for breakthroughs in identifying new immunotherapeutic targets and strategies for disease prevention and treatment.</p>","PeriodicalId":20920,"journal":{"name":"Reproductive Sciences","volume":" ","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reproductive Sciences","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s43032-025-01847-1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Gestational diabetes mellitus (GDM) and preeclampsia (PE) are common and serious disorders of pregnancy that threaten maternal safety and perinatal outcomes. Generally, GDM is recognized as an independent risk factor for the development of preeclampsia, while a history of preeclampsia in primiparous women is also a risk factor for GDM in subsequent pregnancies. However, the intricate underlying mechanisms of GDM and PE remain elusive. This study developed a diagnostic prediction model for GDM and PE. It investigated the correlation between shared signature genes and immune infiltration characteristics, by employing bioinformatic analysis combined with a machine learning strategy. The microarray datasets GSE103552 and GSE74341 from the Gene Expression Omnibus (GEO) database were used to obtain differentially expressed genes (DEGs). Then, signature genes were identified from the common DEGs via the methods of random forest (RF) algorithms, and artificial neural network (ANN) models. Furthermore, the immune infiltration patterns associated with GDM and PE were explored and validated in the training and testing sets. Moreover, to uncover the molecular mechanisms involved, an mRNA-miRNA network of target genes was constructed, and potential therapeutic drugs for GDM and PE were explored by querying the Connectivity Map (CMap) database. We obtained 45 DEGs by intersecting upregulated and downregulated DEGs from the GSE103552 and GSE74341 datasets. The results of GO annotation indicated that these 45 DEGs were mainly enriched in the process of cell cycle, and KEGG enrichment analysis indicated significant associations with immune signal transduction pathways and immune-related infectious disease. Six signature genes, namely TRA2A, NPM3, PHF5A, SNORD1C, PLXNA3, and C14orf142, were determined by machine learning models, and a diagnostic prediction model for GDM and PE was constructed based on these key genes, validating the highest prediction in the testing set. Moreover, we found increased infiltration of iDCs and T cell co-inhibition in the GDM group, while neutrophil, Th2 cell, and HLA levels were found to have decreased significantly. The PE group showed a significant increase in mast cells. In addition, the identified key genes were found to have potential associations with various immunocytes, immune functions, and checkpoints in the training and testing sets. Then, a miRNA-gene network analysis predicted several key miRNAs-miR-204, miR-23abc, miR-9, miR-205, and miR-455-5p-that might play significant roles in regulating these DEGs. In addition, the research also identified four potential therapeutic compounds for GDM (prima-1-met, geranylgeraniol, MLN-8054, and LY-364947), along with other drugs (deferiprone, peucedanin, MPEP, and IWR-1-endo) that could be targeted for treating PE. In summary, this work identified six signature genes (TRA2A, NPM3, PHF5A, SNORD1C, PLXNA3, and C14orf142) as potential genetic biomarkers for the diagnostic prediction of GDM and PE. A diagnostic prediction model was constructed based on these key genes, demonstrating strong performance when validated with an independent dataset. Moreover, we investigated the similarities and differences between the two diseases in terms of immune infiltration landscape and analyzed the correlations between key genes and the immune infiltration landscape, which provided insights into the molecular mechanisms underlying the development of GDM and PE. This understanding could pave the way for breakthroughs in identifying new immunotherapeutic targets and strategies for disease prevention and treatment.

综合生物信息学分析和机器学习解读妊娠糖尿病和子痫前期的共享基因特征和免疫浸润特征。
妊娠期糖尿病(GDM)和先兆子痫(PE)是妊娠期常见且严重的疾病,威胁孕产妇安全和围产期结局。一般认为GDM是子痫前期发生的独立危险因素,而初产妇有子痫前期病史也是以后妊娠发生GDM的危险因素。然而,GDM和PE复杂的潜在机制仍然难以捉摸。本研究建立了GDM和PE的诊断预测模型。通过采用生物信息学分析结合机器学习策略,研究了共享特征基因与免疫浸润特征之间的相关性。基因表达Omnibus (Gene Expression Omnibus, GEO)数据库中的微阵列数据集GSE103552和GSE74341获得差异表达基因(differential Expression genes, DEGs)。然后,通过随机森林(RF)算法和人工神经网络(ANN)模型从常见的deg中识别出特征基因。此外,在训练集和测试集中探索并验证了与GDM和PE相关的免疫浸润模式。此外,为了揭示其中的分子机制,构建了靶基因mRNA-miRNA网络,并通过查询Connectivity Map (CMap)数据库探索GDM和PE的潜在治疗药物。通过交叉GSE103552和GSE74341数据集中上调和下调的基因,我们得到了45个基因。GO注释结果表明,这45个deg主要富集于细胞周期过程中,KEGG富集分析表明,KEGG与免疫信号转导通路和免疫相关感染性疾病有显著关联。通过机器学习模型确定了TRA2A、NPM3、PHF5A、SNORD1C、PLXNA3和C14orf142 6个特征基因,并基于这些关键基因构建了GDM和PE的诊断预测模型,验证了测试集中预测最高的结果。此外,我们发现在GDM组中,iDCs的浸润和T细胞的共抑制增加,而中性粒细胞、Th2细胞和HLA水平明显下降。PE组肥大细胞明显增多。此外,在训练集和测试集中发现鉴定的关键基因与各种免疫细胞、免疫功能和检查点具有潜在的关联。然后,mirna基因网络分析预测了几个关键mirna - mir -204, miR-23abc, miR-9, miR-205和mir -455-5p可能在调节这些deg中发挥重要作用。此外,该研究还确定了四种潜在的GDM治疗化合物(prima-1-met, geranylgeranol, MLN-8054和LY-364947),以及其他可靶向治疗PE的药物(去铁素,去青霉素,MPEP和IWR-1-endo)。总之,本研究确定了6个特征基因(TRA2A、NPM3、PHF5A、SNORD1C、PLXNA3和C14orf142)作为GDM和PE诊断预测的潜在遗传生物标志物。基于这些关键基因构建了诊断预测模型,并在独立数据集上进行了验证。此外,我们还研究了两种疾病在免疫浸润景观方面的异同,并分析了关键基因与免疫浸润景观的相关性,为GDM和PE发展的分子机制提供了新的思路。这种认识可以为确定新的免疫治疗靶点和疾病预防和治疗策略的突破铺平道路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Reproductive Sciences
Reproductive Sciences 医学-妇产科学
CiteScore
5.50
自引率
3.40%
发文量
322
审稿时长
4-8 weeks
期刊介绍: Reproductive Sciences (RS) is a peer-reviewed, monthly journal publishing original research and reviews in obstetrics and gynecology. RS is multi-disciplinary and includes research in basic reproductive biology and medicine, maternal-fetal medicine, obstetrics, gynecology, reproductive endocrinology, urogynecology, fertility/infertility, embryology, gynecologic/reproductive oncology, developmental biology, stem cell research, molecular/cellular biology and other related fields.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信