Identification of biomarkers for endometriosis based on summary-data-based Mendelian randomization and machine learning.

IF 1.3 4区 医学 Q2 MEDICINE, GENERAL & INTERNAL
Ziwei Xie, Yuxin Feng, Yue He, Yingying Lin, Xiaohong Wang
{"title":"Identification of biomarkers for endometriosis based on summary-data-based Mendelian randomization and machine learning.","authors":"Ziwei Xie, Yuxin Feng, Yue He, Yingying Lin, Xiaohong Wang","doi":"10.1097/MD.0000000000041804","DOIUrl":null,"url":null,"abstract":"<p><p>Endometriosis (EM) significantly impacts the quality of life, and its diagnosis currently relies on surgery, which carries risks and may miss early lesions. Noninvasive biomarkers are urgently needed for early diagnosis and personalized treatment. This study utilized the genome-wide association study dataset from FinnGen and performed Multi-marker Analysis of GenoMic Annotation (MAGMA) to identify genes significantly associated with EM. Differentially expressed genes (DEGs) were then analyzed, and an intersection selection was conducted to obtain the MAGMA-related DEGs. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses were performed to explore the biological functions of these genes. Summary-data-based Mendelian randomization was used to identify potential risk and protective genes. Subsequently, a machine learning model was used to further select key biomarkers. Single-cell RNA sequencing and consensus clustering were applied to analyze the expression of biomarkers and classify the EM samples into subgroups. Immune infiltration analysis was conducted to evaluate the molecular characteristics of these subgroups. MAGMA analysis identified 2832 genes significantly associated with EM, while 3055 DEGs were detected. Intersection analysis resulted in 437 MAGMA-related DEGs. Summary-data-based Mendelian randomization analysis identified 10 candidate genes, and after further selection using a machine learning model, three core biomarkers were validated: adenosine kinase, enoyl-CoA hydratase/3-hydroxyacyl CoA dehydrogenase, and CCR4-NOT transcription complex subunit 7. Single-cell RNA sequencing revealed the expression patterns of these biomarkers. Consensus clustering analysis classified 77 EM samples into two subgroups, with immune infiltration analysis showing significant differences in immune cell composition among the subgroups. This study successfully identified three core biomarkers for EM: adenosine kinase, enoyl-CoA hydratase/3-hydroxyacyl CoA dehydrogenase, and CCR4-NOT transcription complex subunit 7, which exhibit protective roles in EM.</p>","PeriodicalId":18549,"journal":{"name":"Medicine","volume":"104 14","pages":"e41804"},"PeriodicalIF":1.3000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/MD.0000000000041804","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Endometriosis (EM) significantly impacts the quality of life, and its diagnosis currently relies on surgery, which carries risks and may miss early lesions. Noninvasive biomarkers are urgently needed for early diagnosis and personalized treatment. This study utilized the genome-wide association study dataset from FinnGen and performed Multi-marker Analysis of GenoMic Annotation (MAGMA) to identify genes significantly associated with EM. Differentially expressed genes (DEGs) were then analyzed, and an intersection selection was conducted to obtain the MAGMA-related DEGs. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses were performed to explore the biological functions of these genes. Summary-data-based Mendelian randomization was used to identify potential risk and protective genes. Subsequently, a machine learning model was used to further select key biomarkers. Single-cell RNA sequencing and consensus clustering were applied to analyze the expression of biomarkers and classify the EM samples into subgroups. Immune infiltration analysis was conducted to evaluate the molecular characteristics of these subgroups. MAGMA analysis identified 2832 genes significantly associated with EM, while 3055 DEGs were detected. Intersection analysis resulted in 437 MAGMA-related DEGs. Summary-data-based Mendelian randomization analysis identified 10 candidate genes, and after further selection using a machine learning model, three core biomarkers were validated: adenosine kinase, enoyl-CoA hydratase/3-hydroxyacyl CoA dehydrogenase, and CCR4-NOT transcription complex subunit 7. Single-cell RNA sequencing revealed the expression patterns of these biomarkers. Consensus clustering analysis classified 77 EM samples into two subgroups, with immune infiltration analysis showing significant differences in immune cell composition among the subgroups. This study successfully identified three core biomarkers for EM: adenosine kinase, enoyl-CoA hydratase/3-hydroxyacyl CoA dehydrogenase, and CCR4-NOT transcription complex subunit 7, which exhibit protective roles in EM.

基于汇总数据的孟德尔随机化和机器学习的子宫内膜异位症生物标志物鉴定。
子宫内膜异位症(EM)严重影响生活质量,其诊断目前依赖于手术,这有风险,可能会错过早期病变。迫切需要无创生物标志物进行早期诊断和个性化治疗。本研究利用FinnGen的全基因组关联研究数据集,通过多标记基因组注释分析(Multi-marker Analysis of GenoMic Annotation,简称MAGMA)识别与EM显著相关的基因。然后分析差异表达基因(differential expression genes,简称deg),并进行交叉选择获得与MAGMA相关的deg。通过基因本体和京都基因与基因组百科全书的富集分析来探索这些基因的生物学功能。采用基于汇总数据的孟德尔随机化来识别潜在风险和保护基因。随后,使用机器学习模型进一步选择关键生物标志物。应用单细胞RNA测序和共识聚类分析生物标志物的表达,并将EM样品分类为亚组。通过免疫浸润分析来评价这些亚群的分子特征。MAGMA分析鉴定出2832个与EM显著相关的基因,检测到3055个与EM显著相关的基因。交叉分析得到437个与magma相关的deg。基于摘要数据的孟德尔随机化分析确定了10个候选基因,在使用机器学习模型进一步选择后,验证了三个核心生物标志物:腺苷激酶,烯酰辅酶a水合酶/3-羟基辅酶a脱氢酶和CCR4-NOT转录复合物亚基7。单细胞RNA测序揭示了这些生物标志物的表达模式。共识聚类分析将77例EM样本分为两个亚组,免疫浸润分析显示亚组间免疫细胞组成存在显著差异。本研究成功鉴定了EM的三个核心生物标志物:腺苷激酶、烯酰辅酶a水合酶/3-羟酰基辅酶a脱氢酶和CCR4-NOT转录复合物亚基7,它们在EM中发挥保护作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Medicine
Medicine 医学-医学:内科
CiteScore
2.80
自引率
0.00%
发文量
4342
审稿时长
>12 weeks
期刊介绍: Medicine is now a fully open access journal, providing authors with a distinctive new service offering continuous publication of original research across a broad spectrum of medical scientific disciplines and sub-specialties. As an open access title, Medicine will continue to provide authors with an established, trusted platform for the publication of their work. To ensure the ongoing quality of Medicine’s content, the peer-review process will only accept content that is scientifically, technically and ethically sound, and in compliance with standard reporting guidelines.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信