Ziwei Xie, Yuxin Feng, Yue He, Yingying Lin, Xiaohong Wang
{"title":"Identification of biomarkers for endometriosis based on summary-data-based Mendelian randomization and machine learning.","authors":"Ziwei Xie, Yuxin Feng, Yue He, Yingying Lin, Xiaohong Wang","doi":"10.1097/MD.0000000000041804","DOIUrl":null,"url":null,"abstract":"<p><p>Endometriosis (EM) significantly impacts the quality of life, and its diagnosis currently relies on surgery, which carries risks and may miss early lesions. Noninvasive biomarkers are urgently needed for early diagnosis and personalized treatment. This study utilized the genome-wide association study dataset from FinnGen and performed Multi-marker Analysis of GenoMic Annotation (MAGMA) to identify genes significantly associated with EM. Differentially expressed genes (DEGs) were then analyzed, and an intersection selection was conducted to obtain the MAGMA-related DEGs. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses were performed to explore the biological functions of these genes. Summary-data-based Mendelian randomization was used to identify potential risk and protective genes. Subsequently, a machine learning model was used to further select key biomarkers. Single-cell RNA sequencing and consensus clustering were applied to analyze the expression of biomarkers and classify the EM samples into subgroups. Immune infiltration analysis was conducted to evaluate the molecular characteristics of these subgroups. MAGMA analysis identified 2832 genes significantly associated with EM, while 3055 DEGs were detected. Intersection analysis resulted in 437 MAGMA-related DEGs. Summary-data-based Mendelian randomization analysis identified 10 candidate genes, and after further selection using a machine learning model, three core biomarkers were validated: adenosine kinase, enoyl-CoA hydratase/3-hydroxyacyl CoA dehydrogenase, and CCR4-NOT transcription complex subunit 7. Single-cell RNA sequencing revealed the expression patterns of these biomarkers. Consensus clustering analysis classified 77 EM samples into two subgroups, with immune infiltration analysis showing significant differences in immune cell composition among the subgroups. This study successfully identified three core biomarkers for EM: adenosine kinase, enoyl-CoA hydratase/3-hydroxyacyl CoA dehydrogenase, and CCR4-NOT transcription complex subunit 7, which exhibit protective roles in EM.</p>","PeriodicalId":18549,"journal":{"name":"Medicine","volume":"104 14","pages":"e41804"},"PeriodicalIF":1.3000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/MD.0000000000041804","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0
Abstract
Endometriosis (EM) significantly impacts the quality of life, and its diagnosis currently relies on surgery, which carries risks and may miss early lesions. Noninvasive biomarkers are urgently needed for early diagnosis and personalized treatment. This study utilized the genome-wide association study dataset from FinnGen and performed Multi-marker Analysis of GenoMic Annotation (MAGMA) to identify genes significantly associated with EM. Differentially expressed genes (DEGs) were then analyzed, and an intersection selection was conducted to obtain the MAGMA-related DEGs. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses were performed to explore the biological functions of these genes. Summary-data-based Mendelian randomization was used to identify potential risk and protective genes. Subsequently, a machine learning model was used to further select key biomarkers. Single-cell RNA sequencing and consensus clustering were applied to analyze the expression of biomarkers and classify the EM samples into subgroups. Immune infiltration analysis was conducted to evaluate the molecular characteristics of these subgroups. MAGMA analysis identified 2832 genes significantly associated with EM, while 3055 DEGs were detected. Intersection analysis resulted in 437 MAGMA-related DEGs. Summary-data-based Mendelian randomization analysis identified 10 candidate genes, and after further selection using a machine learning model, three core biomarkers were validated: adenosine kinase, enoyl-CoA hydratase/3-hydroxyacyl CoA dehydrogenase, and CCR4-NOT transcription complex subunit 7. Single-cell RNA sequencing revealed the expression patterns of these biomarkers. Consensus clustering analysis classified 77 EM samples into two subgroups, with immune infiltration analysis showing significant differences in immune cell composition among the subgroups. This study successfully identified three core biomarkers for EM: adenosine kinase, enoyl-CoA hydratase/3-hydroxyacyl CoA dehydrogenase, and CCR4-NOT transcription complex subunit 7, which exhibit protective roles in EM.
期刊介绍:
Medicine is now a fully open access journal, providing authors with a distinctive new service offering continuous publication of original research across a broad spectrum of medical scientific disciplines and sub-specialties.
As an open access title, Medicine will continue to provide authors with an established, trusted platform for the publication of their work. To ensure the ongoing quality of Medicine’s content, the peer-review process will only accept content that is scientifically, technically and ethically sound, and in compliance with standard reporting guidelines.