解开PCOS的隐藏因素:从转录组学到使用WGCNA, LASSO和ROC分析的关键生物标志物

IF 2.5 2区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Harshini Senthilkumar, Mohanapriya Arumugam
{"title":"解开PCOS的隐藏因素:从转录组学到使用WGCNA, LASSO和ROC分析的关键生物标志物","authors":"Harshini Senthilkumar,&nbsp;Mohanapriya Arumugam","doi":"10.1016/j.jsbmb.2025.106795","DOIUrl":null,"url":null,"abstract":"<div><div>Polycystic ovary syndrome (PCOS) is a complex endocrine and metabolic disorder characterized by hormonal imbalance, inflammation and insulin resistance. This study utilized a comprehensive RNA-Seq workflow to uncover the molecular mechanisms underlying PCOS. Data were retrieved from the NCBI- Sequence Read Archive [SRA] repository, followed by quality assessment using FastQC and Trimmomatic trimming. Reads were aligned to the GRCh38 genome using HISAT2 and differential expression analysis was performed by DESeq2 in R, identifying 395 significant DEGs (268 upregulated, 127 downregulated). Gene ontology (GO) analysis revealed dysregulated biological processes including activation of protein kinase activity, telomerase capping, protein serine/ threonine kinase activity and MAP kinase kinase activity, while KEGG pathway analysis highlighted key signaling pathways such as MAPK signaling, Ras signaling, and GnRH signaling. Weighted gene co-expression network analysis (WGCNA) identified modules associated with PCOS traits and Protein-Protein Interaction (PPI) network analysis identified 20 hub genes. Through Least Absolute Shrinkage and Selection Operator (LASSO) regression analysis, five key predictor genes – <em>PRKACA</em>, <em>SREBF2</em>, <em>MAPT</em>, <em>EHMT1</em>, and <em>JUP</em> were identified, highlighting their critical roles in the pathogenesis of PCOS. The prognostic value of these genes was further validated by receiver operating characteristics (ROC) curve analysis, which showed a high area under the curve (AUC) of 0.89. This strong discriminatory power between PCOS and healthy groups highlights the potential of these biomarkers as reliable indicators for diagnosing and understanding the disease. The results highlight the therapeutic and diagnostic potential of these biomarkers and pave the way for precision medicine approaches in PCOS treatment.</div></div>","PeriodicalId":51106,"journal":{"name":"Journal of Steroid Biochemistry and Molecular Biology","volume":"253 ","pages":"Article 106795"},"PeriodicalIF":2.5000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Untangling hidden players of PCOS: From transcriptomics to key biomarkers using WGCNA, LASSO and ROC analysis\",\"authors\":\"Harshini Senthilkumar,&nbsp;Mohanapriya Arumugam\",\"doi\":\"10.1016/j.jsbmb.2025.106795\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Polycystic ovary syndrome (PCOS) is a complex endocrine and metabolic disorder characterized by hormonal imbalance, inflammation and insulin resistance. This study utilized a comprehensive RNA-Seq workflow to uncover the molecular mechanisms underlying PCOS. Data were retrieved from the NCBI- Sequence Read Archive [SRA] repository, followed by quality assessment using FastQC and Trimmomatic trimming. Reads were aligned to the GRCh38 genome using HISAT2 and differential expression analysis was performed by DESeq2 in R, identifying 395 significant DEGs (268 upregulated, 127 downregulated). Gene ontology (GO) analysis revealed dysregulated biological processes including activation of protein kinase activity, telomerase capping, protein serine/ threonine kinase activity and MAP kinase kinase activity, while KEGG pathway analysis highlighted key signaling pathways such as MAPK signaling, Ras signaling, and GnRH signaling. Weighted gene co-expression network analysis (WGCNA) identified modules associated with PCOS traits and Protein-Protein Interaction (PPI) network analysis identified 20 hub genes. Through Least Absolute Shrinkage and Selection Operator (LASSO) regression analysis, five key predictor genes – <em>PRKACA</em>, <em>SREBF2</em>, <em>MAPT</em>, <em>EHMT1</em>, and <em>JUP</em> were identified, highlighting their critical roles in the pathogenesis of PCOS. The prognostic value of these genes was further validated by receiver operating characteristics (ROC) curve analysis, which showed a high area under the curve (AUC) of 0.89. This strong discriminatory power between PCOS and healthy groups highlights the potential of these biomarkers as reliable indicators for diagnosing and understanding the disease. The results highlight the therapeutic and diagnostic potential of these biomarkers and pave the way for precision medicine approaches in PCOS treatment.</div></div>\",\"PeriodicalId\":51106,\"journal\":{\"name\":\"Journal of Steroid Biochemistry and Molecular Biology\",\"volume\":\"253 \",\"pages\":\"Article 106795\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Steroid Biochemistry and Molecular Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0960076025001232\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Steroid Biochemistry and Molecular Biology","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0960076025001232","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

多囊卵巢综合征(PCOS)是一种以激素失衡、炎症和胰岛素抵抗为特征的复杂内分泌代谢疾病。本研究利用全面的RNA-Seq工作流程来揭示多囊卵巢综合征的分子机制。数据从NCBI- Sequence Read Archive [SRA]存储库中检索,随后使用FastQC和Trimmomatic修剪进行质量评估。使用HISAT2将Reads与GRCh38基因组进行比对,并在R中使用DESeq2进行差异表达分析,鉴定出395个显著的DEGs(268个上调,127个下调)。基因本体(GO)分析揭示了包括蛋白激酶活性激活、端粒酶封顶、蛋白丝氨酸/苏氨酸激酶活性和MAP激酶活性在内的生物过程失调,而KEGG途径分析强调了MAPK信号通路、Ras信号通路和GnRH信号通路等关键信号通路。加权基因共表达网络分析(WGCNA)鉴定了与PCOS性状相关的模块,蛋白质-蛋白质相互作用(PPI)网络分析鉴定了20个枢纽基因。通过最小绝对收缩和选择算子(LASSO)回归分析,确定了PRKACA、SREBF2、MAPT、EHMT1和JUP 5个关键预测基因,突出了它们在PCOS发病机制中的关键作用。通过受试者工作特征(ROC)曲线分析进一步验证了这些基因的预后价值,曲线下面积(AUC)高达0.89。多囊卵巢综合征和健康人群之间的这种强烈的区分力突出了这些生物标志物作为诊断和了解该疾病的可靠指标的潜力。结果突出了这些生物标志物的治疗和诊断潜力,并为PCOS治疗的精准医学方法铺平了道路。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Untangling hidden players of PCOS: From transcriptomics to key biomarkers using WGCNA, LASSO and ROC analysis
Polycystic ovary syndrome (PCOS) is a complex endocrine and metabolic disorder characterized by hormonal imbalance, inflammation and insulin resistance. This study utilized a comprehensive RNA-Seq workflow to uncover the molecular mechanisms underlying PCOS. Data were retrieved from the NCBI- Sequence Read Archive [SRA] repository, followed by quality assessment using FastQC and Trimmomatic trimming. Reads were aligned to the GRCh38 genome using HISAT2 and differential expression analysis was performed by DESeq2 in R, identifying 395 significant DEGs (268 upregulated, 127 downregulated). Gene ontology (GO) analysis revealed dysregulated biological processes including activation of protein kinase activity, telomerase capping, protein serine/ threonine kinase activity and MAP kinase kinase activity, while KEGG pathway analysis highlighted key signaling pathways such as MAPK signaling, Ras signaling, and GnRH signaling. Weighted gene co-expression network analysis (WGCNA) identified modules associated with PCOS traits and Protein-Protein Interaction (PPI) network analysis identified 20 hub genes. Through Least Absolute Shrinkage and Selection Operator (LASSO) regression analysis, five key predictor genes – PRKACA, SREBF2, MAPT, EHMT1, and JUP were identified, highlighting their critical roles in the pathogenesis of PCOS. The prognostic value of these genes was further validated by receiver operating characteristics (ROC) curve analysis, which showed a high area under the curve (AUC) of 0.89. This strong discriminatory power between PCOS and healthy groups highlights the potential of these biomarkers as reliable indicators for diagnosing and understanding the disease. The results highlight the therapeutic and diagnostic potential of these biomarkers and pave the way for precision medicine approaches in PCOS treatment.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
8.60
自引率
2.40%
发文量
113
审稿时长
46 days
期刊介绍: The Journal of Steroid Biochemistry and Molecular Biology is devoted to new experimental and theoretical developments in areas related to steroids including vitamin D, lipids and their metabolomics. The Journal publishes a variety of contributions, including original articles, general and focused reviews, and rapid communications (brief articles of particular interest and clear novelty). Selected cutting-edge topics will be addressed in Special Issues managed by Guest Editors. Special Issues will contain both commissioned reviews and original research papers to provide comprehensive coverage of specific topics, and all submissions will undergo rigorous peer-review prior to publication.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信