鉴别特征基因和缺陷体质:基于生物信息学和机器学习的综合分析。

Long Xi, W U Zixuan, Y U Yunfeng, Lin Jie, Peng Qinghua
{"title":"鉴别特征基因和缺陷体质:基于生物信息学和机器学习的综合分析。","authors":"Long Xi, W U Zixuan, Y U Yunfeng, Lin Jie, Peng Qinghua","doi":"10.19852/j.cnki.jtcm.2025.04.021","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To utilize the Traditional Chinese Medicine constitution (TCMC) as a complementary and alternative approach for early disease detection and treatment, with a focus on <i>Yin</i> and <i>Yang</i> deficiency constitutions, which serve as key references for disease prevention and management.</p><p><strong>Methods: </strong>The dataset containing the data of <i>Yin</i> and <i>Yang</i> deficiency constitution was identified through the Gene Expression Omnibus database. This database was used for differential expression genes (DEGs) analysis and weighted gene co-expression network analysis (WGCNA), and the characteristic genes were then obtained in the dataset using a machine learning method. The hub genes of <i>Yin</i> and <i>Yang</i> deficiency constitution were obtained after analysis using the above three methods, and the hub genes were enriched and analyzed. Subsequently, the hub genes of <i>Yin</i> and <i>Yang</i> deficiency constitution were validated using external datasets. Receiver operating characteristic (ROC) analysis was used on each hub genes of the two groups to further understand their diagnostic performance. The miRNA-lncRNA-gene network was used to further analyze the hub genes. Immunoinfiltration and gene set enrichment analysis were performed on the shared hub genes.</p><p><strong>Results: </strong>The GSE87474 dataset was used for DEGs analysis and WGCNA. Using machine learning analyses, we identified 15 and 14 hub genes for <i>Yin</i> and <i>Yang</i> deficiency constitutions, respectively. The results of enrichment analyses showed that <i>Yin</i> deficiency constitution was associated with interleukin-17 signaling pathway, whereas <i>Yang</i> deficiency constitution was associated with glycosaminoglycan biosynthesis-keratan sulfate. The validation dataset GSE56116 showed statistically significant data for s-adenosylmethionine sensor upstream of MTORC1 (SAMTOR, also named C7orf60), cofilin 2 (CFL2), cytohesin 1 interacting protein (CYTIP), G protein-coupled receptor 183 (GPR183), hippocampus abundant transcript 1 (HIAT1), kelch like family member 15 (KLHL15), mitogen-activated protein kinase 6 (MAPK6), and prostaglandin-endoperoxide synthase 2 (PTGS2) in <i>Yin</i> deficiency and fucosy-ltransferase 8 (FUT8), TATA-box binding protein associated factor, RNA polymerase I subunit D (TAF1D), zinc finger protein 24 (ZNF24), MAPK6, and leptin receptor overlapping transcript like 1 (LEPROTL1) in <i>Yang</i> deficiency. The ROC results indicated that these genes have diagnostic value. MAPK6 is a shared hub gene for <i>Yin</i> and <i>Yang</i> deficiencies.</p><p><strong>Conclusions: </strong>This study identified C7orf60, CFL2, CYTIP, GPR183, HIAT1, KLHL15, MAPK6, and PTGS2 in <i>Yin</i> deficiency and FUT8, TAF1D, ZNF24, MAPK6, and LEPROTL1 in <i>Yang</i> deficiency as potential biomarkers, providing insights into their pathogenesis. This theory not only guides the diagnostic approach in TCM but also extends its influence to various scientific research fields.</p>","PeriodicalId":94119,"journal":{"name":"Journal of traditional Chinese medicine = Chung i tsa chih ying wen pan","volume":"45 4","pages":"909-921"},"PeriodicalIF":0.0000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12340588/pdf/","citationCount":"0","resultStr":"{\"title\":\"Identification of characteristic genes ofanddeficiency constitutions: an integrated analysis based on bioinformatics and machine learning.\",\"authors\":\"Long Xi, W U Zixuan, Y U Yunfeng, Lin Jie, Peng Qinghua\",\"doi\":\"10.19852/j.cnki.jtcm.2025.04.021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>To utilize the Traditional Chinese Medicine constitution (TCMC) as a complementary and alternative approach for early disease detection and treatment, with a focus on <i>Yin</i> and <i>Yang</i> deficiency constitutions, which serve as key references for disease prevention and management.</p><p><strong>Methods: </strong>The dataset containing the data of <i>Yin</i> and <i>Yang</i> deficiency constitution was identified through the Gene Expression Omnibus database. This database was used for differential expression genes (DEGs) analysis and weighted gene co-expression network analysis (WGCNA), and the characteristic genes were then obtained in the dataset using a machine learning method. The hub genes of <i>Yin</i> and <i>Yang</i> deficiency constitution were obtained after analysis using the above three methods, and the hub genes were enriched and analyzed. Subsequently, the hub genes of <i>Yin</i> and <i>Yang</i> deficiency constitution were validated using external datasets. Receiver operating characteristic (ROC) analysis was used on each hub genes of the two groups to further understand their diagnostic performance. The miRNA-lncRNA-gene network was used to further analyze the hub genes. Immunoinfiltration and gene set enrichment analysis were performed on the shared hub genes.</p><p><strong>Results: </strong>The GSE87474 dataset was used for DEGs analysis and WGCNA. Using machine learning analyses, we identified 15 and 14 hub genes for <i>Yin</i> and <i>Yang</i> deficiency constitutions, respectively. The results of enrichment analyses showed that <i>Yin</i> deficiency constitution was associated with interleukin-17 signaling pathway, whereas <i>Yang</i> deficiency constitution was associated with glycosaminoglycan biosynthesis-keratan sulfate. The validation dataset GSE56116 showed statistically significant data for s-adenosylmethionine sensor upstream of MTORC1 (SAMTOR, also named C7orf60), cofilin 2 (CFL2), cytohesin 1 interacting protein (CYTIP), G protein-coupled receptor 183 (GPR183), hippocampus abundant transcript 1 (HIAT1), kelch like family member 15 (KLHL15), mitogen-activated protein kinase 6 (MAPK6), and prostaglandin-endoperoxide synthase 2 (PTGS2) in <i>Yin</i> deficiency and fucosy-ltransferase 8 (FUT8), TATA-box binding protein associated factor, RNA polymerase I subunit D (TAF1D), zinc finger protein 24 (ZNF24), MAPK6, and leptin receptor overlapping transcript like 1 (LEPROTL1) in <i>Yang</i> deficiency. The ROC results indicated that these genes have diagnostic value. MAPK6 is a shared hub gene for <i>Yin</i> and <i>Yang</i> deficiencies.</p><p><strong>Conclusions: </strong>This study identified C7orf60, CFL2, CYTIP, GPR183, HIAT1, KLHL15, MAPK6, and PTGS2 in <i>Yin</i> deficiency and FUT8, TAF1D, ZNF24, MAPK6, and LEPROTL1 in <i>Yang</i> deficiency as potential biomarkers, providing insights into their pathogenesis. This theory not only guides the diagnostic approach in TCM but also extends its influence to various scientific research fields.</p>\",\"PeriodicalId\":94119,\"journal\":{\"name\":\"Journal of traditional Chinese medicine = Chung i tsa chih ying wen pan\",\"volume\":\"45 4\",\"pages\":\"909-921\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12340588/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of traditional Chinese medicine = Chung i tsa chih ying wen pan\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.19852/j.cnki.jtcm.2025.04.021\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of traditional Chinese medicine = Chung i tsa chih ying wen pan","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.19852/j.cnki.jtcm.2025.04.021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目的:利用中医体质作为疾病早期发现和治疗的补充和替代方法,重点关注阴阳亏虚体质,为疾病预防和管理提供关键参考。方法:通过基因表达综合数据库对阴阳虚体质数据集进行识别。将该数据库用于差异表达基因(DEGs)分析和加权基因共表达网络分析(WGCNA),然后使用机器学习方法在数据集中获得特征基因。通过以上三种方法分析得到了阴阳虚体质的枢纽基因,并对枢纽基因进行了富集分析。随后,利用外部数据集对阴阳虚体质中心基因进行验证。采用受试者工作特征(Receiver operating characteristic, ROC)分析两组各枢纽基因,进一步了解其诊断效能。利用mirna - lncrna -基因网络进一步分析枢纽基因。对共享枢纽基因进行免疫浸润和基因集富集分析。结果:使用GSE87474数据集进行DEGs分析和WGCNA。利用机器学习分析,我们分别鉴定了15个和14个阴阳亏虚体质的中心基因。富集分析结果表明,阴虚体质与白细胞介素-17信号通路有关,而阳虚体质与糖胺聚糖生物合成-硫酸角蛋白聚糖信号通路有关。验证数据集GSE56116显示,MTORC1上游s-腺苷甲硫氨酸传感器(SAMTOR,也称为C7orf60)、cofilin 2 (CFL2)、cytohesin 1相互作用蛋白(CYTIP)、G蛋白偶联受体183 (GPR183)、海马丰富转录物1 (HIAT1)、kelch样家族成员15 (KLHL15)、丝裂原活化蛋白激酶6 (MAPK6)、阴虚中前列腺素内过氧化物合成酶2 (PTGS2)和聚焦转移酶8 (FUT8)的数据具有统计学意义。阳虚患者TATA-box结合蛋白相关因子、RNA聚合酶I亚基D (TAF1D)、锌指蛋白24 (ZNF24)、MAPK6、瘦素受体重叠转录样1 (LEPROTL1)。ROC结果显示这些基因具有诊断价值。MAPK6是阴阳不足的共享中心基因。结论:本研究确定阴虚患者的C7orf60、CFL2、CYTIP、GPR183、HIAT1、KLHL15、MAPK6和PTGS2,阳虚患者的FUT8、TAF1D、ZNF24、MAPK6和LEPROTL1作为潜在的生物标志物,为其发病机制提供了新的认识。这一理论不仅指导着中医的诊断方法,而且影响着各个科学研究领域。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Identification of characteristic genes ofanddeficiency constitutions: an integrated analysis based on bioinformatics and machine learning.

Objective: To utilize the Traditional Chinese Medicine constitution (TCMC) as a complementary and alternative approach for early disease detection and treatment, with a focus on Yin and Yang deficiency constitutions, which serve as key references for disease prevention and management.

Methods: The dataset containing the data of Yin and Yang deficiency constitution was identified through the Gene Expression Omnibus database. This database was used for differential expression genes (DEGs) analysis and weighted gene co-expression network analysis (WGCNA), and the characteristic genes were then obtained in the dataset using a machine learning method. The hub genes of Yin and Yang deficiency constitution were obtained after analysis using the above three methods, and the hub genes were enriched and analyzed. Subsequently, the hub genes of Yin and Yang deficiency constitution were validated using external datasets. Receiver operating characteristic (ROC) analysis was used on each hub genes of the two groups to further understand their diagnostic performance. The miRNA-lncRNA-gene network was used to further analyze the hub genes. Immunoinfiltration and gene set enrichment analysis were performed on the shared hub genes.

Results: The GSE87474 dataset was used for DEGs analysis and WGCNA. Using machine learning analyses, we identified 15 and 14 hub genes for Yin and Yang deficiency constitutions, respectively. The results of enrichment analyses showed that Yin deficiency constitution was associated with interleukin-17 signaling pathway, whereas Yang deficiency constitution was associated with glycosaminoglycan biosynthesis-keratan sulfate. The validation dataset GSE56116 showed statistically significant data for s-adenosylmethionine sensor upstream of MTORC1 (SAMTOR, also named C7orf60), cofilin 2 (CFL2), cytohesin 1 interacting protein (CYTIP), G protein-coupled receptor 183 (GPR183), hippocampus abundant transcript 1 (HIAT1), kelch like family member 15 (KLHL15), mitogen-activated protein kinase 6 (MAPK6), and prostaglandin-endoperoxide synthase 2 (PTGS2) in Yin deficiency and fucosy-ltransferase 8 (FUT8), TATA-box binding protein associated factor, RNA polymerase I subunit D (TAF1D), zinc finger protein 24 (ZNF24), MAPK6, and leptin receptor overlapping transcript like 1 (LEPROTL1) in Yang deficiency. The ROC results indicated that these genes have diagnostic value. MAPK6 is a shared hub gene for Yin and Yang deficiencies.

Conclusions: This study identified C7orf60, CFL2, CYTIP, GPR183, HIAT1, KLHL15, MAPK6, and PTGS2 in Yin deficiency and FUT8, TAF1D, ZNF24, MAPK6, and LEPROTL1 in Yang deficiency as potential biomarkers, providing insights into their pathogenesis. This theory not only guides the diagnostic approach in TCM but also extends its influence to various scientific research fields.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信