Chunyang Fan, Wei Xu, Xuefeng Li, Jiale Wang, Wei He, Meng Shen, Di Hua, Yao Zhang, Ye Gu, Xiexing Wu, Haiqing Mao
{"title":"Integrated bulk and single-cell RNA sequencing to identify potential biomarkers in intervertebral disc degeneration.","authors":"Chunyang Fan, Wei Xu, Xuefeng Li, Jiale Wang, Wei He, Meng Shen, Di Hua, Yao Zhang, Ye Gu, Xiexing Wu, Haiqing Mao","doi":"10.1186/s40001-025-02346-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Nucleus pulposus (NP) deterioration plays a significant role in the development of intervertebral disc degeneration (IVDD) and low back pain (LBP). This paper aims to identify potential genes within degenerated NP tissue and elucidate the pathogenesis of IVDD through bioinformatics analysis.</p><p><strong>Methods: </strong>We conducted a transcriptomic analysis of patient's degenerative NP tissue employing advanced bioinformatics techniques and machine learning algorithms. Utilizing hdWGCNA, we successfully acquired WGCNA single-cell sequencing data and pinpointed crucial genes implicated in IVDD. Subsequently, we employed the Monocle3 package to perform pseudotime sequence analysis, enabling the identification of genes associated with the differentiation and developmental processes of NP tissue. Following this, normalized and logarithmically transformed the bulk sequencing data. Subsequently, we conducted preliminary screening using single-factor logistic regression on the genes derived from single-cell sequencing. Next, we applied two machine learning techniques, namely, SVM-RFE and random forest, to discern pivotal pathogenic genes. Finally, we used validation sets to verify trends and qualitativeness and performed in vitro and in vivo validation analyses of normal and degenerative NP tissues.</p><p><strong>Results: </strong>909 genes associated with IVDD were identified through hdWGCNA, while pseudotime sequence analysis uncovered 1964 genes related to differentiation and developmental processes. The two had 208 genes in common. Subsequently, we conducted an initial screening of single-cell genes by integrating the bulk database with single logistic regression. Next, we utilized machine learning techniques to identify the IVDD genes CDH, DPH5, and SELENOF. PCR analysis confirmed that the expression of CDH and DPH5 in degraded nucleus pulposus cells (NPCs) was decreased by 31% and 28% in vivo, and 36% and 29% in vitro, respectively, while SELENOF showed the opposite trend. Furthermore, IVDD was validated through imaging and histological staining.</p><p><strong>Conclusion: </strong>As pathogenic genes in IVDD, our findings indicate that CTH, DPH5, and SELENOF are important players and might be promising therapeutic targets for IVDD treatment.</p>","PeriodicalId":11949,"journal":{"name":"European Journal of Medical Research","volume":"30 1","pages":"102"},"PeriodicalIF":2.8000,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11827443/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Medical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40001-025-02346-4","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Nucleus pulposus (NP) deterioration plays a significant role in the development of intervertebral disc degeneration (IVDD) and low back pain (LBP). This paper aims to identify potential genes within degenerated NP tissue and elucidate the pathogenesis of IVDD through bioinformatics analysis.
Methods: We conducted a transcriptomic analysis of patient's degenerative NP tissue employing advanced bioinformatics techniques and machine learning algorithms. Utilizing hdWGCNA, we successfully acquired WGCNA single-cell sequencing data and pinpointed crucial genes implicated in IVDD. Subsequently, we employed the Monocle3 package to perform pseudotime sequence analysis, enabling the identification of genes associated with the differentiation and developmental processes of NP tissue. Following this, normalized and logarithmically transformed the bulk sequencing data. Subsequently, we conducted preliminary screening using single-factor logistic regression on the genes derived from single-cell sequencing. Next, we applied two machine learning techniques, namely, SVM-RFE and random forest, to discern pivotal pathogenic genes. Finally, we used validation sets to verify trends and qualitativeness and performed in vitro and in vivo validation analyses of normal and degenerative NP tissues.
Results: 909 genes associated with IVDD were identified through hdWGCNA, while pseudotime sequence analysis uncovered 1964 genes related to differentiation and developmental processes. The two had 208 genes in common. Subsequently, we conducted an initial screening of single-cell genes by integrating the bulk database with single logistic regression. Next, we utilized machine learning techniques to identify the IVDD genes CDH, DPH5, and SELENOF. PCR analysis confirmed that the expression of CDH and DPH5 in degraded nucleus pulposus cells (NPCs) was decreased by 31% and 28% in vivo, and 36% and 29% in vitro, respectively, while SELENOF showed the opposite trend. Furthermore, IVDD was validated through imaging and histological staining.
Conclusion: As pathogenic genes in IVDD, our findings indicate that CTH, DPH5, and SELENOF are important players and might be promising therapeutic targets for IVDD treatment.
期刊介绍:
European Journal of Medical Research publishes translational and clinical research of international interest across all medical disciplines, enabling clinicians and other researchers to learn about developments and innovations within these disciplines and across the boundaries between disciplines. The journal publishes high quality research and reviews and aims to ensure that the results of all well-conducted research are published, regardless of their outcome.