Bishal Shrestha, Andrew Jordan Siciliano, Hao Zhu, Tong Liu, Zheng Wang
{"title":"scHiGex: predicting single-cell gene expression based on single-cell Hi-C data.","authors":"Bishal Shrestha, Andrew Jordan Siciliano, Hao Zhu, Tong Liu, Zheng Wang","doi":"10.1093/nargab/lqaf002","DOIUrl":null,"url":null,"abstract":"<p><p>A novel biochemistry experiment named HiRES has been developed to capture both the chromosomal conformations and gene expression levels of individual single cells simultaneously. Nevertheless, when compared to the extensive volume of single-cell Hi-C data generated from individual cells, the number of datasets produced from this experiment remains limited in the scientific community. Hence, there is a requirement for a computational tool that can forecast the levels of gene expression in individual cells using single-cell Hi-C data from the same cells. We trained a graph transformer called scHiGex that accurately and effectively predicts gene expression levels based on single-cell Hi-C data. We conducted a benchmark of scHiGex that demonstrated notable performance on the predictions with an average absolute error of 0.07. Furthermore, the predicted levels of gene expression led to precise categorizations (adjusted Rand index score 1) of cells into distinct cell types, demonstrating that our model effectively captured the heterogeneity between individual cell types. scHiGex is freely available at https://github.com/zwang-bioinformatics/scHiGex.</p>","PeriodicalId":33994,"journal":{"name":"NAR Genomics and Bioinformatics","volume":"7 1","pages":"lqaf002"},"PeriodicalIF":4.0000,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11770341/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NAR Genomics and Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/nargab/lqaf002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
A novel biochemistry experiment named HiRES has been developed to capture both the chromosomal conformations and gene expression levels of individual single cells simultaneously. Nevertheless, when compared to the extensive volume of single-cell Hi-C data generated from individual cells, the number of datasets produced from this experiment remains limited in the scientific community. Hence, there is a requirement for a computational tool that can forecast the levels of gene expression in individual cells using single-cell Hi-C data from the same cells. We trained a graph transformer called scHiGex that accurately and effectively predicts gene expression levels based on single-cell Hi-C data. We conducted a benchmark of scHiGex that demonstrated notable performance on the predictions with an average absolute error of 0.07. Furthermore, the predicted levels of gene expression led to precise categorizations (adjusted Rand index score 1) of cells into distinct cell types, demonstrating that our model effectively captured the heterogeneity between individual cell types. scHiGex is freely available at https://github.com/zwang-bioinformatics/scHiGex.