{"title":"通过rna测序数据的基因组相互作用编码图像表示揭示组织异质性。","authors":"Junyan Liu,Zixia Zhou,Yizheng Chen,Md Tauhidul Islam,Lei Xing","doi":"10.1016/j.ajhg.2025.08.021","DOIUrl":null,"url":null,"abstract":"Genomic sequencing is essential for both biomedical research and clinical practice. While single-cell RNA sequencing (scRNA-seq) provides insights into biological processes at the cellular level, bulk RNA sequencing remains widely used for its scalability and cost-effectiveness. To explore biological heterogeneity, research efforts have been made toward inferring single-cell-like cellular compositions from bulk samples, i.e., deconvolving bulk samples into multiple cell types. However, existing deconvolution methods face two major limitations: (1) reliance on predefined gene signature matrices without accounting for inter-sample variability and (2) susceptibility to noise within biological systems. Here, we propose a cellular-component analysis (CCA) framework by leveraging a genomic-interaction-encoded image representation of RNA-seq data for substantially improved pattern discovery. The framework incorporates sample-specific gene-expression variability and derives signature patterns by utilizing a convolutional variational autoencoder and Gaussian mixture model. An image-domain linear decomposition of bulk RNA-seq data based on these sample-specific, interpretable gene-signature patterns is then performed for CCA and other downstream tasks, such as cancer subtype classification and biomarker discovery. We demonstrate that the proposed technique improves decomposition accuracy by over 14.1% in average Pearson correlation compared to existing techniques by using both simulation and experimental datasets. This approach offers an effective solution for tissue heterogeneity analysis and lays a foundation for a range of clinical and biological applications.","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":"17 1","pages":""},"PeriodicalIF":8.1000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Unveiling tissue heterogeneity through genomic interaction-encoded image representation of RNA-sequencing data.\",\"authors\":\"Junyan Liu,Zixia Zhou,Yizheng Chen,Md Tauhidul Islam,Lei Xing\",\"doi\":\"10.1016/j.ajhg.2025.08.021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Genomic sequencing is essential for both biomedical research and clinical practice. While single-cell RNA sequencing (scRNA-seq) provides insights into biological processes at the cellular level, bulk RNA sequencing remains widely used for its scalability and cost-effectiveness. To explore biological heterogeneity, research efforts have been made toward inferring single-cell-like cellular compositions from bulk samples, i.e., deconvolving bulk samples into multiple cell types. However, existing deconvolution methods face two major limitations: (1) reliance on predefined gene signature matrices without accounting for inter-sample variability and (2) susceptibility to noise within biological systems. Here, we propose a cellular-component analysis (CCA) framework by leveraging a genomic-interaction-encoded image representation of RNA-seq data for substantially improved pattern discovery. The framework incorporates sample-specific gene-expression variability and derives signature patterns by utilizing a convolutional variational autoencoder and Gaussian mixture model. An image-domain linear decomposition of bulk RNA-seq data based on these sample-specific, interpretable gene-signature patterns is then performed for CCA and other downstream tasks, such as cancer subtype classification and biomarker discovery. We demonstrate that the proposed technique improves decomposition accuracy by over 14.1% in average Pearson correlation compared to existing techniques by using both simulation and experimental datasets. This approach offers an effective solution for tissue heterogeneity analysis and lays a foundation for a range of clinical and biological applications.\",\"PeriodicalId\":7659,\"journal\":{\"name\":\"American journal of human genetics\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":8.1000,\"publicationDate\":\"2025-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"American journal of human genetics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1016/j.ajhg.2025.08.021\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of human genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.ajhg.2025.08.021","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
Unveiling tissue heterogeneity through genomic interaction-encoded image representation of RNA-sequencing data.
Genomic sequencing is essential for both biomedical research and clinical practice. While single-cell RNA sequencing (scRNA-seq) provides insights into biological processes at the cellular level, bulk RNA sequencing remains widely used for its scalability and cost-effectiveness. To explore biological heterogeneity, research efforts have been made toward inferring single-cell-like cellular compositions from bulk samples, i.e., deconvolving bulk samples into multiple cell types. However, existing deconvolution methods face two major limitations: (1) reliance on predefined gene signature matrices without accounting for inter-sample variability and (2) susceptibility to noise within biological systems. Here, we propose a cellular-component analysis (CCA) framework by leveraging a genomic-interaction-encoded image representation of RNA-seq data for substantially improved pattern discovery. The framework incorporates sample-specific gene-expression variability and derives signature patterns by utilizing a convolutional variational autoencoder and Gaussian mixture model. An image-domain linear decomposition of bulk RNA-seq data based on these sample-specific, interpretable gene-signature patterns is then performed for CCA and other downstream tasks, such as cancer subtype classification and biomarker discovery. We demonstrate that the proposed technique improves decomposition accuracy by over 14.1% in average Pearson correlation compared to existing techniques by using both simulation and experimental datasets. This approach offers an effective solution for tissue heterogeneity analysis and lays a foundation for a range of clinical and biological applications.
期刊介绍:
The American Journal of Human Genetics (AJHG) is a monthly journal published by Cell Press, chosen by The American Society of Human Genetics (ASHG) as its premier publication starting from January 2008. AJHG represents Cell Press's first society-owned journal, and both ASHG and Cell Press anticipate significant synergies between AJHG content and that of other Cell Press titles.