通过rna测序数据的基因组相互作用编码图像表示揭示组织异质性。

IF 8.1 1区 生物学 Q1 GENETICS & HEREDITY
Junyan Liu,Zixia Zhou,Yizheng Chen,Md Tauhidul Islam,Lei Xing
{"title":"通过rna测序数据的基因组相互作用编码图像表示揭示组织异质性。","authors":"Junyan Liu,Zixia Zhou,Yizheng Chen,Md Tauhidul Islam,Lei Xing","doi":"10.1016/j.ajhg.2025.08.021","DOIUrl":null,"url":null,"abstract":"Genomic sequencing is essential for both biomedical research and clinical practice. While single-cell RNA sequencing (scRNA-seq) provides insights into biological processes at the cellular level, bulk RNA sequencing remains widely used for its scalability and cost-effectiveness. To explore biological heterogeneity, research efforts have been made toward inferring single-cell-like cellular compositions from bulk samples, i.e., deconvolving bulk samples into multiple cell types. However, existing deconvolution methods face two major limitations: (1) reliance on predefined gene signature matrices without accounting for inter-sample variability and (2) susceptibility to noise within biological systems. Here, we propose a cellular-component analysis (CCA) framework by leveraging a genomic-interaction-encoded image representation of RNA-seq data for substantially improved pattern discovery. The framework incorporates sample-specific gene-expression variability and derives signature patterns by utilizing a convolutional variational autoencoder and Gaussian mixture model. An image-domain linear decomposition of bulk RNA-seq data based on these sample-specific, interpretable gene-signature patterns is then performed for CCA and other downstream tasks, such as cancer subtype classification and biomarker discovery. We demonstrate that the proposed technique improves decomposition accuracy by over 14.1% in average Pearson correlation compared to existing techniques by using both simulation and experimental datasets. This approach offers an effective solution for tissue heterogeneity analysis and lays a foundation for a range of clinical and biological applications.","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":"17 1","pages":""},"PeriodicalIF":8.1000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Unveiling tissue heterogeneity through genomic interaction-encoded image representation of RNA-sequencing data.\",\"authors\":\"Junyan Liu,Zixia Zhou,Yizheng Chen,Md Tauhidul Islam,Lei Xing\",\"doi\":\"10.1016/j.ajhg.2025.08.021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Genomic sequencing is essential for both biomedical research and clinical practice. While single-cell RNA sequencing (scRNA-seq) provides insights into biological processes at the cellular level, bulk RNA sequencing remains widely used for its scalability and cost-effectiveness. To explore biological heterogeneity, research efforts have been made toward inferring single-cell-like cellular compositions from bulk samples, i.e., deconvolving bulk samples into multiple cell types. However, existing deconvolution methods face two major limitations: (1) reliance on predefined gene signature matrices without accounting for inter-sample variability and (2) susceptibility to noise within biological systems. Here, we propose a cellular-component analysis (CCA) framework by leveraging a genomic-interaction-encoded image representation of RNA-seq data for substantially improved pattern discovery. The framework incorporates sample-specific gene-expression variability and derives signature patterns by utilizing a convolutional variational autoencoder and Gaussian mixture model. An image-domain linear decomposition of bulk RNA-seq data based on these sample-specific, interpretable gene-signature patterns is then performed for CCA and other downstream tasks, such as cancer subtype classification and biomarker discovery. We demonstrate that the proposed technique improves decomposition accuracy by over 14.1% in average Pearson correlation compared to existing techniques by using both simulation and experimental datasets. This approach offers an effective solution for tissue heterogeneity analysis and lays a foundation for a range of clinical and biological applications.\",\"PeriodicalId\":7659,\"journal\":{\"name\":\"American journal of human genetics\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":8.1000,\"publicationDate\":\"2025-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"American journal of human genetics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1016/j.ajhg.2025.08.021\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of human genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.ajhg.2025.08.021","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

基因组测序对生物医学研究和临床实践都至关重要。虽然单细胞RNA测序(scRNA-seq)提供了对细胞水平生物过程的深入了解,但大量RNA测序因其可扩展性和成本效益而被广泛使用。为了探索生物异质性,研究人员努力从大量样品中推断单细胞样细胞成分,即将大量样品反卷积成多种细胞类型。然而,现有的反卷积方法面临两个主要限制:(1)依赖于预定义的基因特征矩阵,而没有考虑样本间的可变性;(2)生物系统中对噪声的敏感性。在这里,我们提出了一个细胞成分分析(CCA)框架,通过利用RNA-seq数据的基因组相互作用编码图像表示来大大改进模式发现。该框架结合了样本特异性基因表达变异性,并通过使用卷积变分自编码器和高斯混合模型派生签名模式。然后,基于这些样本特异性、可解释的基因特征模式,对大量RNA-seq数据进行图像域线性分解,用于CCA和其他下游任务,如癌症亚型分类和生物标志物发现。我们通过使用模拟和实验数据集证明,与现有技术相比,所提出的技术在平均Pearson相关性方面提高了14.1%以上的分解精度。该方法为组织异质性分析提供了有效的解决方案,并为一系列临床和生物学应用奠定了基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Unveiling tissue heterogeneity through genomic interaction-encoded image representation of RNA-sequencing data.
Genomic sequencing is essential for both biomedical research and clinical practice. While single-cell RNA sequencing (scRNA-seq) provides insights into biological processes at the cellular level, bulk RNA sequencing remains widely used for its scalability and cost-effectiveness. To explore biological heterogeneity, research efforts have been made toward inferring single-cell-like cellular compositions from bulk samples, i.e., deconvolving bulk samples into multiple cell types. However, existing deconvolution methods face two major limitations: (1) reliance on predefined gene signature matrices without accounting for inter-sample variability and (2) susceptibility to noise within biological systems. Here, we propose a cellular-component analysis (CCA) framework by leveraging a genomic-interaction-encoded image representation of RNA-seq data for substantially improved pattern discovery. The framework incorporates sample-specific gene-expression variability and derives signature patterns by utilizing a convolutional variational autoencoder and Gaussian mixture model. An image-domain linear decomposition of bulk RNA-seq data based on these sample-specific, interpretable gene-signature patterns is then performed for CCA and other downstream tasks, such as cancer subtype classification and biomarker discovery. We demonstrate that the proposed technique improves decomposition accuracy by over 14.1% in average Pearson correlation compared to existing techniques by using both simulation and experimental datasets. This approach offers an effective solution for tissue heterogeneity analysis and lays a foundation for a range of clinical and biological applications.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
14.70
自引率
4.10%
发文量
185
审稿时长
1 months
期刊介绍: The American Journal of Human Genetics (AJHG) is a monthly journal published by Cell Press, chosen by The American Society of Human Genetics (ASHG) as its premier publication starting from January 2008. AJHG represents Cell Press's first society-owned journal, and both ASHG and Cell Press anticipate significant synergies between AJHG content and that of other Cell Press titles.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信