DeSide: A unified deep learning approach for cellular deconvolution of tumor microenvironment

IF 9.4 1区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Xin Xiong, Yerong Liu, Dandan Pu, Zhu Yang, Zedong Bi, Liang Tian, Xuefei Li
{"title":"DeSide: A unified deep learning approach for cellular deconvolution of tumor microenvironment","authors":"Xin Xiong, Yerong Liu, Dandan Pu, Zhu Yang, Zedong Bi, Liang Tian, Xuefei Li","doi":"10.1073/pnas.2407096121","DOIUrl":null,"url":null,"abstract":"Cellular deconvolution via bulk RNA sequencing (RNA-seq) presents a cost-effective and efficient alternative to experimental methods such as flow cytometry and single-cell RNA-seq (scRNA-seq) for analyzing the complex cellular composition of tumor microenvironments. Despite challenges due to heterogeneity within and among tumors, our innovative deep learning–based approach, DeSide, shows exceptional accuracy in estimating the proportions of 16 distinct cell types and subtypes within solid tumors. DeSide integrates biological pathways and assesses noncancerous cell types first, effectively sidestepping the issue of highly variable gene expression profiles (GEPs) associated with cancer cells. By leveraging scRNA-seq data from six cancer types and 185 cancer cell lines across 22 cancer types as references, our method introduces distinctive sampling and filtering techniques to generate a high-quality training set that closely replicates real tumor GEPs, based on The Cancer Genome Atlas (TCGA) bulk RNA-seq data. With this model and high-quality training set, DeSide outperforms existing methods in estimating tumor purity and the proportions of noncancerous cells within solid tumors. Our model precisely predicts cellular compositions across 19 cancer types from TCGA and proves its effectiveness with multiple additional external datasets. Crucially, DeSide enables the identification and analysis of combinatorial cell type pairs, facilitating the stratification of cancer patients into prognostically significant groups. This approach not only provides deeper insights into the dynamics of tumor biology but also highlights potential therapeutic targets by underscoring the importance of specific cell type or subtype interactions.","PeriodicalId":20548,"journal":{"name":"Proceedings of the National Academy of Sciences of the United States of America","volume":"95 1","pages":""},"PeriodicalIF":9.4000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the National Academy of Sciences of the United States of America","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1073/pnas.2407096121","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Cellular deconvolution via bulk RNA sequencing (RNA-seq) presents a cost-effective and efficient alternative to experimental methods such as flow cytometry and single-cell RNA-seq (scRNA-seq) for analyzing the complex cellular composition of tumor microenvironments. Despite challenges due to heterogeneity within and among tumors, our innovative deep learning–based approach, DeSide, shows exceptional accuracy in estimating the proportions of 16 distinct cell types and subtypes within solid tumors. DeSide integrates biological pathways and assesses noncancerous cell types first, effectively sidestepping the issue of highly variable gene expression profiles (GEPs) associated with cancer cells. By leveraging scRNA-seq data from six cancer types and 185 cancer cell lines across 22 cancer types as references, our method introduces distinctive sampling and filtering techniques to generate a high-quality training set that closely replicates real tumor GEPs, based on The Cancer Genome Atlas (TCGA) bulk RNA-seq data. With this model and high-quality training set, DeSide outperforms existing methods in estimating tumor purity and the proportions of noncancerous cells within solid tumors. Our model precisely predicts cellular compositions across 19 cancer types from TCGA and proves its effectiveness with multiple additional external datasets. Crucially, DeSide enables the identification and analysis of combinatorial cell type pairs, facilitating the stratification of cancer patients into prognostically significant groups. This approach not only provides deeper insights into the dynamics of tumor biology but also highlights potential therapeutic targets by underscoring the importance of specific cell type or subtype interactions.
DeSide:用于肿瘤微环境细胞解卷积的统一深度学习方法
通过大容量 RNA 测序(RNA-seq)进行细胞解旋是流式细胞仪和单细胞 RNA-seq(scRNA-seq)等实验方法的一种经济高效的替代方法,可用于分析肿瘤微环境的复杂细胞组成。尽管肿瘤内部和肿瘤之间的异质性带来了挑战,但我们基于深度学习的创新方法 DeSide 在估计实体瘤内 16 种不同细胞类型和亚型的比例方面显示出了超高的准确性。DeSide 整合了生物通路,并首先评估非癌细胞类型,从而有效地避开了与癌细胞相关的基因表达谱(GEP)高度多变的问题。通过利用六种癌症类型的 scRNA-seq 数据和 22 种癌症类型的 185 个癌症细胞系作为参考,我们的方法引入了独特的采样和过滤技术,以癌症基因组图谱(TCGA)的批量 RNA-seq 数据为基础,生成了高质量的训练集,该训练集密切复制了真实的肿瘤 GEPs。有了这个模型和高质量的训练集,DeSide 在估计肿瘤纯度和实体瘤内非癌细胞比例方面优于现有方法。我们的模型能精确预测 TCGA 中 19 种癌症类型的细胞组成,并通过多个额外的外部数据集证明了其有效性。最重要的是,DeSide 能够识别和分析组合细胞类型对,便于将癌症患者分为具有重要预后意义的组别。这种方法不仅能深入了解肿瘤生物学动态,还能通过强调特定细胞类型或亚型相互作用的重要性来突出潜在的治疗目标。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
19.00
自引率
0.90%
发文量
3575
审稿时长
2.5 months
期刊介绍: The Proceedings of the National Academy of Sciences (PNAS), a peer-reviewed journal of the National Academy of Sciences (NAS), serves as an authoritative source for high-impact, original research across the biological, physical, and social sciences. With a global scope, the journal welcomes submissions from researchers worldwide, making it an inclusive platform for advancing scientific knowledge.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信