利用 scPAFA 在大规模单细胞转录组上发现与疾病相关的多细胞通路模块。

IF 5.2 1区 生物学 Q1 BIOLOGY
Zhuoli Huang, Yuhui Zheng, Weikai Wang, Wenwen Zhou, Yanbo Zhang, Chen Wei, Xiuqing Zhang, Xin Jin, Jianhua Yin
{"title":"利用 scPAFA 在大规模单细胞转录组上发现与疾病相关的多细胞通路模块。","authors":"Zhuoli Huang, Yuhui Zheng, Weikai Wang, Wenwen Zhou, Yanbo Zhang, Chen Wei, Xiuqing Zhang, Xin Jin, Jianhua Yin","doi":"10.1038/s42003-024-07238-7","DOIUrl":null,"url":null,"abstract":"<p><p>Pathway analysis is a crucial analytical phase in disease research on single-cell RNA sequencing (scRNA-seq) data, offering biological interpretations based on prior knowledge. However, currently available tools for generating cell-level pathway activity scores (PAS) exhibit computational inefficacy in large-scale scRNA-seq datasets. Additionally, disease-related pathways are often identified through cross-condition comparisons within specific cell types, overlooking potential patterns that involve multiple cell types. Here, we present single-cell pathway activity factor analysis (scPAFA), a Python library designed for large-scale single-cell datasets allowing rapid PAS computation and uncovering biologically interpretable disease-related multicellular pathway modules, which are low-dimensional representations of disease-related PAS alterations in multiple cell types. Application on colorectal cancer (CRC) datasets and large-scale lupus atlas over 1.2 million cells demonstrated that scPAFA can achieve over 40-fold reductions in the runtime of PAS computation and further identified reliable and interpretable multicellular pathway modules that capture the heterogeneity of CRC and transcriptional abnormalities in lupus patients, respectively. Overall, scPAFA presents a valuable addition to existing research tools in disease research, with the potential to reveal complex disease mechanisms and support biomarker discovery at the pathway level.</p>","PeriodicalId":10552,"journal":{"name":"Communications Biology","volume":"7 1","pages":"1523"},"PeriodicalIF":5.2000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11569158/pdf/","citationCount":"0","resultStr":"{\"title\":\"Uncovering disease-related multicellular pathway modules on large-scale single-cell transcriptomes with scPAFA.\",\"authors\":\"Zhuoli Huang, Yuhui Zheng, Weikai Wang, Wenwen Zhou, Yanbo Zhang, Chen Wei, Xiuqing Zhang, Xin Jin, Jianhua Yin\",\"doi\":\"10.1038/s42003-024-07238-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Pathway analysis is a crucial analytical phase in disease research on single-cell RNA sequencing (scRNA-seq) data, offering biological interpretations based on prior knowledge. However, currently available tools for generating cell-level pathway activity scores (PAS) exhibit computational inefficacy in large-scale scRNA-seq datasets. Additionally, disease-related pathways are often identified through cross-condition comparisons within specific cell types, overlooking potential patterns that involve multiple cell types. Here, we present single-cell pathway activity factor analysis (scPAFA), a Python library designed for large-scale single-cell datasets allowing rapid PAS computation and uncovering biologically interpretable disease-related multicellular pathway modules, which are low-dimensional representations of disease-related PAS alterations in multiple cell types. Application on colorectal cancer (CRC) datasets and large-scale lupus atlas over 1.2 million cells demonstrated that scPAFA can achieve over 40-fold reductions in the runtime of PAS computation and further identified reliable and interpretable multicellular pathway modules that capture the heterogeneity of CRC and transcriptional abnormalities in lupus patients, respectively. Overall, scPAFA presents a valuable addition to existing research tools in disease research, with the potential to reveal complex disease mechanisms and support biomarker discovery at the pathway level.</p>\",\"PeriodicalId\":10552,\"journal\":{\"name\":\"Communications Biology\",\"volume\":\"7 1\",\"pages\":\"1523\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2024-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11569158/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1038/s42003-024-07238-7\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s42003-024-07238-7","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

通路分析是单细胞 RNA 测序(scRNA-seq)数据疾病研究中的一个关键分析阶段,它能根据先前的知识提供生物学解释。然而,目前可用来生成细胞级通路活性评分(PAS)的工具在大规模 scRNA-seq 数据集中表现出计算效率低下的问题。此外,疾病相关通路通常是通过特定细胞类型内的跨条件比较来确定的,忽略了涉及多种细胞类型的潜在模式。在这里,我们介绍了单细胞通路活性因子分析(scPAFA),这是一个专为大规模单细胞数据集设计的 Python 库,可以快速计算 PAS 并发现可从生物学角度解释的疾病相关多细胞通路模块,这些模块是多种细胞类型中疾病相关 PAS 改变的低维表示。在结直肠癌(CRC)数据集和超过120万个细胞的大规模狼疮图谱上的应用表明,scPAFA能将PAS计算的运行时间缩短40倍以上,并进一步确定了可靠且可解释的多细胞通路模块,这些模块分别捕捉了CRC和狼疮患者转录异常的异质性。总之,scPAFA 是对现有疾病研究工具的宝贵补充,有望揭示复杂的疾病机制,支持通路水平的生物标记物发现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Uncovering disease-related multicellular pathway modules on large-scale single-cell transcriptomes with scPAFA.

Pathway analysis is a crucial analytical phase in disease research on single-cell RNA sequencing (scRNA-seq) data, offering biological interpretations based on prior knowledge. However, currently available tools for generating cell-level pathway activity scores (PAS) exhibit computational inefficacy in large-scale scRNA-seq datasets. Additionally, disease-related pathways are often identified through cross-condition comparisons within specific cell types, overlooking potential patterns that involve multiple cell types. Here, we present single-cell pathway activity factor analysis (scPAFA), a Python library designed for large-scale single-cell datasets allowing rapid PAS computation and uncovering biologically interpretable disease-related multicellular pathway modules, which are low-dimensional representations of disease-related PAS alterations in multiple cell types. Application on colorectal cancer (CRC) datasets and large-scale lupus atlas over 1.2 million cells demonstrated that scPAFA can achieve over 40-fold reductions in the runtime of PAS computation and further identified reliable and interpretable multicellular pathway modules that capture the heterogeneity of CRC and transcriptional abnormalities in lupus patients, respectively. Overall, scPAFA presents a valuable addition to existing research tools in disease research, with the potential to reveal complex disease mechanisms and support biomarker discovery at the pathway level.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Communications Biology
Communications Biology Medicine-Medicine (miscellaneous)
CiteScore
8.60
自引率
1.70%
发文量
1233
审稿时长
13 weeks
期刊介绍: Communications Biology is an open access journal from Nature Research publishing high-quality research, reviews and commentary in all areas of the biological sciences. Research papers published by the journal represent significant advances bringing new biological insight to a specialized area of research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信