利用分层 Dirichlet 过程对单细胞 RNA 测序数据集进行共享差异聚类

IF 2 Q2 ECONOMICS
Jinlu Liu, Sara Wade, Natalia Bochkina
{"title":"利用分层 Dirichlet 过程对单细胞 RNA 测序数据集进行共享差异聚类","authors":"Jinlu Liu, Sara Wade, Natalia Bochkina","doi":"10.1016/j.ecosta.2024.02.001","DOIUrl":null,"url":null,"abstract":"Single-cell RNA sequencing (scRNA-seq) is a powerful technology that allows researchers to understand gene expression patterns at the single-cell level and uncover the heterogeneous nature of cells. Clustering is an important tool in scRNA-seq analysis to discover groups of cells with similar gene expression patterns and identify potential cell types. Integration of multiple scRNA-seq datasets is a pressing challenge, and in this direction, a novel model is developed to extend clustering methods to appropriately combine inference across multiple datasets. The model simultaneously addresses normalization to deal with the inherent noise and uncertainty in scRNA-seq, infers cell types, and integrates multiple datasets for shared clustering in principled manner through a hierarchical Bayesian framework. A Gibbs sampler is developed that copes with the high-dimensionality of scRNA-seq through consensus clustering. The methodological developments are driven by experimental data from embryonic cells, with the aim of understanding the role of PAX6 in prenatal development, and more specifically how cell-subtypes and their proportions change when knocking out this factor.","PeriodicalId":54125,"journal":{"name":"Econometrics and Statistics","volume":"56 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2024-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Shared Differential Clustering across Single-cell RNA Sequencing Datasets with the Hierarchical Dirichlet Process\",\"authors\":\"Jinlu Liu, Sara Wade, Natalia Bochkina\",\"doi\":\"10.1016/j.ecosta.2024.02.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Single-cell RNA sequencing (scRNA-seq) is a powerful technology that allows researchers to understand gene expression patterns at the single-cell level and uncover the heterogeneous nature of cells. Clustering is an important tool in scRNA-seq analysis to discover groups of cells with similar gene expression patterns and identify potential cell types. Integration of multiple scRNA-seq datasets is a pressing challenge, and in this direction, a novel model is developed to extend clustering methods to appropriately combine inference across multiple datasets. The model simultaneously addresses normalization to deal with the inherent noise and uncertainty in scRNA-seq, infers cell types, and integrates multiple datasets for shared clustering in principled manner through a hierarchical Bayesian framework. A Gibbs sampler is developed that copes with the high-dimensionality of scRNA-seq through consensus clustering. The methodological developments are driven by experimental data from embryonic cells, with the aim of understanding the role of PAX6 in prenatal development, and more specifically how cell-subtypes and their proportions change when knocking out this factor.\",\"PeriodicalId\":54125,\"journal\":{\"name\":\"Econometrics and Statistics\",\"volume\":\"56 1\",\"pages\":\"\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2024-02-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Econometrics and Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1016/j.ecosta.2024.02.001\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Econometrics and Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.ecosta.2024.02.001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ECONOMICS","Score":null,"Total":0}
引用次数: 0

摘要

单细胞 RNA 测序(scRNA-seq)是一项功能强大的技术,可让研究人员了解单细胞水平的基因表达模式,并揭示细胞的异质性。聚类是 scRNA-seq 分析中的一个重要工具,可用于发现具有相似基因表达模式的细胞群,并识别潜在的细胞类型。整合多个 scRNA-seq 数据集是一个紧迫的挑战,为此,我们开发了一个新模型来扩展聚类方法,以适当地结合多个数据集的推论。该模型同时解决了归一化问题,以处理 scRNA-seq 中固有的噪声和不确定性,推断细胞类型,并通过分层贝叶斯框架,以原则性的方式整合多个数据集,实现共享聚类。开发的吉布斯采样器可通过共识聚类来应对 scRNA-seq 的高维性。方法论的发展是由胚胎细胞的实验数据驱动的,目的是了解 PAX6 在产前发育中的作用,更具体地说,当敲除该因子时,细胞亚型及其比例是如何变化的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Shared Differential Clustering across Single-cell RNA Sequencing Datasets with the Hierarchical Dirichlet Process
Single-cell RNA sequencing (scRNA-seq) is a powerful technology that allows researchers to understand gene expression patterns at the single-cell level and uncover the heterogeneous nature of cells. Clustering is an important tool in scRNA-seq analysis to discover groups of cells with similar gene expression patterns and identify potential cell types. Integration of multiple scRNA-seq datasets is a pressing challenge, and in this direction, a novel model is developed to extend clustering methods to appropriately combine inference across multiple datasets. The model simultaneously addresses normalization to deal with the inherent noise and uncertainty in scRNA-seq, infers cell types, and integrates multiple datasets for shared clustering in principled manner through a hierarchical Bayesian framework. A Gibbs sampler is developed that copes with the high-dimensionality of scRNA-seq through consensus clustering. The methodological developments are driven by experimental data from embryonic cells, with the aim of understanding the role of PAX6 in prenatal development, and more specifically how cell-subtypes and their proportions change when knocking out this factor.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.10
自引率
10.50%
发文量
84
期刊介绍: Econometrics and Statistics is the official journal of the networks Computational and Financial Econometrics and Computational and Methodological Statistics. It publishes research papers in all aspects of econometrics and statistics and comprises of the two sections Part A: Econometrics and Part B: Statistics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信