scMODD: A model-driven algorithm for doublet identification in single-cell RNA-sequencing data

Xinye Zhao, Alexander Du, Peng-Chao Qiu
{"title":"scMODD: A model-driven algorithm for doublet identification in single-cell RNA-sequencing data","authors":"Xinye Zhao, Alexander Du, Peng-Chao Qiu","doi":"10.3389/fsysb.2022.1082309","DOIUrl":null,"url":null,"abstract":"Single-cell RNA sequencing (scRNA-seq) data often contain doublets, where a doublet manifests as 1 cell barcode that corresponds to combined gene expression of two or more cells. Existence of doublets can lead to spurious biological interpretations. Here, we present single-cell MOdel-driven Doublet Detection (scMODD), a model-driven algorithm to detect doublets in scRNA-seq data. ScMODD achieved similar performance compared to existing doublet detection algorithms which are primarily data-driven, showing the promise of model-driven approach for doublet detection. When implementing scMODD in simulated and real scRNA-seq data, we tested both the negative binomial (NB) model and the zero-inflated negative binomial (ZINB) model to serve as the underlying statistical model for scRNA-seq count data, and observed that incorporating zero inflation did not improve detection performance, suggesting that consideration of zero inflation is not necessary in the context of doublet detection in scRNA-seq.","PeriodicalId":73109,"journal":{"name":"Frontiers in systems biology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in systems biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fsysb.2022.1082309","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Single-cell RNA sequencing (scRNA-seq) data often contain doublets, where a doublet manifests as 1 cell barcode that corresponds to combined gene expression of two or more cells. Existence of doublets can lead to spurious biological interpretations. Here, we present single-cell MOdel-driven Doublet Detection (scMODD), a model-driven algorithm to detect doublets in scRNA-seq data. ScMODD achieved similar performance compared to existing doublet detection algorithms which are primarily data-driven, showing the promise of model-driven approach for doublet detection. When implementing scMODD in simulated and real scRNA-seq data, we tested both the negative binomial (NB) model and the zero-inflated negative binomial (ZINB) model to serve as the underlying statistical model for scRNA-seq count data, and observed that incorporating zero inflation did not improve detection performance, suggesting that consideration of zero inflation is not necessary in the context of doublet detection in scRNA-seq.
scMODD:一种模型驱动的单细胞rna测序数据双链识别算法
单细胞RNA测序(scRNA-seq)数据通常包含双序列,其中双序列表现为1个细胞条形码,对应于两个或多个细胞的组合基因表达。双重基因的存在可能导致虚假的生物学解释。在这里,我们提出了单细胞MOdel驱动的双联检测(scMODD),这是一种检测scRNA-seq数据中双联的模型驱动算法。与主要由数据驱动的现有双峰检测算法相比,ScMODD实现了类似的性能,表明了模型驱动方法用于双峰检测的前景。当在模拟和真实的scRNA-seq数据中实现scMODD时,我们测试了负二项(NB)模型和零膨胀负二项模型作为scRNA-seq计数数据的基础统计模型,并观察到结合零膨胀并不能提高检测性能,这表明在scRNA-seq中的双位点检测的情况下没有必要考虑零膨胀。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信