Cleanet: Robust Doublet Detection in Cytometry Data Based on Protein Expression Patterns.

IF 2.1 4区 生物学 Q3 BIOCHEMICAL RESEARCH METHODS
Matei Ionita, Michelle L McKeague, Mark M Painter, Divij Mathew, Ajinkya Pattekar, Ayman Rezk, Shwetank, Damian Maseda, E John Wherry, Allison R Greenplate
{"title":"Cleanet: Robust Doublet Detection in Cytometry Data Based on Protein Expression Patterns.","authors":"Matei Ionita, Michelle L McKeague, Mark M Painter, Divij Mathew, Ajinkya Pattekar, Ayman Rezk, Shwetank, Damian Maseda, E John Wherry, Allison R Greenplate","doi":"10.1002/cyto.a.24961","DOIUrl":null,"url":null,"abstract":"<p><p>Flow and mass cytometry experiments are essential for profiling immune cells at single-cell resolution. Better understanding of human immunology increasingly involves analyzing studies at the scale of hundreds or thousands of samples, with data analysis a significant bottleneck. This trend increases the demand for automated analysis methods. In particular, a common preprocessing step in cytometry data analysis is distinguishing single cells from doublets (or multiplets), events in which two (or more) cells pass simultaneously through the detector. Typically, doublets are identified on two-dimensional density plots, using their high measured values for DNA intercalators (mass cytometry) or scattering channels (flow cytometry). Despite its popularity, this bivariate gating method is sometimes imprecise: for example, we show that bivariate gating of mass cytometry data can mistake single eosinophils for doublets, due to their high DNA content. Taking inspiration from methods already used in single-cell transcriptomics, but not in the cytometry community, we propose an alternative approach. Our method, called Cleanet, first simulates doublet events, then identifies true events with protein expression similar to the simulated doublets. This simple method is completely automated and detects both homotypic and heterotypic doublets. We validate it in datasets acquired with mass and flow cytometry; moreover, we verify with imaging flow cytometry data from ImageStream and Discover A8 instruments that most events predicted to be doublets truly consist of multiple cells. Cleanet can also classify doublets based on their component cell types, which potentially enables the study of cell-cell interactions, mining extra information out of doublet events that would otherwise be discarded. As a proof of concept, we demonstrate that Cleanet can detect a treatment-specific increase in interactions between two cell lines. By automating doublet detection and classification, we aim to streamline the data analysis in large cytometry studies and provide a more accurate picture of both immune cell populations and cell-cell interactions.</p>","PeriodicalId":11068,"journal":{"name":"Cytometry Part A","volume":" ","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cytometry Part A","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/cyto.a.24961","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Flow and mass cytometry experiments are essential for profiling immune cells at single-cell resolution. Better understanding of human immunology increasingly involves analyzing studies at the scale of hundreds or thousands of samples, with data analysis a significant bottleneck. This trend increases the demand for automated analysis methods. In particular, a common preprocessing step in cytometry data analysis is distinguishing single cells from doublets (or multiplets), events in which two (or more) cells pass simultaneously through the detector. Typically, doublets are identified on two-dimensional density plots, using their high measured values for DNA intercalators (mass cytometry) or scattering channels (flow cytometry). Despite its popularity, this bivariate gating method is sometimes imprecise: for example, we show that bivariate gating of mass cytometry data can mistake single eosinophils for doublets, due to their high DNA content. Taking inspiration from methods already used in single-cell transcriptomics, but not in the cytometry community, we propose an alternative approach. Our method, called Cleanet, first simulates doublet events, then identifies true events with protein expression similar to the simulated doublets. This simple method is completely automated and detects both homotypic and heterotypic doublets. We validate it in datasets acquired with mass and flow cytometry; moreover, we verify with imaging flow cytometry data from ImageStream and Discover A8 instruments that most events predicted to be doublets truly consist of multiple cells. Cleanet can also classify doublets based on their component cell types, which potentially enables the study of cell-cell interactions, mining extra information out of doublet events that would otherwise be discarded. As a proof of concept, we demonstrate that Cleanet can detect a treatment-specific increase in interactions between two cell lines. By automating doublet detection and classification, we aim to streamline the data analysis in large cytometry studies and provide a more accurate picture of both immune cell populations and cell-cell interactions.

Cleanet:基于蛋白质表达模式的细胞计数数据的鲁棒双偶检测。
流式和质量细胞术实验是必不可少的分析免疫细胞在单细胞分辨率。为了更好地了解人类免疫学,越来越多地涉及到分析数百或数千个样本的研究,而数据分析是一个重大的瓶颈。这种趋势增加了对自动化分析方法的需求。特别是,在细胞术数据分析中,一个常见的预处理步骤是区分单细胞和双细胞(或多胞胎),两个(或更多)细胞同时通过检测器的事件。通常,利用DNA插入器(质量细胞术)或散射通道(流式细胞术)的高测量值,在二维密度图上识别双重态。尽管它很受欢迎,但这种二元门控方法有时是不精确的:例如,我们表明,由于大量细胞计数数据的二元门控可能会将单个嗜酸性粒细胞误认为双分子,因为它们的DNA含量很高。从已经在单细胞转录组学中使用的方法中获得灵感,但在细胞术社区中没有,我们提出了一种替代方法。我们的方法,称为Cleanet,首先模拟双偶事件,然后识别与模拟双偶相似的蛋白质表达的真实事件。这种简单的方法是完全自动化的,可以检测同型和异型双态。我们在质量和流式细胞术获得的数据集中验证了它;此外,我们用ImageStream和Discover A8仪器的成像流式细胞术数据验证了大多数预测为双重事件的事件确实由多个细胞组成。Cleanet还可以根据它们组成的细胞类型对双联体进行分类,这有可能使研究细胞间相互作用成为可能,从双联体事件中挖掘出额外的信息,否则这些信息将被丢弃。作为概念的证明,我们证明Cleanet可以检测到两个细胞系之间相互作用的治疗特异性增加。通过自动化双重检测和分类,我们的目标是简化大型细胞术研究中的数据分析,并提供更准确的免疫细胞群和细胞间相互作用的图像。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cytometry Part A
Cytometry Part A 生物-生化研究方法
CiteScore
8.10
自引率
13.50%
发文量
183
审稿时长
4-8 weeks
期刊介绍: Cytometry Part A, the journal of quantitative single-cell analysis, features original research reports and reviews of innovative scientific studies employing quantitative single-cell measurement, separation, manipulation, and modeling techniques, as well as original articles on mechanisms of molecular and cellular functions obtained by cytometry techniques. The journal welcomes submissions from multiple research fields that fully embrace the study of the cytome: Biomedical Instrumentation Engineering Biophotonics Bioinformatics Cell Biology Computational Biology Data Science Immunology Parasitology Microbiology Neuroscience Cancer Stem Cells Tissue Regeneration.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信