Jaesik Kim, Matei Ionita, Matthew Lee, Michelle L McKeague, Ajinkya Pattekar, Mark M Painter, Joost Wagenaar, Van Truong, Dylan T Norton, Divij Mathew, Yonghyun Nam, Sokratis A Apostolidis, Cynthia Clendenin, Patryk Orzechowski, Sang-Hyuk Jung, Jakob Woerner, Caroline A G Ittner, Alexandra P Turner, Mika Esperanza, Thomas G Dunn, Nilam S Mangalmurti, John P Reilly, Nuala J Meyer, Carolyn S Calfee, Kathleen D Liu, Michael A Matthy, Lamorna Brown Swigart, Ellen L Burnham, Jeffrey McKeehan, Sheetal Gandotra, Derek W Russel, Kevin W Gibbs, Karl W Thomas, Harsh Barot, Allison R Greenplate, E John Wherry, Dokyoon Kim
{"title":"Cytometry masked autoencoder: An accurate and interpretable automated immunophenotyper.","authors":"Jaesik Kim, Matei Ionita, Matthew Lee, Michelle L McKeague, Ajinkya Pattekar, Mark M Painter, Joost Wagenaar, Van Truong, Dylan T Norton, Divij Mathew, Yonghyun Nam, Sokratis A Apostolidis, Cynthia Clendenin, Patryk Orzechowski, Sang-Hyuk Jung, Jakob Woerner, Caroline A G Ittner, Alexandra P Turner, Mika Esperanza, Thomas G Dunn, Nilam S Mangalmurti, John P Reilly, Nuala J Meyer, Carolyn S Calfee, Kathleen D Liu, Michael A Matthy, Lamorna Brown Swigart, Ellen L Burnham, Jeffrey McKeehan, Sheetal Gandotra, Derek W Russel, Kevin W Gibbs, Karl W Thomas, Harsh Barot, Allison R Greenplate, E John Wherry, Dokyoon Kim","doi":"10.1016/j.xcrm.2024.101808","DOIUrl":null,"url":null,"abstract":"<p><p>Single-cell cytometry data are crucial for understanding the role of the immune system in diseases and responses to treatment. However, traditional methods for annotating cytometry data face challenges in scalability, robustness, and accuracy. We propose a cytometry masked autoencoder (cyMAE), which automates immunophenotyping tasks including cell type annotation. The model upholds user-defined cell type definitions, facilitating interpretability and cross-study comparisons. The training of cyMAE has a self-supervised phase, which leverages large amounts of unlabeled data, followed by fine-tuning on specialized tasks using smaller amounts of annotated data. The cost of training a new model is amortized over repeated inferences on new datasets using the same panel. Through validation across multiple studies using the same panel, we demonstrate that cyMAE delivers accurate and interpretable cellular immunophenotyping and improves the prediction of subject-level metadata. This proof of concept marks a significant step forward for large-scale immunology studies.</p>","PeriodicalId":9822,"journal":{"name":"Cell Reports Medicine","volume":null,"pages":null},"PeriodicalIF":11.7000,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cell Reports Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.xcrm.2024.101808","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Single-cell cytometry data are crucial for understanding the role of the immune system in diseases and responses to treatment. However, traditional methods for annotating cytometry data face challenges in scalability, robustness, and accuracy. We propose a cytometry masked autoencoder (cyMAE), which automates immunophenotyping tasks including cell type annotation. The model upholds user-defined cell type definitions, facilitating interpretability and cross-study comparisons. The training of cyMAE has a self-supervised phase, which leverages large amounts of unlabeled data, followed by fine-tuning on specialized tasks using smaller amounts of annotated data. The cost of training a new model is amortized over repeated inferences on new datasets using the same panel. Through validation across multiple studies using the same panel, we demonstrate that cyMAE delivers accurate and interpretable cellular immunophenotyping and improves the prediction of subject-level metadata. This proof of concept marks a significant step forward for large-scale immunology studies.
Cell Reports MedicineBiochemistry, Genetics and Molecular Biology-Biochemistry, Genetics and Molecular Biology (all)
CiteScore
15.00
自引率
1.40%
发文量
231
审稿时长
40 days
期刊介绍:
Cell Reports Medicine is an esteemed open-access journal by Cell Press that publishes groundbreaking research in translational and clinical biomedical sciences, influencing human health and medicine.
Our journal ensures wide visibility and accessibility, reaching scientists and clinicians across various medical disciplines. We publish original research that spans from intriguing human biology concepts to all aspects of clinical work. We encourage submissions that introduce innovative ideas, forging new paths in clinical research and practice. We also welcome studies that provide vital information, enhancing our understanding of current standards of care in diagnosis, treatment, and prognosis. This encompasses translational studies, clinical trials (including long-term follow-ups), genomics, biomarker discovery, and technological advancements that contribute to diagnostics, treatment, and healthcare. Additionally, studies based on vertebrate model organisms are within the scope of the journal, as long as they directly relate to human health and disease.