Seungheun Baek, Soyon Park, Yan Ting Chok, Junhyun Lee, Jueon Park, Mogan Gim, Jaewoo Kang
{"title":"CRADLE-VAE: Enhancing Single-Cell Gene Perturbation Modeling with Counterfactual Reasoning-based Artifact Disentanglement","authors":"Seungheun Baek, Soyon Park, Yan Ting Chok, Junhyun Lee, Jueon Park, Mogan Gim, Jaewoo Kang","doi":"arxiv-2409.05484","DOIUrl":null,"url":null,"abstract":"Predicting cellular responses to various perturbations is a critical focus in\ndrug discovery and personalized therapeutics, with deep learning models playing\na significant role in this endeavor. Single-cell datasets contain technical\nartifacts that may hinder the predictability of such models, which poses\nquality control issues highly regarded in this area. To address this, we\npropose CRADLE-VAE, a causal generative framework tailored for single-cell gene\nperturbation modeling, enhanced with counterfactual reasoning-based artifact\ndisentanglement. Throughout training, CRADLE-VAE models the underlying latent\ndistribution of technical artifacts and perturbation effects present in\nsingle-cell datasets. It employs counterfactual reasoning to effectively\ndisentangle such artifacts by modulating the latent basal spaces and learns\nrobust features for generating cellular response data with improved quality.\nExperimental results demonstrate that this approach improves not only treatment\neffect estimation performance but also generative quality as well. The\nCRADLE-VAE codebase is publicly available at\nhttps://github.com/dmis-lab/CRADLE-VAE.","PeriodicalId":501266,"journal":{"name":"arXiv - QuanBio - Quantitative Methods","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Quantitative Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.05484","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Predicting cellular responses to various perturbations is a critical focus in
drug discovery and personalized therapeutics, with deep learning models playing
a significant role in this endeavor. Single-cell datasets contain technical
artifacts that may hinder the predictability of such models, which poses
quality control issues highly regarded in this area. To address this, we
propose CRADLE-VAE, a causal generative framework tailored for single-cell gene
perturbation modeling, enhanced with counterfactual reasoning-based artifact
disentanglement. Throughout training, CRADLE-VAE models the underlying latent
distribution of technical artifacts and perturbation effects present in
single-cell datasets. It employs counterfactual reasoning to effectively
disentangle such artifacts by modulating the latent basal spaces and learns
robust features for generating cellular response data with improved quality.
Experimental results demonstrate that this approach improves not only treatment
effect estimation performance but also generative quality as well. The
CRADLE-VAE codebase is publicly available at
https://github.com/dmis-lab/CRADLE-VAE.