Joshua Cook, Eric Ron, Dmitri Svetlov, Luis Aguiulera, Brian Munsky
{"title":"Sequential design of single-cell experiments to identify discrete stochastic models for gene expression.","authors":"Joshua Cook, Eric Ron, Dmitri Svetlov, Luis Aguiulera, Brian Munsky","doi":"10.1101/2024.09.12.612709","DOIUrl":null,"url":null,"abstract":"Control of gene regulation requires quantitatively accurate predictions of heterogeneous cellular responses. When inferred from single-cell experiments, discrete stochastic models can enable such predictions, but such experiments are highly adjustable, allowing for almost infinitely many potential designs (e.g., at different induction levels, for different measurement times, or considering different observed biological species). Not all experiments are equally informative, experiments are time-consuming or expensive to perform, and research begins with limited prior information with which to construct models. To address these concerns, we developed a sequential experiment design strategy that starts with simple preliminary experiments and then integrates chemical master equations to compute the likelihood of single-cell data, a Bayesian inference procedure to sample posterior parameter distributions, and a finite state projection based Fisher information matrix to estimate the expected information for different designs for subsequent experiments. Using simulated then real single-cell data, we determined practical working principles to reduce the overall number of experiments needed to achieve predictive, quantitative understanding of single-cell responses.","PeriodicalId":501213,"journal":{"name":"bioRxiv - Systems Biology","volume":"16 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv - Systems Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.09.12.612709","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Control of gene regulation requires quantitatively accurate predictions of heterogeneous cellular responses. When inferred from single-cell experiments, discrete stochastic models can enable such predictions, but such experiments are highly adjustable, allowing for almost infinitely many potential designs (e.g., at different induction levels, for different measurement times, or considering different observed biological species). Not all experiments are equally informative, experiments are time-consuming or expensive to perform, and research begins with limited prior information with which to construct models. To address these concerns, we developed a sequential experiment design strategy that starts with simple preliminary experiments and then integrates chemical master equations to compute the likelihood of single-cell data, a Bayesian inference procedure to sample posterior parameter distributions, and a finite state projection based Fisher information matrix to estimate the expected information for different designs for subsequent experiments. Using simulated then real single-cell data, we determined practical working principles to reduce the overall number of experiments needed to achieve predictive, quantitative understanding of single-cell responses.