{"title":"Experimental design for system identification of Boolean Control Networks in biology","authors":"A. Busetto, J. Lygeros","doi":"10.1109/CDC.2014.7040282","DOIUrl":null,"url":null,"abstract":"This study is primarily motivated by biological applications and focuses on the identification of Boolean networks from scarce and noisy data. We consider two Bayesian experimental design scenarios: selection of the observations under a budget, and input design. The goal is to maximize the mutual information between models and data, that is the ultimate statistical upper bound on the identifiability of a system from empirical data. First, we introduce a method to select which components of the state variable to measure under a budget constraint, and at which time points. Our greedy approach takes advantage of the submodularity of the mutual information, and hence requires only a polynomial number of evaluations of the objective to achieve near-optimal designs. Second, we consider the computationally harder task of designing sequences of input interventions, and propose a likelihood-free approximation method. Exact and approximate design solutions are verified with predictive models of genetic regulatory interaction networks in embryonic development.","PeriodicalId":202708,"journal":{"name":"53rd IEEE Conference on Decision and Control","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"53rd IEEE Conference on Decision and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CDC.2014.7040282","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
This study is primarily motivated by biological applications and focuses on the identification of Boolean networks from scarce and noisy data. We consider two Bayesian experimental design scenarios: selection of the observations under a budget, and input design. The goal is to maximize the mutual information between models and data, that is the ultimate statistical upper bound on the identifiability of a system from empirical data. First, we introduce a method to select which components of the state variable to measure under a budget constraint, and at which time points. Our greedy approach takes advantage of the submodularity of the mutual information, and hence requires only a polynomial number of evaluations of the objective to achieve near-optimal designs. Second, we consider the computationally harder task of designing sequences of input interventions, and propose a likelihood-free approximation method. Exact and approximate design solutions are verified with predictive models of genetic regulatory interaction networks in embryonic development.