Louise M Kimpton, Michael Dunne, James M Salter, Peter Challenor
{"title":"基于交叉验证的随机模型序贯设计。","authors":"Louise M Kimpton, Michael Dunne, James M Salter, Peter Challenor","doi":"10.1098/rsta.2024.0217","DOIUrl":null,"url":null,"abstract":"<p><p>Complex numerical models are increasingly being used in healthcare and epidemiology. To represent the complex features, modellers often make the decision to include stochastic behaviour where repeated runs of the model with identical inputs produce different outputs. When computational constraints limit the number of model runs and replications, heteroscedastic Gaussian processes can be used as a fast surrogate, allowing for efficient emulation of varying noise levels across the input space. The accuracy of any emulator is greatly dependent on the design of the training data, where sequential design algorithms increase the number of design points iteratively based on predefined criteria. For stochastic models, the design problem is more challenging due to the possibility of replicates at design points. This article develops a new sequential design method for stochastic models which scales well in high-dimensional input spaces. We build upon an existing method for deterministic models using an expected squared leave-one-out error criterion that balances exploration and replication. We compare our approach with existing sequential design methods as well as applying it to an agent-based model and a COVID-19 model. Results demonstrate that the proposed method performs well in noisy environments, offering a scalable alternative to existing methods.This article is part of the theme issue 'Uncertainty quantification for healthcare and biological systems (Part 2)'.</p>","PeriodicalId":19879,"journal":{"name":"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences","volume":"383 2293","pages":"20240217"},"PeriodicalIF":4.3000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Cross-validation-based sequential design for stochastic models.\",\"authors\":\"Louise M Kimpton, Michael Dunne, James M Salter, Peter Challenor\",\"doi\":\"10.1098/rsta.2024.0217\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Complex numerical models are increasingly being used in healthcare and epidemiology. To represent the complex features, modellers often make the decision to include stochastic behaviour where repeated runs of the model with identical inputs produce different outputs. When computational constraints limit the number of model runs and replications, heteroscedastic Gaussian processes can be used as a fast surrogate, allowing for efficient emulation of varying noise levels across the input space. The accuracy of any emulator is greatly dependent on the design of the training data, where sequential design algorithms increase the number of design points iteratively based on predefined criteria. For stochastic models, the design problem is more challenging due to the possibility of replicates at design points. This article develops a new sequential design method for stochastic models which scales well in high-dimensional input spaces. We build upon an existing method for deterministic models using an expected squared leave-one-out error criterion that balances exploration and replication. We compare our approach with existing sequential design methods as well as applying it to an agent-based model and a COVID-19 model. Results demonstrate that the proposed method performs well in noisy environments, offering a scalable alternative to existing methods.This article is part of the theme issue 'Uncertainty quantification for healthcare and biological systems (Part 2)'.</p>\",\"PeriodicalId\":19879,\"journal\":{\"name\":\"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences\",\"volume\":\"383 2293\",\"pages\":\"20240217\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-04-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1098/rsta.2024.0217\",\"RegionNum\":3,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1098/rsta.2024.0217","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Cross-validation-based sequential design for stochastic models.
Complex numerical models are increasingly being used in healthcare and epidemiology. To represent the complex features, modellers often make the decision to include stochastic behaviour where repeated runs of the model with identical inputs produce different outputs. When computational constraints limit the number of model runs and replications, heteroscedastic Gaussian processes can be used as a fast surrogate, allowing for efficient emulation of varying noise levels across the input space. The accuracy of any emulator is greatly dependent on the design of the training data, where sequential design algorithms increase the number of design points iteratively based on predefined criteria. For stochastic models, the design problem is more challenging due to the possibility of replicates at design points. This article develops a new sequential design method for stochastic models which scales well in high-dimensional input spaces. We build upon an existing method for deterministic models using an expected squared leave-one-out error criterion that balances exploration and replication. We compare our approach with existing sequential design methods as well as applying it to an agent-based model and a COVID-19 model. Results demonstrate that the proposed method performs well in noisy environments, offering a scalable alternative to existing methods.This article is part of the theme issue 'Uncertainty quantification for healthcare and biological systems (Part 2)'.
期刊介绍:
Continuing its long history of influential scientific publishing, Philosophical Transactions A publishes high-quality theme issues on topics of current importance and general interest within the physical, mathematical and engineering sciences, guest-edited by leading authorities and comprising new research, reviews and opinions from prominent researchers.