Emulating computer models with high-dimensional count output.

IF 4.3 3区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
James M Salter, Trevelyan J McKinley, Xiaoyu Xiong, Daniel B Williamson
{"title":"Emulating computer models with high-dimensional count output.","authors":"James M Salter, Trevelyan J McKinley, Xiaoyu Xiong, Daniel B Williamson","doi":"10.1098/rsta.2024.0216","DOIUrl":null,"url":null,"abstract":"<p><p>Computer models are used to study the real world, and often contain a large number of uncertain input parameters, produce a large number of outputs, may be expensive to run and need calibrating to real-world observations to be useful for decision-making. Emulators are often used as cheap surrogates for the expensive simulator, trained on a small number of simulations to provide predictions with uncertainty at unseen inputs. In epidemiological applications, for example compartmental or agent-based models for modelling the spread of infectious diseases, the output is usually spatially and temporally indexed, stochastic and consists of counts rather than continuous variables. Here, we consider emulating high-dimensional count output from a complex computer model using a Poisson lognormal PCA (PLNPCA) emulator. We apply the PLNPCA emulator to output fields from a COVID-19 model for England and Wales and compare this to fitting emulators to aggregations of the full output. We show that performance is generally comparable, while the PLNPCA emulator inherits desirable properties, including allowing the full output to be predicted while capturing correlations between outputs, providing high-dimensional samples of counts that are representative of the true model output.This article is part of the theme issue 'Uncertainty quantification for healthcare and biological systems (Part 1)'.</p>","PeriodicalId":19879,"journal":{"name":"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences","volume":"383 2292","pages":"20240216"},"PeriodicalIF":4.3000,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11904617/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1098/rsta.2024.0216","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Computer models are used to study the real world, and often contain a large number of uncertain input parameters, produce a large number of outputs, may be expensive to run and need calibrating to real-world observations to be useful for decision-making. Emulators are often used as cheap surrogates for the expensive simulator, trained on a small number of simulations to provide predictions with uncertainty at unseen inputs. In epidemiological applications, for example compartmental or agent-based models for modelling the spread of infectious diseases, the output is usually spatially and temporally indexed, stochastic and consists of counts rather than continuous variables. Here, we consider emulating high-dimensional count output from a complex computer model using a Poisson lognormal PCA (PLNPCA) emulator. We apply the PLNPCA emulator to output fields from a COVID-19 model for England and Wales and compare this to fitting emulators to aggregations of the full output. We show that performance is generally comparable, while the PLNPCA emulator inherits desirable properties, including allowing the full output to be predicted while capturing correlations between outputs, providing high-dimensional samples of counts that are representative of the true model output.This article is part of the theme issue 'Uncertainty quantification for healthcare and biological systems (Part 1)'.

模拟具有高维计数输出的计算机模型。
计算机模型用于研究现实世界,通常包含大量不确定的输入参数,产生大量的输出,运行起来可能很昂贵,需要根据现实世界的观察结果进行校准,才能对决策有用。模拟器通常被用作昂贵模拟器的廉价替代品,在少量的模拟上进行训练,以在未见输入的不确定性下提供预测。在流行病学应用中,例如用于模拟传染病传播的分区或基于主体的模型,其输出通常是空间和时间索引,是随机的,由计数而不是连续变量组成。在这里,我们考虑使用泊松对数正态PCA (PLNPCA)模拟器模拟复杂计算机模型的高维计数输出。我们将PLNPCA仿真器应用于英格兰和威尔士COVID-19模型的输出字段,并将其与拟合仿真器与完整输出的聚合进行比较。我们表明,性能通常是可比较的,而PLNPCA模拟器继承了理想的属性,包括允许在捕获输出之间的相关性的同时预测完整的输出,提供代表真实模型输出的高维计数样本。本文是主题问题“医疗保健和生物系统的不确定性量化(第1部分)”的一部分。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
9.30
自引率
2.00%
发文量
367
审稿时长
3 months
期刊介绍: Continuing its long history of influential scientific publishing, Philosophical Transactions A publishes high-quality theme issues on topics of current importance and general interest within the physical, mathematical and engineering sciences, guest-edited by leading authorities and comprising new research, reviews and opinions from prominent researchers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信