Evaluating the impact of misspecified spatial neighboring structures in Bayesian CAR models

Impact factor: 2.7 · JCR Q2 (Multidisciplinary Sciences)
Ernest Somua-Wiafe, Richard Minkah, Kwabena Doku-Amponsah, Louis Asiedu, Edward Acheampong, Samuel Iddi
Scientific African, volume 27, Article e02498. Published 2024-12-19.
DOI: 10.1016/j.sciaf.2024.e02498
https://www.sciencedirect.com/science/article/pii/S246822762400440X

Abstract

Spatial neighboring graphs play a crucial role in accounting for global spatial dependency, particularly in spatial models that use the Conditional Autoregressive (CAR) covariance structure. The Bayesian modified Besag–York–Mollié (BYM2) model, a member of the CAR family, introduces a precision parameter that quantifies the variability not captured by the fixed risk components and a mixing parameter that gives the proportion of the random-effect variance attributable to the spatial component rather than to aspatial random noise. Despite the advantages these extra features bring, misspecification of BYM2 model components is common, and its effects are not well understood. Previous studies often avoid simulations because of their computational demands, relying instead on performance metrics for inference and model comparison using empirical data.
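For orientation, the precision and mixing parameters described above are usually written as follows. The abstract gives no formula, so the notation here (τ for the precision, φ for the mixing parameter, v and u* for the unstructured and scaled structured effects) is an assumption based on the standard BYM2 parameterization, not a transcription of the paper.

```latex
% Linear predictor for area i: fixed effects plus a combined random effect b_i
\eta_i = \mathbf{x}_i^{\top}\boldsymbol{\beta} + b_i ,
\qquad
\mathbf{b} = \frac{1}{\sqrt{\tau}}
  \left( \sqrt{1-\phi}\,\mathbf{v} + \sqrt{\phi}\,\mathbf{u}^{*} \right),
\qquad 0 \le \phi \le 1 .

% v is i.i.d. standard normal noise; u^* is an intrinsic CAR effect built from
% the neighborhood graph and scaled so that its typical marginal variance is 1.
\mathbf{v} \sim \mathcal{N}(\mathbf{0}, \mathbf{I}),
\qquad
\operatorname{Var}(\mathbf{b} \mid \tau, \phi)
  = \tau^{-1}\left( (1-\phi)\,\mathbf{I} + \phi\,\mathbf{Q}_{*}^{-} \right).
```

Here Q_*^- denotes the generalized inverse of the scaled structure matrix. The neighborhood graph enters only through Q_*, which is why a misspecified graph would be expected to act on φ and τ rather than directly on β.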
This study uses comprehensive simulations to examine the impact of erroneously specified spatial neighborhood structures on the BYM2 model. We considered three neighborhood structures: a first-order adjacency-based structure and two minimum distance-based structures with threshold distances of 70 km and 140 km, at various sparsity levels. For each structure, we simulated data under that structure and then analyzed the data using the remaining two structures as misspecified cases to evaluate their impact on model fit. Fixed penalized complexity (PC) prior settings were applied to control for prior specification effects when examining bias and mean squared error (MSE). The study was further validated through practical analyses of road crash incidents in Ghana and lip cancer case data from Scotland, UK.
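The distance-based structures can be illustrated with a minimal sketch. It assumes that two areas are neighbors when their centroid distance is at most the threshold; the function names, the centroid coordinates, and the sparsity definition are hypothetical choices for illustration, not taken from the paper.

```python
import numpy as np


def distance_band_neighbors(coords_km, threshold_km):
    """Binary neighbor matrix: areas i and j are neighbors when their
    centroid distance is positive and at most threshold_km."""
    coords = np.asarray(coords_km, dtype=float)
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    return ((dist > 0) & (dist <= threshold_km)).astype(int)


def sparsity(W):
    """Fraction of off-diagonal entries of W that are zero."""
    n = W.shape[0]
    return 1.0 - W.sum() / (n * (n - 1))


# Hypothetical centroids in km (made up for illustration only).
centroids = np.array([[0.0, 0.0], [50.0, 0.0], [100.0, 0.0], [200.0, 0.0]])

W70 = distance_band_neighbors(centroids, 70.0)    # tighter graph
W140 = distance_band_neighbors(centroids, 140.0)  # denser graph

print("links at 70 km:", W70.sum() // 2, "sparsity:", round(sparsity(W70), 3))
print("links at 140 km:", W140.sum() // 2, "sparsity:", round(sparsity(W140), 3))
```

In this toy layout the 70 km graph is nested inside the 140 km graph, and the last area has no neighbors at 70 km. Isolated areas of this kind are one practical reason the choice of threshold matters.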
Our findings reveal that incorrect specification of the neighboring structure does not significantly impact the fixed effects. However, it affects the estimates of the mixing parameter and the precision term, and thus the spatial component. In cases of high spatial dependency and misspecified neighborhood structures, the BYM2 model tends to underestimate the mixing parameter. Under-specifying the neighborhood structure results in underestimated hyperparameter values, while over-specifying it leads to an overfitted spatial smooth. The empirical application results, which were consistent with the simulation, also emphasized the critical importance of accurately specifying spatial structures in BYM2 models. Relying solely on metrics such as the Watanabe-Akaike Information Criterion (WAIC), Deviance Information Criterion (DIC), and Conditional Predictive Ordinate (CPO) estimates to determine an optimal spatial structure can be misleading. Instead, the Moran’s Index (MI) statistic is more reliable for identifying the most suitable neighborhood structure.
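As a reminder of what the Moran's Index measures, the sketch below computes the global statistic I = (n / S0) · (zᵀWz) / (zᵀz) for a given neighbor matrix W, where z holds the mean-centered values and S0 is the sum of all weights. The abstract does not say how the authors applied the statistic (for example, to raw rates or to residuals), so the function name and the toy data are illustrative assumptions only.

```python
import numpy as np


def morans_i(values, W):
    """Global Moran's I for `values` under the neighbor weights `W`."""
    x = np.asarray(values, dtype=float)
    W = np.asarray(W, dtype=float)
    z = x - x.mean()       # deviations from the mean
    s0 = W.sum()           # total weight over all ordered pairs
    denom = z @ z
    if s0 == 0 or denom == 0:
        raise ValueError("Moran's I is undefined for an empty graph or constant values")
    return (len(x) / s0) * (z @ W @ z) / denom


# Toy chain graph 0-1-2-3 (hypothetical, for illustration only).
W_chain = np.array([
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
])

smooth = [1.0, 2.0, 3.0, 4.0]          # neighbors have similar values
alternating = [1.0, -1.0, 1.0, -1.0]   # neighbors have opposite values

print("smooth trend:", morans_i(smooth, W_chain))
print("alternating: ", morans_i(alternating, W_chain))
```

Positive values indicate that neighboring areas under the chosen graph tend to be alike, and negative values that they tend to differ. Computing the statistic for the same variable under each candidate graph is one way to compare structures.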
Source journal
Scientific African (Multidisciplinary)
CiteScore: 5.60
Self-citation rate: 3.40%
Articles published: 332
Review time: 10 weeks