利用地理加权随机森林了解疟疾发病率与环境风险因素之间关系的空间非平稳性:以卢旺达为例

IF 1 4区 医学 Q4 HEALTH CARE SCIENCES & SERVICES
Gilbert Nduwayezu, Pengxiang Zhao, Clarisse Kagoyire, Lina Eklund, Jean Pierre Bizimana, Petter Pilesjo, Ali Mansourian
{"title":"利用地理加权随机森林了解疟疾发病率与环境风险因素之间关系的空间非平稳性:以卢旺达为例","authors":"Gilbert Nduwayezu,&nbsp;Pengxiang Zhao,&nbsp;Clarisse Kagoyire,&nbsp;Lina Eklund,&nbsp;Jean Pierre Bizimana,&nbsp;Petter Pilesjo,&nbsp;Ali Mansourian","doi":"10.4081/gh.2023.1184","DOIUrl":null,"url":null,"abstract":"<p><p>As found in the health studies literature, the levels of climate association between epidemiological diseases have been found to vary across regions. Therefore, it seems reasonable to allow for the possibility that relationships might vary spatially within regions. We implemented the geographically weighted random forest (GWRF) machine learning method to analyze ecological disease patterns caused by spatially non-stationary processes using a malaria incidence dataset for Rwanda. We first compared the geographically weighted regression (WGR), the global random forest (GRF), and the geographically weighted random forest (GWRF) to examine the spatial non-stationarity in the non-linear relationships between malaria incidence and their risk factors. We used the Gaussian areal kriging model to disaggregate the malaria incidence at the local administrative cell level to understand the relationships at a fine scale since the model goodness of fit was not satisfactory to explain malaria incidence due to the limited number of sample values. Our results show that in terms of the coefficients of determination and prediction accuracy, the geographical random forest model performs better than the GWR and the global random forest model. The coefficients of determination of the geographically weighted regression (R2), the global RF (R2), and the GWRF (R2) were 4.74, 0.76, and 0.79, respectively. The GWRF algorithm achieves the best result and reveals that risk factors (rainfall, land surface temperature, elevation, and air temperature) have a strong non-linear relationship with the spatial distribution of malaria incidence rates, which could have implications for supporting local initiatives for malaria elimination in Rwanda.</p>","PeriodicalId":56260,"journal":{"name":"Geospatial Health","volume":"18 1","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2023-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Understanding the spatial non-stationarity in the relationships between malaria incidence and environmental risk factors using Geographically Weighted Random Forest: A case study in Rwanda.\",\"authors\":\"Gilbert Nduwayezu,&nbsp;Pengxiang Zhao,&nbsp;Clarisse Kagoyire,&nbsp;Lina Eklund,&nbsp;Jean Pierre Bizimana,&nbsp;Petter Pilesjo,&nbsp;Ali Mansourian\",\"doi\":\"10.4081/gh.2023.1184\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>As found in the health studies literature, the levels of climate association between epidemiological diseases have been found to vary across regions. Therefore, it seems reasonable to allow for the possibility that relationships might vary spatially within regions. We implemented the geographically weighted random forest (GWRF) machine learning method to analyze ecological disease patterns caused by spatially non-stationary processes using a malaria incidence dataset for Rwanda. We first compared the geographically weighted regression (WGR), the global random forest (GRF), and the geographically weighted random forest (GWRF) to examine the spatial non-stationarity in the non-linear relationships between malaria incidence and their risk factors. We used the Gaussian areal kriging model to disaggregate the malaria incidence at the local administrative cell level to understand the relationships at a fine scale since the model goodness of fit was not satisfactory to explain malaria incidence due to the limited number of sample values. Our results show that in terms of the coefficients of determination and prediction accuracy, the geographical random forest model performs better than the GWR and the global random forest model. The coefficients of determination of the geographically weighted regression (R2), the global RF (R2), and the GWRF (R2) were 4.74, 0.76, and 0.79, respectively. The GWRF algorithm achieves the best result and reveals that risk factors (rainfall, land surface temperature, elevation, and air temperature) have a strong non-linear relationship with the spatial distribution of malaria incidence rates, which could have implications for supporting local initiatives for malaria elimination in Rwanda.</p>\",\"PeriodicalId\":56260,\"journal\":{\"name\":\"Geospatial Health\",\"volume\":\"18 1\",\"pages\":\"\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2023-05-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geospatial Health\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.4081/gh.2023.1184\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geospatial Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.4081/gh.2023.1184","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

摘要

正如卫生研究文献所发现的那样,流行病学疾病之间的气候关联程度在不同地区有所不同。因此,考虑到区域内的关系可能在空间上有所不同似乎是合理的。我们使用卢旺达的疟疾发病率数据集实施了地理加权随机森林(GWRF)机器学习方法来分析由空间非平稳过程引起的生态疾病模式。我们首先比较了地理加权回归(WGR)、全球随机森林(GRF)和地理加权随机森林(GWRF),以检验疟疾发病率与其危险因素之间的非线性关系的空间非平稳性。由于样本数有限,模型的拟合优度不能很好地解释疟疾发病率,我们使用高斯面克里格模型对地方行政单元水平的疟疾发病率进行分解,以了解在精细尺度上的关系。结果表明,在确定系数和预测精度方面,地理随机森林模型优于GWR和全局随机森林模型。地理加权回归(R2)、全局RF (R2)和GWRF (R2)的决定系数分别为4.74、0.76和0.79。GWRF算法获得了最好的结果,并揭示了风险因素(降雨、地表温度、海拔和气温)与疟疾发病率的空间分布具有很强的非线性关系,这可能对支持卢旺达当地消除疟疾的举措具有重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Understanding the spatial non-stationarity in the relationships between malaria incidence and environmental risk factors using Geographically Weighted Random Forest: A case study in Rwanda.

As found in the health studies literature, the levels of climate association between epidemiological diseases have been found to vary across regions. Therefore, it seems reasonable to allow for the possibility that relationships might vary spatially within regions. We implemented the geographically weighted random forest (GWRF) machine learning method to analyze ecological disease patterns caused by spatially non-stationary processes using a malaria incidence dataset for Rwanda. We first compared the geographically weighted regression (WGR), the global random forest (GRF), and the geographically weighted random forest (GWRF) to examine the spatial non-stationarity in the non-linear relationships between malaria incidence and their risk factors. We used the Gaussian areal kriging model to disaggregate the malaria incidence at the local administrative cell level to understand the relationships at a fine scale since the model goodness of fit was not satisfactory to explain malaria incidence due to the limited number of sample values. Our results show that in terms of the coefficients of determination and prediction accuracy, the geographical random forest model performs better than the GWR and the global random forest model. The coefficients of determination of the geographically weighted regression (R2), the global RF (R2), and the GWRF (R2) were 4.74, 0.76, and 0.79, respectively. The GWRF algorithm achieves the best result and reveals that risk factors (rainfall, land surface temperature, elevation, and air temperature) have a strong non-linear relationship with the spatial distribution of malaria incidence rates, which could have implications for supporting local initiatives for malaria elimination in Rwanda.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Geospatial Health
Geospatial Health 医学-公共卫生、环境卫生与职业卫生
CiteScore
2.40
自引率
11.80%
发文量
48
审稿时长
12 months
期刊介绍: The focus of the journal is on all aspects of the application of geographical information systems, remote sensing, global positioning systems, spatial statistics and other geospatial tools in human and veterinary health. The journal publishes two issues per year.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信