Yan Jin, Yong Ge, Haoyu Fan, Zeshuo Li, Yaojie Liu, Yan Jia
{"title":"利用随机森林地区间回归克里金模型绘制泰国 1 公里分辨率的国内生产总值分布图","authors":"Yan Jin, Yong Ge, Haoyu Fan, Zeshuo Li, Yaojie Liu, Yan Jia","doi":"10.3390/ijgi12120481","DOIUrl":null,"url":null,"abstract":"Accurate spatial distribution of gridded gross domestic product (GDP) data is crucial for revealing regional disparities within administrative units, thus facilitating a deeper understanding of regional economic dynamics, industrial distribution, and urbanization trends. The existing GDP spatial models often rely on prediction residuals for model evaluation or utilize residual distribution to improve the final accuracy, frequently overlooking the modifiable areal unit problem within residual distribution. This paper introduces a hybrid downscaling model that combines random forest and area-to-area kriging to map gridded GDP. Employing Thailand as a case study, GDP distribution maps were generated at a 1 km spatial resolution for the year 2015 and compared with five alternative downscaling methods and an existing GDP product. The results demonstrate that the proposed approach yields higher accuracy and greater precision in detailing GDP distribution, as evidenced by the smallest mean absolute error and root mean squared error values, which stand at USD 256.458 and 699.348 ten million, respectively. Among the four different sets of auxiliary variables considered, one consistently exhibited a higher prediction accuracy. This particular set of auxiliary variables integrated classification-based variables, illustrating the advantages of incorporating such integrated variables into modeling while accounting for classification characteristics.","PeriodicalId":14614,"journal":{"name":"ISPRS Int. J. Geo Inf.","volume":"5 1","pages":"481"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Mapping Gross Domestic Product Distribution at 1 km Resolution across Thailand Using the Random Forest Area-to-Area Regression Kriging Model\",\"authors\":\"Yan Jin, Yong Ge, Haoyu Fan, Zeshuo Li, Yaojie Liu, Yan Jia\",\"doi\":\"10.3390/ijgi12120481\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate spatial distribution of gridded gross domestic product (GDP) data is crucial for revealing regional disparities within administrative units, thus facilitating a deeper understanding of regional economic dynamics, industrial distribution, and urbanization trends. The existing GDP spatial models often rely on prediction residuals for model evaluation or utilize residual distribution to improve the final accuracy, frequently overlooking the modifiable areal unit problem within residual distribution. This paper introduces a hybrid downscaling model that combines random forest and area-to-area kriging to map gridded GDP. Employing Thailand as a case study, GDP distribution maps were generated at a 1 km spatial resolution for the year 2015 and compared with five alternative downscaling methods and an existing GDP product. The results demonstrate that the proposed approach yields higher accuracy and greater precision in detailing GDP distribution, as evidenced by the smallest mean absolute error and root mean squared error values, which stand at USD 256.458 and 699.348 ten million, respectively. Among the four different sets of auxiliary variables considered, one consistently exhibited a higher prediction accuracy. This particular set of auxiliary variables integrated classification-based variables, illustrating the advantages of incorporating such integrated variables into modeling while accounting for classification characteristics.\",\"PeriodicalId\":14614,\"journal\":{\"name\":\"ISPRS Int. J. Geo Inf.\",\"volume\":\"5 1\",\"pages\":\"481\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ISPRS Int. J. Geo Inf.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/ijgi12120481\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISPRS Int. J. Geo Inf.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/ijgi12120481","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
网格化国内生产总值(GDP)数据的精确空间分布对于揭示行政单元内的区域差异至关重要,从而有助于深入了解区域经济动态、产业分布和城市化趋势。现有的 GDP 空间模型往往依赖预测残差来评价模型,或利用残差分布来提高最终精度,往往忽视了残差分布中的可修改面积单位问题。本文介绍了一种混合降尺度模型,该模型结合了随机森林和区域到区域克里格法,用于绘制网格 GDP 图。本文以泰国为例,以 1 公里的空间分辨率生成了 2015 年的国内生产总值分布图,并与其他五种降尺度方法和现有的国内生产总值产品进行了比较。结果表明,所建议的方法在详细绘制国内生产总值分布图方面具有更高的准确度和精确度,这体现在平均绝对误差和均方根误差值最小,分别为 2.56458 亿美元和 6.99348 亿美元。在所考虑的四组不同的辅助变量中,有一组始终表现出较高的预测精度。这组特定的辅助变量整合了基于分类的变量,说明了在考虑分类特征的同时将此类整合变量纳入建模的优势。
Mapping Gross Domestic Product Distribution at 1 km Resolution across Thailand Using the Random Forest Area-to-Area Regression Kriging Model
Accurate spatial distribution of gridded gross domestic product (GDP) data is crucial for revealing regional disparities within administrative units, thus facilitating a deeper understanding of regional economic dynamics, industrial distribution, and urbanization trends. The existing GDP spatial models often rely on prediction residuals for model evaluation or utilize residual distribution to improve the final accuracy, frequently overlooking the modifiable areal unit problem within residual distribution. This paper introduces a hybrid downscaling model that combines random forest and area-to-area kriging to map gridded GDP. Employing Thailand as a case study, GDP distribution maps were generated at a 1 km spatial resolution for the year 2015 and compared with five alternative downscaling methods and an existing GDP product. The results demonstrate that the proposed approach yields higher accuracy and greater precision in detailing GDP distribution, as evidenced by the smallest mean absolute error and root mean squared error values, which stand at USD 256.458 and 699.348 ten million, respectively. Among the four different sets of auxiliary variables considered, one consistently exhibited a higher prediction accuracy. This particular set of auxiliary variables integrated classification-based variables, illustrating the advantages of incorporating such integrated variables into modeling while accounting for classification characteristics.