A data-driven framework for assessing global progress towards sustainable development goals

IF 9.6 1区 环境科学与生态学 Q1 ENVIRONMENTAL STUDIES
Lu Chen , Chenyang Shuai , Xi Chen , Bu Zhao
{"title":"A data-driven framework for assessing global progress towards sustainable development goals","authors":"Lu Chen ,&nbsp;Chenyang Shuai ,&nbsp;Xi Chen ,&nbsp;Bu Zhao","doi":"10.1016/j.spc.2025.10.001","DOIUrl":null,"url":null,"abstract":"<div><div>Effective monitoring of the Sustainable Development Goals (SDGs) is crucial for advancing global sustainable development. However, widespread data gaps continue to hinder the accurate assessment of SDG performance across countries and goals. To address this challenge, this study develops a data-driven integrated assessment framework combining dimensionality reduction and machine learning-based imputation techniques, based on 380 SDG indicators from the World Bank database covering the period 2000–2020. Principal indicators were selected using a combination of Principal Component Analysis (PCA) and multiple regression, and missing data were imputed using the random forest (RF)-based missForest algorithm. Based on the completed dataset, the SDG index and performance of 17 individual SDGs were assessed for 215 countries and regions worldwide from 2000 to 2020. The results show that: (1) identification of 218 principal indicators covering over 90 % of the information in the initial set; (2) robust imputation of missing values with a Normalized Root Mean Squared Error (NRMSE) of approximately 0.2 and a Proportion of Falsely Classified (PFC) around 0.08; (3) a steady global improvement in SDG performance with significant regional disparities—Europe leading, Africa lagging, and Asia progressing most rapidly; and (4) uneven development across different goals, with some facing considerable challenges. This study enhances the completeness and applicability of global SDG performance assessment and provides empirical evidence to support more targeted sustainable development policymaking.</div></div>","PeriodicalId":48619,"journal":{"name":"Sustainable Production and Consumption","volume":"60 ","pages":"Pages 217-228"},"PeriodicalIF":9.6000,"publicationDate":"2025-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sustainable Production and Consumption","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352550925001940","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL STUDIES","Score":null,"Total":0}
引用次数: 0

Abstract

Effective monitoring of the Sustainable Development Goals (SDGs) is crucial for advancing global sustainable development. However, widespread data gaps continue to hinder the accurate assessment of SDG performance across countries and goals. To address this challenge, this study develops a data-driven integrated assessment framework combining dimensionality reduction and machine learning-based imputation techniques, based on 380 SDG indicators from the World Bank database covering the period 2000–2020. Principal indicators were selected using a combination of Principal Component Analysis (PCA) and multiple regression, and missing data were imputed using the random forest (RF)-based missForest algorithm. Based on the completed dataset, the SDG index and performance of 17 individual SDGs were assessed for 215 countries and regions worldwide from 2000 to 2020. The results show that: (1) identification of 218 principal indicators covering over 90 % of the information in the initial set; (2) robust imputation of missing values with a Normalized Root Mean Squared Error (NRMSE) of approximately 0.2 and a Proportion of Falsely Classified (PFC) around 0.08; (3) a steady global improvement in SDG performance with significant regional disparities—Europe leading, Africa lagging, and Asia progressing most rapidly; and (4) uneven development across different goals, with some facing considerable challenges. This study enhances the completeness and applicability of global SDG performance assessment and provides empirical evidence to support more targeted sustainable development policymaking.
一个数据驱动的框架,用于评估实现可持续发展目标的全球进展
有效监测可持续发展目标对推动全球可持续发展至关重要。然而,广泛的数据差距继续阻碍对各国和各目标的可持续发展目标绩效进行准确评估。为了应对这一挑战,本研究基于世界银行数据库中2000-2020年期间的380项可持续发展目标指标,开发了一个数据驱动的综合评估框架,结合了降维和基于机器学习的imputation技术。采用主成分分析(PCA)和多元回归相结合的方法选择主指标,采用基于随机森林(RF)的misforest算法对缺失数据进行估算。根据已完成的数据集,对2000年至2020年全球215个国家和地区的可持续发展目标指数和17个单项可持续发展目标的绩效进行了评估。结果表明:(1)对218个主要指标进行了识别,覆盖初始集信息的90%以上;(2)缺失值的稳健性估计,归一化均方根误差(NRMSE)约为0.2,错误分类比例(PFC)约为0.08;(3)全球可持续发展目标绩效稳步提升,但区域差异显著——欧洲领先,非洲落后,亚洲进步最快;(4)不同目标间发展不平衡,部分目标面临较大挑战。本研究增强了全球可持续发展目标绩效评估的完整性和适用性,为更有针对性的可持续发展政策制定提供了实证依据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Sustainable Production and Consumption
Sustainable Production and Consumption Environmental Science-Environmental Engineering
CiteScore
17.40
自引率
7.40%
发文量
389
审稿时长
13 days
期刊介绍: Sustainable production and consumption refers to the production and utilization of goods and services in a way that benefits society, is economically viable, and has minimal environmental impact throughout its entire lifespan. Our journal is dedicated to publishing top-notch interdisciplinary research and practical studies in this emerging field. We take a distinctive approach by examining the interplay between technology, consumption patterns, and policy to identify sustainable solutions for both production and consumption systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信