Development of plastic waste generation distribution model using remote sensing data product and machine learning

Elprida Agustina , Emenda Sembiring , Anjar Dimara Sakti , Like Hana Fournida Purba
{"title":"Development of plastic waste generation distribution model using remote sensing data product and machine learning","authors":"Elprida Agustina ,&nbsp;Emenda Sembiring ,&nbsp;Anjar Dimara Sakti ,&nbsp;Like Hana Fournida Purba","doi":"10.1016/j.clwas.2025.100324","DOIUrl":null,"url":null,"abstract":"<div><div>This research aims to map the distribution of plastic waste generation at the household level to establish baseline data for plastic waste management. The study focuses on 905,935 households in Bali Province. Variables related to household characteristics were gathered from historical studies, including house area size, population density, area characteristics based on rural or urban designations, and economic status in specific coordinates. The remote sensing data products and their corresponding variables used in this study included: VIIRS (Visible Infrared Imaging Radiometer Suite) Night-time Day/Night data representing economic status, WorldPop Global Project Population Data representing population density, and Impervious data representing urban/rural classification. 200 primary sampling data points on plastic waste generation at households, coordinates, and house area sizes were collected. Linear and nonlinear regression machine learning algorithms were performed, with plastic waste generation as the dependent variable and the extracted remote sensing data products and house size as independent variables. The best-performing model was the non-linear regression model LGBM (Light Gradient Boosting Machine), achieving an R² score of 0.882, RMSE (Root Mean Squared Error) of 18.374, and MAPE (Mean Absolute Percentage Error) of 12.877 on testing data. The modeling results indicated that the feature importance of each variable, in order, was economic status, population density, house size, and urban or rural area classification.</div></div>","PeriodicalId":100256,"journal":{"name":"Cleaner Waste Systems","volume":"11 ","pages":"Article 100324"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cleaner Waste Systems","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772912525001228","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This research aims to map the distribution of plastic waste generation at the household level to establish baseline data for plastic waste management. The study focuses on 905,935 households in Bali Province. Variables related to household characteristics were gathered from historical studies, including house area size, population density, area characteristics based on rural or urban designations, and economic status in specific coordinates. The remote sensing data products and their corresponding variables used in this study included: VIIRS (Visible Infrared Imaging Radiometer Suite) Night-time Day/Night data representing economic status, WorldPop Global Project Population Data representing population density, and Impervious data representing urban/rural classification. 200 primary sampling data points on plastic waste generation at households, coordinates, and house area sizes were collected. Linear and nonlinear regression machine learning algorithms were performed, with plastic waste generation as the dependent variable and the extracted remote sensing data products and house size as independent variables. The best-performing model was the non-linear regression model LGBM (Light Gradient Boosting Machine), achieving an R² score of 0.882, RMSE (Root Mean Squared Error) of 18.374, and MAPE (Mean Absolute Percentage Error) of 12.877 on testing data. The modeling results indicated that the feature importance of each variable, in order, was economic status, population density, house size, and urban or rural area classification.
利用遥感数据产品和机器学习开发塑料垃圾产生分布模型
本研究旨在绘制家庭层面塑料废物产生的分布图,为塑料废物管理建立基线数据。这项研究的重点是巴厘岛905,935户家庭。从历史研究中收集与家庭特征相关的变量,包括房屋面积大小、人口密度、基于农村或城市名称的区域特征以及特定坐标的经济状况。本研究使用的遥感数据产品及其对应变量包括:代表经济状况的VIIRS (Visible Infrared Imaging Radiometer Suite)夜间/日间数据,代表人口密度的WorldPop全球项目人口数据,以及代表城市/农村分类的Impervious数据。收集了200个关于家庭、坐标和房屋面积大小的塑料废物产生的主要抽样数据点。以塑料垃圾产生量为因变量,提取的遥感数据产品和房屋大小为自变量,进行线性和非线性回归机器学习算法。其中表现最好的是非线性回归模型LGBM (Light Gradient Boosting Machine),测试数据的R²得分为0.882,均方根误差RMSE为18.374,平均绝对百分比误差MAPE为12.877。建模结果表明,各变量的特征重要性依次为经济状况、人口密度、房屋面积、城乡分类。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
2.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信