Development of plastic waste generation distribution model using remote sensing data product and machine learning

Cleaner Waste Systems Pub Date : 2025-06-01 DOI:10.1016/j.clwas.2025.100324

Elprida Agustina , Emenda Sembiring , Anjar Dimara Sakti , Like Hana Fournida Purba

{"title":"Development of plastic waste generation distribution model using remote sensing data product and machine learning","authors":"Elprida Agustina , Emenda Sembiring , Anjar Dimara Sakti , Like Hana Fournida Purba","doi":"10.1016/j.clwas.2025.100324","DOIUrl":null,"url":null,"abstract":"<div><div>This research aims to map the distribution of plastic waste generation at the household level to establish baseline data for plastic waste management. The study focuses on 905,935 households in Bali Province. Variables related to household characteristics were gathered from historical studies, including house area size, population density, area characteristics based on rural or urban designations, and economic status in specific coordinates. The remote sensing data products and their corresponding variables used in this study included: VIIRS (Visible Infrared Imaging Radiometer Suite) Night-time Day/Night data representing economic status, WorldPop Global Project Population Data representing population density, and Impervious data representing urban/rural classification. 200 primary sampling data points on plastic waste generation at households, coordinates, and house area sizes were collected. Linear and nonlinear regression machine learning algorithms were performed, with plastic waste generation as the dependent variable and the extracted remote sensing data products and house size as independent variables. The best-performing model was the non-linear regression model LGBM (Light Gradient Boosting Machine), achieving an R² score of 0.882, RMSE (Root Mean Squared Error) of 18.374, and MAPE (Mean Absolute Percentage Error) of 12.877 on testing data. The modeling results indicated that the feature importance of each variable, in order, was economic status, population density, house size, and urban or rural area classification.</div></div>","PeriodicalId":100256,"journal":{"name":"Cleaner Waste Systems","volume":"11 ","pages":"Article 100324"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cleaner Waste Systems","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772912525001228","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This research aims to map the distribution of plastic waste generation at the household level to establish baseline data for plastic waste management. The study focuses on 905,935 households in Bali Province. Variables related to household characteristics were gathered from historical studies, including house area size, population density, area characteristics based on rural or urban designations, and economic status in specific coordinates. The remote sensing data products and their corresponding variables used in this study included: VIIRS (Visible Infrared Imaging Radiometer Suite) Night-time Day/Night data representing economic status, WorldPop Global Project Population Data representing population density, and Impervious data representing urban/rural classification. 200 primary sampling data points on plastic waste generation at households, coordinates, and house area sizes were collected. Linear and nonlinear regression machine learning algorithms were performed, with plastic waste generation as the dependent variable and the extracted remote sensing data products and house size as independent variables. The best-performing model was the non-linear regression model LGBM (Light Gradient Boosting Machine), achieving an R² score of 0.882, RMSE (Root Mean Squared Error) of 18.374, and MAPE (Mean Absolute Percentage Error) of 12.877 on testing data. The modeling results indicated that the feature importance of each variable, in order, was economic status, population density, house size, and urban or rural area classification.

查看原文本刊更多论文

利用遥感数据产品和机器学习开发塑料垃圾产生分布模型

本研究旨在绘制家庭层面塑料废物产生的分布图，为塑料废物管理建立基线数据。这项研究的重点是巴厘岛905,935户家庭。从历史研究中收集与家庭特征相关的变量，包括房屋面积大小、人口密度、基于农村或城市名称的区域特征以及特定坐标的经济状况。本研究使用的遥感数据产品及其对应变量包括：代表经济状况的VIIRS （Visible Infrared Imaging Radiometer Suite）夜间/日间数据，代表人口密度的WorldPop全球项目人口数据，以及代表城市/农村分类的Impervious数据。收集了200个关于家庭、坐标和房屋面积大小的塑料废物产生的主要抽样数据点。以塑料垃圾产生量为因变量，提取的遥感数据产品和房屋大小为自变量，进行线性和非线性回归机器学习算法。其中表现最好的是非线性回归模型LGBM (Light Gradient Boosting Machine)，测试数据的R²得分为0.882，均方根误差RMSE为18.374，平均绝对百分比误差MAPE为12.877。建模结果表明，各变量的特征重要性依次为经济状况、人口密度、房屋面积、城乡分类。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Cleaner Waste Systems

CiteScore

2.60

自引率

0.00%

发文量