Luis Izquierdo-Horna , Ramzy Kahhat , Ian Vázquez-Rowe
{"title":"应用随机森林预测家庭燃料消费产生的城市固体废物","authors":"Luis Izquierdo-Horna , Ramzy Kahhat , Ian Vázquez-Rowe","doi":"10.1016/j.rcradv.2025.200264","DOIUrl":null,"url":null,"abstract":"<div><div>Accurately forecasting municipal solid waste (MSW) generation is essential for designing efficient waste management systems and promoting sustainable urban development. As cities expand and consumption patterns shift, reliable data-driven approaches are increasingly necessary to address the complexities of MSW generation. This study applied the random forest (RF) algorithm, a machine learning technique, to predict MSW generation at the household level. RF was selected for its capacity to handle non-linear relationships, imbalanced datasets, and outliers. The analysis focused on data from 2019, avoiding distortions associated with the COVID-19 pandemic. The model integrated per capita MSW data with household fuel consumption indicators (i.e., natural gas, electricity, and liquefied petroleum gas) and demographic variables such as age, education level, and monthly expenditure. The case study focused on the city of Lima, Peru, using 80 % of the data for training and 20 % for testing, with hyperparameters optimized via 5-fold cross-validation. The final model explained 55 % of the variance in MSW generation (R² = 0.55). This result reflects the model’s ability to capture significant drivers of variability, although it leaves room for refinement due to factors not included in the analysis, such as cultural practices or seasonality. Among the predictors, household monthly expenditure on cooking fuels emerged as the most influential variable, reinforcing the connection between resource consumption and waste generation. These findings highlight the potential of integrating socioeconomic indicators into predictive models to enhance their reliability. By improving forecasting capabilities, this study supports targeted policies for urban waste management and sustainable resource use.</div></div>","PeriodicalId":74689,"journal":{"name":"Resources, conservation & recycling advances","volume":"27 ","pages":"Article 200264"},"PeriodicalIF":6.4000,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Applying random forest to forecast municipal solid waste generation from household fuel consumption\",\"authors\":\"Luis Izquierdo-Horna , Ramzy Kahhat , Ian Vázquez-Rowe\",\"doi\":\"10.1016/j.rcradv.2025.200264\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurately forecasting municipal solid waste (MSW) generation is essential for designing efficient waste management systems and promoting sustainable urban development. As cities expand and consumption patterns shift, reliable data-driven approaches are increasingly necessary to address the complexities of MSW generation. This study applied the random forest (RF) algorithm, a machine learning technique, to predict MSW generation at the household level. RF was selected for its capacity to handle non-linear relationships, imbalanced datasets, and outliers. The analysis focused on data from 2019, avoiding distortions associated with the COVID-19 pandemic. The model integrated per capita MSW data with household fuel consumption indicators (i.e., natural gas, electricity, and liquefied petroleum gas) and demographic variables such as age, education level, and monthly expenditure. The case study focused on the city of Lima, Peru, using 80 % of the data for training and 20 % for testing, with hyperparameters optimized via 5-fold cross-validation. The final model explained 55 % of the variance in MSW generation (R² = 0.55). This result reflects the model’s ability to capture significant drivers of variability, although it leaves room for refinement due to factors not included in the analysis, such as cultural practices or seasonality. Among the predictors, household monthly expenditure on cooking fuels emerged as the most influential variable, reinforcing the connection between resource consumption and waste generation. These findings highlight the potential of integrating socioeconomic indicators into predictive models to enhance their reliability. By improving forecasting capabilities, this study supports targeted policies for urban waste management and sustainable resource use.</div></div>\",\"PeriodicalId\":74689,\"journal\":{\"name\":\"Resources, conservation & recycling advances\",\"volume\":\"27 \",\"pages\":\"Article 200264\"},\"PeriodicalIF\":6.4000,\"publicationDate\":\"2025-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Resources, conservation & recycling advances\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2667378925000227\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Resources, conservation & recycling advances","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667378925000227","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Applying random forest to forecast municipal solid waste generation from household fuel consumption
Accurately forecasting municipal solid waste (MSW) generation is essential for designing efficient waste management systems and promoting sustainable urban development. As cities expand and consumption patterns shift, reliable data-driven approaches are increasingly necessary to address the complexities of MSW generation. This study applied the random forest (RF) algorithm, a machine learning technique, to predict MSW generation at the household level. RF was selected for its capacity to handle non-linear relationships, imbalanced datasets, and outliers. The analysis focused on data from 2019, avoiding distortions associated with the COVID-19 pandemic. The model integrated per capita MSW data with household fuel consumption indicators (i.e., natural gas, electricity, and liquefied petroleum gas) and demographic variables such as age, education level, and monthly expenditure. The case study focused on the city of Lima, Peru, using 80 % of the data for training and 20 % for testing, with hyperparameters optimized via 5-fold cross-validation. The final model explained 55 % of the variance in MSW generation (R² = 0.55). This result reflects the model’s ability to capture significant drivers of variability, although it leaves room for refinement due to factors not included in the analysis, such as cultural practices or seasonality. Among the predictors, household monthly expenditure on cooking fuels emerged as the most influential variable, reinforcing the connection between resource consumption and waste generation. These findings highlight the potential of integrating socioeconomic indicators into predictive models to enhance their reliability. By improving forecasting capabilities, this study supports targeted policies for urban waste management and sustainable resource use.