Dioumacor Faye , Redouane Lguensat , François Kaly , Andrew Sudmant , Amadou T. Gaye , Egide Kalisa
{"title":"空气质量预测的机器学习:来自卢旺达五个省的见解","authors":"Dioumacor Faye , Redouane Lguensat , François Kaly , Andrew Sudmant , Amadou T. Gaye , Egide Kalisa","doi":"10.1016/j.sciaf.2025.e02959","DOIUrl":null,"url":null,"abstract":"<div><div>Accurately predicting air quality is a crucial challenge for public health and environmental management. This study compares and contrasts machine learning approaches to benchmark best practices for the Rwandan context and to evaluate the added value of advanced statistical methods for air quality monitoring in data-scarce settings. We forecast fine particulate matter (PM<sub>2.5</sub>) concentrations across five provinces in Rwanda, using multi-year meteorological and air quality data to identify context-specific patterns. This work establishes a methodological foundation for context-optimized early warning systems and informs policy interventions to improve air quality management in Rwanda. By rigorously testing machine learning capabilities against regional constraints, we demonstrate how machine learning can reduce population exposure to pollution, quantify attribution gaps in under-monitored regions, and improve sustainable environmental governance in resource-limited settings. The results indicate significant seasonal variability, with higher PM<sub>2.5</sub> levels during dry seasons than wet seasons. Our evaluation demonstrates that machine learning models can capture complex, non-linear relationships between environmental variables and pollution trends, although performance varies between algorithms. Limitations remain, including the integration of real-time data streams and localized variables such as industrial emissions, road traffic, and agricultural practices.</div></div>","PeriodicalId":21690,"journal":{"name":"Scientific African","volume":"30 ","pages":"Article e02959"},"PeriodicalIF":3.3000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning for air quality forecasting: Insights from five provinces of Rwanda\",\"authors\":\"Dioumacor Faye , Redouane Lguensat , François Kaly , Andrew Sudmant , Amadou T. Gaye , Egide Kalisa\",\"doi\":\"10.1016/j.sciaf.2025.e02959\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurately predicting air quality is a crucial challenge for public health and environmental management. This study compares and contrasts machine learning approaches to benchmark best practices for the Rwandan context and to evaluate the added value of advanced statistical methods for air quality monitoring in data-scarce settings. We forecast fine particulate matter (PM<sub>2.5</sub>) concentrations across five provinces in Rwanda, using multi-year meteorological and air quality data to identify context-specific patterns. This work establishes a methodological foundation for context-optimized early warning systems and informs policy interventions to improve air quality management in Rwanda. By rigorously testing machine learning capabilities against regional constraints, we demonstrate how machine learning can reduce population exposure to pollution, quantify attribution gaps in under-monitored regions, and improve sustainable environmental governance in resource-limited settings. The results indicate significant seasonal variability, with higher PM<sub>2.5</sub> levels during dry seasons than wet seasons. Our evaluation demonstrates that machine learning models can capture complex, non-linear relationships between environmental variables and pollution trends, although performance varies between algorithms. Limitations remain, including the integration of real-time data streams and localized variables such as industrial emissions, road traffic, and agricultural practices.</div></div>\",\"PeriodicalId\":21690,\"journal\":{\"name\":\"Scientific African\",\"volume\":\"30 \",\"pages\":\"Article e02959\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific African\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2468227625004296\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific African","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468227625004296","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Machine learning for air quality forecasting: Insights from five provinces of Rwanda
Accurately predicting air quality is a crucial challenge for public health and environmental management. This study compares and contrasts machine learning approaches to benchmark best practices for the Rwandan context and to evaluate the added value of advanced statistical methods for air quality monitoring in data-scarce settings. We forecast fine particulate matter (PM2.5) concentrations across five provinces in Rwanda, using multi-year meteorological and air quality data to identify context-specific patterns. This work establishes a methodological foundation for context-optimized early warning systems and informs policy interventions to improve air quality management in Rwanda. By rigorously testing machine learning capabilities against regional constraints, we demonstrate how machine learning can reduce population exposure to pollution, quantify attribution gaps in under-monitored regions, and improve sustainable environmental governance in resource-limited settings. The results indicate significant seasonal variability, with higher PM2.5 levels during dry seasons than wet seasons. Our evaluation demonstrates that machine learning models can capture complex, non-linear relationships between environmental variables and pollution trends, although performance varies between algorithms. Limitations remain, including the integration of real-time data streams and localized variables such as industrial emissions, road traffic, and agricultural practices.