Machine learning for air quality prediction and data analysis: Review on recent advancements, challenges, and outlooks
Authors: Manal Karmoude, Brenton Munhungewarwa, Isaiah Chiraira, Ryan Mckenzie, Jude Kong, Bevan Smith, Gelan Ayana, Nkosiphendule Njara, Thuso Mathaha, Mukesh Kumar, Bruce Mellado
Journal: Science of the Total Environment, vol. 1002, 180593 (journal article, published 2025-09-29; impact factor 8.0; JCR Q1, Environmental Sciences)
DOI: https://doi.org/10.1016/j.scitotenv.2025.180593
Air quality is a critical determinant of human health, and air pollution has severe consequences. The growing need for air quality monitoring has led to the adoption of IoT sensor networks, which provide real-time data for forecasting, issuing warnings, and informing public health interventions. In this context, machine learning (ML) algorithms have proven to be powerful tools for enhancing air quality prediction and addressing monitoring challenges. However, comprehensive reviews that map the research space of ML for air quality remain scarce. This review analyzes over 70 recent studies that apply ML techniques to air quality monitoring, categorizing them by the type of learning approach employed, with a focus on identifying the most effective algorithms in each category. The findings indicate that ensemble models such as Random Forest (RF) and Extreme Gradient Boosting (XGBoost) consistently achieve high accuracy on structured datasets, while deep learning (DL) approaches such as Long Short-Term Memory (LSTM) and Convolutional Neural Networks (CNN) excel at capturing temporal dependencies and spatial patterns in pollution forecasting. Unsupervised approaches such as clustering and anomaly detection improve data quality and sensor calibration, whereas reinforcement learning shows promise in adaptive control scenarios, despite challenges related to computational intensity and interpretability. The review offers insights for policymakers and researchers developing strategies to mitigate air pollution and improve public health using advanced ML techniques.
About the journal:
The Science of the Total Environment is an international journal dedicated to scientific research on the environment and its interaction with humanity. It covers a wide range of disciplines and seeks to publish innovative, hypothesis-driven, and impactful research that explores the entire environment, including the atmosphere, lithosphere, hydrosphere, biosphere, and anthroposphere.
The journal's updated Aims & Scope emphasizes the importance of interdisciplinary environmental research with broad impact. Priority is given to studies that advance fundamental understanding and explore the interconnectedness of multiple environmental spheres. Field studies are preferred, while laboratory experiments must demonstrate significant methodological advancements or mechanistic insights with direct relevance to the environment.