采用结构化深度学习和选择性状态空间建模的低成本视频空气质量估计系统

IF 10.3 1区环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES

Environment International Pub Date : 2025-05-01 DOI:10.1016/j.envint.2025.109496

Maqsood Ahmed , Xiang Zhang , Yonglin Shen , Tanveer Ahmed , Shahid Ali , Ayaz Ali , Aminjon Gulakhmadov , Won-Ho Nam , Nengcheng Chen

{"title":"采用结构化深度学习和选择性状态空间建模的低成本视频空气质量估计系统","authors":"Maqsood Ahmed , Xiang Zhang , Yonglin Shen , Tanveer Ahmed , Shahid Ali , Ayaz Ali , Aminjon Gulakhmadov , Won-Ho Nam , Nengcheng Chen","doi":"10.1016/j.envint.2025.109496","DOIUrl":null,"url":null,"abstract":"<div><div>Air quality is crucial for both public health and environmental sustainability. An efficient and cost-effective model is essential for accurate air quality predictions and proactive pollution control. However, existing research primarily focuses on single static image analysis, which does not account for the dynamic and temporal nature of air pollution. Meanwhile, research on video-based air quality estimation remains limited, particularly in achieving accurate multi-pollutant outputs. This study proposes Air Quality Prediction-Mamba (AQP-Mamba), a video-based deep learning model that integrates a structured Selective State Space Model (SSM) with a selective scan mechanism and a hybrid predictor (HP) to estimate air quality. The spatiotemporal forward and backward SSM dynamically adjusts parameters based on input, ensures linear complexity, and effectively captures long-range dependencies by bidirectional processing of spatiotemporal features through four scanning techniques (row-wise, column-wise, and their vertical reversals), which allows the model to accurately track pollutant concentrations and air quality variations over time. Thus, the model efficiently extracts spatiotemporal features from video and simultaneously performs regression (PM<sub>2.5</sub>, PM<sub>10</sub>, and AQI), and classification (AQI) tasks, respectively. A high-quality outdoor hourly air quality dataset (LMSAQV) with 13,176 videos collected from six monitoring stations in Lahore, Pakistan, was utilized as the case study. The experimental results demonstrate that the AQP-Mamba significantly outperforms several state-of-the-art models, including VideoSwin-T, VideoMAE, I3D, VTHCL, andTimeSformer. The proposed model achieves strong regression performance (PM<sub>2.5</sub>: R<sup>2</sup> = 0.91, PM<sub>10</sub>: R<sup>2</sup> = 0.90, AQI: R<sup>2</sup> = 0.92) and excellent classification metrics: accuracy (94.57 %), precision (93.86 %), recall (94.20 %), and F1-score (93.44 %), respectively. The proposed model delivers consistent, real-time performance with a latency of 1.98 s per video, offering an effective, scalable, and cost-efficient solution for multi-pollutant estimation. This approach has the potential to address gaps in air quality data collected by expensive instruments globally.</div></div>","PeriodicalId":308,"journal":{"name":"Environment International","volume":"199 ","pages":"Article 109496"},"PeriodicalIF":10.3000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Low-cost video-based air quality estimation system using structured deep learning with selective state space modeling\",\"authors\":\"Maqsood Ahmed , Xiang Zhang , Yonglin Shen , Tanveer Ahmed , Shahid Ali , Ayaz Ali , Aminjon Gulakhmadov , Won-Ho Nam , Nengcheng Chen\",\"doi\":\"10.1016/j.envint.2025.109496\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Air quality is crucial for both public health and environmental sustainability. An efficient and cost-effective model is essential for accurate air quality predictions and proactive pollution control. However, existing research primarily focuses on single static image analysis, which does not account for the dynamic and temporal nature of air pollution. Meanwhile, research on video-based air quality estimation remains limited, particularly in achieving accurate multi-pollutant outputs. This study proposes Air Quality Prediction-Mamba (AQP-Mamba), a video-based deep learning model that integrates a structured Selective State Space Model (SSM) with a selective scan mechanism and a hybrid predictor (HP) to estimate air quality. The spatiotemporal forward and backward SSM dynamically adjusts parameters based on input, ensures linear complexity, and effectively captures long-range dependencies by bidirectional processing of spatiotemporal features through four scanning techniques (row-wise, column-wise, and their vertical reversals), which allows the model to accurately track pollutant concentrations and air quality variations over time. Thus, the model efficiently extracts spatiotemporal features from video and simultaneously performs regression (PM<sub>2.5</sub>, PM<sub>10</sub>, and AQI), and classification (AQI) tasks, respectively. A high-quality outdoor hourly air quality dataset (LMSAQV) with 13,176 videos collected from six monitoring stations in Lahore, Pakistan, was utilized as the case study. The experimental results demonstrate that the AQP-Mamba significantly outperforms several state-of-the-art models, including VideoSwin-T, VideoMAE, I3D, VTHCL, andTimeSformer. The proposed model achieves strong regression performance (PM<sub>2.5</sub>: R<sup>2</sup> = 0.91, PM<sub>10</sub>: R<sup>2</sup> = 0.90, AQI: R<sup>2</sup> = 0.92) and excellent classification metrics: accuracy (94.57 %), precision (93.86 %), recall (94.20 %), and F1-score (93.44 %), respectively. The proposed model delivers consistent, real-time performance with a latency of 1.98 s per video, offering an effective, scalable, and cost-efficient solution for multi-pollutant estimation. This approach has the potential to address gaps in air quality data collected by expensive instruments globally.</div></div>\",\"PeriodicalId\":308,\"journal\":{\"name\":\"Environment International\",\"volume\":\"199 \",\"pages\":\"Article 109496\"},\"PeriodicalIF\":10.3000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environment International\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0160412025002478\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environment International","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0160412025002478","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

空气质量对公众健康和环境可持续性都至关重要。一个高效和具有成本效益的模型对于准确预测空气质量和主动控制污染至关重要。然而，现有的研究主要集中在单一的静态图像分析，没有考虑到空气污染的动态性和时代性。同时，基于视频的空气质量估计的研究仍然有限，特别是在实现精确的多污染物输出方面。本研究提出了空气质量预测-曼巴（AQP-Mamba），这是一种基于视频的深度学习模型，它集成了带有选择性扫描机制的结构化选择性状态空间模型（SSM）和混合预测器（HP）来估计空气质量。时空正向和反向SSM基于输入动态调整参数，确保线性复杂性，并通过四种扫描技术（行向、列向及其垂直反转）对时空特征进行双向处理，有效捕获远程依赖关系，从而使模型能够准确跟踪污染物浓度和空气质量随时间的变化。因此，该模型有效地从视频中提取时空特征，同时分别执行回归（PM2.5， PM10和AQI）和分类（AQI）任务。一个高质量的户外每小时空气质量数据集（LMSAQV）收集了来自巴基斯坦拉合尔6个监测站的13176个视频作为案例研究。实验结果表明，AQP-Mamba显著优于几种最先进的模型，包括video swan -t、VideoMAE、I3D、VTHCL和TimeSformer。该模型实现了强劲的回归性能(PM2.5: R2 = 0.91,PM10: R2 = 0.90,机能:R2 = 0.92)和优秀的分类指标:精度(94.57 %),精密(93.86 %),回忆(94.20 %),和F1-score(93.44 %),分别。所提出的模型提供一致的实时性能，每个视频的延迟为1.98 s，为多污染物估计提供了有效，可扩展和经济高效的解决方案。这种方法有可能解决全球昂贵仪器收集的空气质量数据的差距。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Low-cost video-based air quality estimation system using structured deep learning with selective state space modeling

查看原文本刊更多论文

Low-cost video-based air quality estimation system using structured deep learning with selective state space modeling

Air quality is crucial for both public health and environmental sustainability. An efficient and cost-effective model is essential for accurate air quality predictions and proactive pollution control. However, existing research primarily focuses on single static image analysis, which does not account for the dynamic and temporal nature of air pollution. Meanwhile, research on video-based air quality estimation remains limited, particularly in achieving accurate multi-pollutant outputs. This study proposes Air Quality Prediction-Mamba (AQP-Mamba), a video-based deep learning model that integrates a structured Selective State Space Model (SSM) with a selective scan mechanism and a hybrid predictor (HP) to estimate air quality. The spatiotemporal forward and backward SSM dynamically adjusts parameters based on input, ensures linear complexity, and effectively captures long-range dependencies by bidirectional processing of spatiotemporal features through four scanning techniques (row-wise, column-wise, and their vertical reversals), which allows the model to accurately track pollutant concentrations and air quality variations over time. Thus, the model efficiently extracts spatiotemporal features from video and simultaneously performs regression (PM_2.5, PM₁₀, and AQI), and classification (AQI) tasks, respectively. A high-quality outdoor hourly air quality dataset (LMSAQV) with 13,176 videos collected from six monitoring stations in Lahore, Pakistan, was utilized as the case study. The experimental results demonstrate that the AQP-Mamba significantly outperforms several state-of-the-art models, including VideoSwin-T, VideoMAE, I3D, VTHCL, and TimeSformer. The proposed model achieves strong regression performance (PM_2.5: R² = 0.91, PM₁₀: R² = 0.90, AQI: R² = 0.92) and excellent classification metrics: accuracy (94.57 %), precision (93.86 %), recall (94.20 %), and F1-score (93.44 %), respectively. The proposed model delivers consistent, real-time performance with a latency of 1.98 s per video, offering an effective, scalable, and cost-efficient solution for multi-pollutant estimation. This approach has the potential to address gaps in air quality data collected by expensive instruments globally.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Environment International 环境科学-环境科学

CiteScore

21.90

自引率

3.40%

发文量

734

审稿时长

2.8 months

期刊介绍： Environmental Health publishes manuscripts focusing on critical aspects of environmental and occupational medicine, including studies in toxicology and epidemiology, to illuminate the human health implications of exposure to environmental hazards. The journal adopts an open-access model and practices open peer review. It caters to scientists and practitioners across all environmental science domains, directly or indirectly impacting human health and well-being. With a commitment to enhancing the prevention of environmentally-related health risks, Environmental Health serves as a public health journal for the community and scientists engaged in matters of public health significance concerning the environment.