{"title":"评估地铁室内空气质量的深度学习数据输入:准确性、效率和对下游任务的影响","authors":"Vahid Ghorbani , Amir Ghorbani , ChangKyoo Yoo","doi":"10.1016/j.buildenv.2025.113713","DOIUrl":null,"url":null,"abstract":"<div><div>Missing data is a pervasive issue in environmental and building systems and poses significant challenges for indoor air quality (IAQ) management in subway stations, where sensor readings directly guide ventilation control, exposure assessment, and risk modeling. This paper evaluates a range of imputation techniques for subway IAQ time-series, comparing their reconstruction accuracy, computational efficiency, and impact on downstream tasks. Twelve models are assessed across twelve distinct missing data scenarios that vary in pattern and severity. The methods include nine advanced deep learning approaches covering sequence and attention models, probabilistic generative frameworks, graph aware techniques and spectral or convolutional architectures and three naive statistical methods. Experiments use a newly collected dataset from Yeongtong station in the Seoul metro system giving real data insights into IAQ imputation challenges. To our knowledge this is the first comprehensive evaluation linking algorithmic performance to both ventilation control implications and forecasting outcomes in a multi-zone IAQ monitoring context. This study demonstrates the importance of aligning imputation strategies with specific missingness characteristics and forecasting requirements by means of reconstruction accuracy, which improved downstream IAQ task performances of forecasting and ventilation control. All implementation details are publicly accessible.<span><span><sup>1</sup></span></span></div></div>","PeriodicalId":9273,"journal":{"name":"Building and Environment","volume":"286 ","pages":"Article 113713"},"PeriodicalIF":7.6000,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating deep learning data imputation for subway indoor air quality: Accuracy, efficiency, and implications for downstream tasks\",\"authors\":\"Vahid Ghorbani , Amir Ghorbani , ChangKyoo Yoo\",\"doi\":\"10.1016/j.buildenv.2025.113713\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Missing data is a pervasive issue in environmental and building systems and poses significant challenges for indoor air quality (IAQ) management in subway stations, where sensor readings directly guide ventilation control, exposure assessment, and risk modeling. This paper evaluates a range of imputation techniques for subway IAQ time-series, comparing their reconstruction accuracy, computational efficiency, and impact on downstream tasks. Twelve models are assessed across twelve distinct missing data scenarios that vary in pattern and severity. The methods include nine advanced deep learning approaches covering sequence and attention models, probabilistic generative frameworks, graph aware techniques and spectral or convolutional architectures and three naive statistical methods. Experiments use a newly collected dataset from Yeongtong station in the Seoul metro system giving real data insights into IAQ imputation challenges. To our knowledge this is the first comprehensive evaluation linking algorithmic performance to both ventilation control implications and forecasting outcomes in a multi-zone IAQ monitoring context. This study demonstrates the importance of aligning imputation strategies with specific missingness characteristics and forecasting requirements by means of reconstruction accuracy, which improved downstream IAQ task performances of forecasting and ventilation control. All implementation details are publicly accessible.<span><span><sup>1</sup></span></span></div></div>\",\"PeriodicalId\":9273,\"journal\":{\"name\":\"Building and Environment\",\"volume\":\"286 \",\"pages\":\"Article 113713\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Building and Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0360132325011837\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Building and Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360132325011837","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
Evaluating deep learning data imputation for subway indoor air quality: Accuracy, efficiency, and implications for downstream tasks
Missing data is a pervasive issue in environmental and building systems and poses significant challenges for indoor air quality (IAQ) management in subway stations, where sensor readings directly guide ventilation control, exposure assessment, and risk modeling. This paper evaluates a range of imputation techniques for subway IAQ time-series, comparing their reconstruction accuracy, computational efficiency, and impact on downstream tasks. Twelve models are assessed across twelve distinct missing data scenarios that vary in pattern and severity. The methods include nine advanced deep learning approaches covering sequence and attention models, probabilistic generative frameworks, graph aware techniques and spectral or convolutional architectures and three naive statistical methods. Experiments use a newly collected dataset from Yeongtong station in the Seoul metro system giving real data insights into IAQ imputation challenges. To our knowledge this is the first comprehensive evaluation linking algorithmic performance to both ventilation control implications and forecasting outcomes in a multi-zone IAQ monitoring context. This study demonstrates the importance of aligning imputation strategies with specific missingness characteristics and forecasting requirements by means of reconstruction accuracy, which improved downstream IAQ task performances of forecasting and ventilation control. All implementation details are publicly accessible.1
期刊介绍:
Building and Environment, an international journal, is dedicated to publishing original research papers, comprehensive review articles, editorials, and short communications in the fields of building science, urban physics, and human interaction with the indoor and outdoor built environment. The journal emphasizes innovative technologies and knowledge verified through measurement and analysis. It covers environmental performance across various spatial scales, from cities and communities to buildings and systems, fostering collaborative, multi-disciplinary research with broader significance.