Xinyi Wei , Hao Meng , Lizhen Shao , Dongmei Fu , Lingwei Ma , Dawei Zhang
{"title":"一种基于分解的长时间连续缺失大气污染数据插值算法及其应用","authors":"Xinyi Wei , Hao Meng , Lizhen Shao , Dongmei Fu , Lingwei Ma , Dawei Zhang","doi":"10.1016/j.jocs.2025.102697","DOIUrl":null,"url":null,"abstract":"<div><div>With the intensification of environmental air pollution, the impact of air pollutants on both the ecological environment and human health has attracted widespread attention. However, due to the relatively late introduction of environmental monitoring systems, there were long consecutive missing values in early pollutant data. In this paper, we propose a decomposition-based imputation method for long consecutive missing pollution data. Firstly, wavelet coherence analysis is employed to investigate the periodic relationship between the pollution data and the relevant air data, decomposing them into periodic and non-periodic components. Then, machine learning and transfer learning are used to impute the periodic and non-periodic components, respectively. Furthermore, the effectiveness of the method is validated on artificially missing <span><math><msub><mrow><mi>NO</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span> and <span><math><msub><mrow><mi>SO</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span> concentration data from five regions of China. Comparison results show that the proposed method significantly outperforms some other imputation methods in the literature in terms of both mean absolute error and mean absolute percentage error. Finally, the proposed imputation method is applied in the study of accelerated aging of polycarbonate materials. Experimental results show that the predictive accuracy of the aging model is improved when using the imputed pollutant data.</div></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"92 ","pages":"Article 102697"},"PeriodicalIF":3.7000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A decomposition based imputation algorithm for long consecutive missing atmospheric pollution data and its application\",\"authors\":\"Xinyi Wei , Hao Meng , Lizhen Shao , Dongmei Fu , Lingwei Ma , Dawei Zhang\",\"doi\":\"10.1016/j.jocs.2025.102697\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With the intensification of environmental air pollution, the impact of air pollutants on both the ecological environment and human health has attracted widespread attention. However, due to the relatively late introduction of environmental monitoring systems, there were long consecutive missing values in early pollutant data. In this paper, we propose a decomposition-based imputation method for long consecutive missing pollution data. Firstly, wavelet coherence analysis is employed to investigate the periodic relationship between the pollution data and the relevant air data, decomposing them into periodic and non-periodic components. Then, machine learning and transfer learning are used to impute the periodic and non-periodic components, respectively. Furthermore, the effectiveness of the method is validated on artificially missing <span><math><msub><mrow><mi>NO</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span> and <span><math><msub><mrow><mi>SO</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span> concentration data from five regions of China. Comparison results show that the proposed method significantly outperforms some other imputation methods in the literature in terms of both mean absolute error and mean absolute percentage error. Finally, the proposed imputation method is applied in the study of accelerated aging of polycarbonate materials. Experimental results show that the predictive accuracy of the aging model is improved when using the imputed pollutant data.</div></div>\",\"PeriodicalId\":48907,\"journal\":{\"name\":\"Journal of Computational Science\",\"volume\":\"92 \",\"pages\":\"Article 102697\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computational Science\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1877750325001747\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Science","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1877750325001747","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
A decomposition based imputation algorithm for long consecutive missing atmospheric pollution data and its application
With the intensification of environmental air pollution, the impact of air pollutants on both the ecological environment and human health has attracted widespread attention. However, due to the relatively late introduction of environmental monitoring systems, there were long consecutive missing values in early pollutant data. In this paper, we propose a decomposition-based imputation method for long consecutive missing pollution data. Firstly, wavelet coherence analysis is employed to investigate the periodic relationship between the pollution data and the relevant air data, decomposing them into periodic and non-periodic components. Then, machine learning and transfer learning are used to impute the periodic and non-periodic components, respectively. Furthermore, the effectiveness of the method is validated on artificially missing and concentration data from five regions of China. Comparison results show that the proposed method significantly outperforms some other imputation methods in the literature in terms of both mean absolute error and mean absolute percentage error. Finally, the proposed imputation method is applied in the study of accelerated aging of polycarbonate materials. Experimental results show that the predictive accuracy of the aging model is improved when using the imputed pollutant data.
期刊介绍:
Computational Science is a rapidly growing multi- and interdisciplinary field that uses advanced computing and data analysis to understand and solve complex problems. It has reached a level of predictive capability that now firmly complements the traditional pillars of experimentation and theory.
The recent advances in experimental techniques such as detectors, on-line sensor networks and high-resolution imaging techniques, have opened up new windows into physical and biological processes at many levels of detail. The resulting data explosion allows for detailed data driven modeling and simulation.
This new discipline in science combines computational thinking, modern computational methods, devices and collateral technologies to address problems far beyond the scope of traditional numerical methods.
Computational science typically unifies three distinct elements:
• Modeling, Algorithms and Simulations (e.g. numerical and non-numerical, discrete and continuous);
• Software developed to solve science (e.g., biological, physical, and social), engineering, medicine, and humanities problems;
• Computer and information science that develops and optimizes the advanced system hardware, software, networking, and data management components (e.g. problem solving environments).