{"title":"多空气污染物估计的注意机制增强随机森林模型","authors":"Xinyu Yu , Man Sing Wong , Kwon-Ho Lee","doi":"10.1016/j.jag.2025.104661","DOIUrl":null,"url":null,"abstract":"<div><div>Machine learning techniques based on satellite observations energize the derivation of near-surface air pollutant concentrations. However, most of previous studies mainly focused on estimating single air pollutant concentration, ignoring the interactions and dependencies between different air pollutants. Therefore, we proposed a Multiple Pollutants simultaneous estimation method based on Attention mechanism augmented Random Forest model (MPA-RF), including PM<sub>2.5</sub>, PM<sub>10</sub>, O<sub>3</sub>, NO<sub>2</sub>, CO and SO<sub>2</sub>. Specifically, self-attention mechanism was incorporated with the multi-output random forest first to emphasize pertinent features in inputs during model training. Additionally, the multi-head self-attention was also integrated to derive the interactions and temporal dependencies of different air pollutants from historical data. Satellite observations from Advanced Himawari Imager (AHI) in three major urban agglomerations in China were extracted to demonstrate the model performance using sample- and site-based cross-validation schemes. Results elucidate that the proposed model is capable of deriving simultaneous estimations of six air pollutants with high accuracy, R<sup>2</sup> ranging from 0.74 to 0.93. Benefiting from the consideration of interactions and dependencies between different air pollutants, the proposed model outperforms other single-task contrast models with an R<sup>2</sup> improvement ranging from 9% to 26%. Moreover, the derived seamless estimations offer a basis for air pollution spatio-temporal patterns and dynamic evolution analysis with time-saving and efficient manner.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"141 ","pages":"Article 104661"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Attention mechanism augmented random forest model for multiple air pollutants estimation\",\"authors\":\"Xinyu Yu , Man Sing Wong , Kwon-Ho Lee\",\"doi\":\"10.1016/j.jag.2025.104661\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Machine learning techniques based on satellite observations energize the derivation of near-surface air pollutant concentrations. However, most of previous studies mainly focused on estimating single air pollutant concentration, ignoring the interactions and dependencies between different air pollutants. Therefore, we proposed a Multiple Pollutants simultaneous estimation method based on Attention mechanism augmented Random Forest model (MPA-RF), including PM<sub>2.5</sub>, PM<sub>10</sub>, O<sub>3</sub>, NO<sub>2</sub>, CO and SO<sub>2</sub>. Specifically, self-attention mechanism was incorporated with the multi-output random forest first to emphasize pertinent features in inputs during model training. Additionally, the multi-head self-attention was also integrated to derive the interactions and temporal dependencies of different air pollutants from historical data. Satellite observations from Advanced Himawari Imager (AHI) in three major urban agglomerations in China were extracted to demonstrate the model performance using sample- and site-based cross-validation schemes. Results elucidate that the proposed model is capable of deriving simultaneous estimations of six air pollutants with high accuracy, R<sup>2</sup> ranging from 0.74 to 0.93. Benefiting from the consideration of interactions and dependencies between different air pollutants, the proposed model outperforms other single-task contrast models with an R<sup>2</sup> improvement ranging from 9% to 26%. Moreover, the derived seamless estimations offer a basis for air pollution spatio-temporal patterns and dynamic evolution analysis with time-saving and efficient manner.</div></div>\",\"PeriodicalId\":73423,\"journal\":{\"name\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"volume\":\"141 \",\"pages\":\"Article 104661\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-06-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1569843225003085\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"REMOTE SENSING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569843225003085","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
Attention mechanism augmented random forest model for multiple air pollutants estimation
Machine learning techniques based on satellite observations energize the derivation of near-surface air pollutant concentrations. However, most of previous studies mainly focused on estimating single air pollutant concentration, ignoring the interactions and dependencies between different air pollutants. Therefore, we proposed a Multiple Pollutants simultaneous estimation method based on Attention mechanism augmented Random Forest model (MPA-RF), including PM2.5, PM10, O3, NO2, CO and SO2. Specifically, self-attention mechanism was incorporated with the multi-output random forest first to emphasize pertinent features in inputs during model training. Additionally, the multi-head self-attention was also integrated to derive the interactions and temporal dependencies of different air pollutants from historical data. Satellite observations from Advanced Himawari Imager (AHI) in three major urban agglomerations in China were extracted to demonstrate the model performance using sample- and site-based cross-validation schemes. Results elucidate that the proposed model is capable of deriving simultaneous estimations of six air pollutants with high accuracy, R2 ranging from 0.74 to 0.93. Benefiting from the consideration of interactions and dependencies between different air pollutants, the proposed model outperforms other single-task contrast models with an R2 improvement ranging from 9% to 26%. Moreover, the derived seamless estimations offer a basis for air pollution spatio-temporal patterns and dynamic evolution analysis with time-saving and efficient manner.
期刊介绍:
The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.