Seonje Jung , Junsu Gil , Meehye Lee , Clara Betancourt , Martin Schultz , Yunsoo Choi , Taekyu Joo , Daigon Kim
{"title":"通过可解释的人工智能比较,使用图形机器学习和参数分析来插值缺失的臭氧数据","authors":"Seonje Jung , Junsu Gil , Meehye Lee , Clara Betancourt , Martin Schultz , Yunsoo Choi , Taekyu Joo , Daigon Kim","doi":"10.1016/j.envsoft.2025.106466","DOIUrl":null,"url":null,"abstract":"<div><div>Ozone (O<sub>3</sub>), a short-lived climate pollutant, continues to increase despite policies aimed at suppressing its precursors in South Korea. The government operates approximately 500 observatories to monitor O<sub>3</sub> and trace gases. Researchers use these data to address the ongoing issue of increasing O<sub>3</sub> levels. However, challenges in data retrieval from observatories may introduce biases in O<sub>3</sub> studies. In this study, we developed a graph-based machine learning model to simulate missing O<sub>3</sub> concentrations for mitigate bias. The model incorporates spatiotemporal distribution characteristics using a merged observation dataset from South Korea in 2021. Regardless of region or length of missing data, the model effectively simulates O<sub>3</sub> variations with R<sup>2</sup> of up to 0.9 and RMSE of 3.6. To determine the influence of input parameters on O<sub>3</sub> interpolation, we used eXplainable AI methods. The results indicated that NO<sub>2</sub> is the most important factor in cities, while photochemical indicators are more influential in provinces.</div></div>","PeriodicalId":310,"journal":{"name":"Environmental Modelling & Software","volume":"190 ","pages":"Article 106466"},"PeriodicalIF":4.8000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Interpolation of missing ozone data using graph machine learning and parameter analysis through eXplainable artificial intelligence comparison\",\"authors\":\"Seonje Jung , Junsu Gil , Meehye Lee , Clara Betancourt , Martin Schultz , Yunsoo Choi , Taekyu Joo , Daigon Kim\",\"doi\":\"10.1016/j.envsoft.2025.106466\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Ozone (O<sub>3</sub>), a short-lived climate pollutant, continues to increase despite policies aimed at suppressing its precursors in South Korea. The government operates approximately 500 observatories to monitor O<sub>3</sub> and trace gases. Researchers use these data to address the ongoing issue of increasing O<sub>3</sub> levels. However, challenges in data retrieval from observatories may introduce biases in O<sub>3</sub> studies. In this study, we developed a graph-based machine learning model to simulate missing O<sub>3</sub> concentrations for mitigate bias. The model incorporates spatiotemporal distribution characteristics using a merged observation dataset from South Korea in 2021. Regardless of region or length of missing data, the model effectively simulates O<sub>3</sub> variations with R<sup>2</sup> of up to 0.9 and RMSE of 3.6. To determine the influence of input parameters on O<sub>3</sub> interpolation, we used eXplainable AI methods. The results indicated that NO<sub>2</sub> is the most important factor in cities, while photochemical indicators are more influential in provinces.</div></div>\",\"PeriodicalId\":310,\"journal\":{\"name\":\"Environmental Modelling & Software\",\"volume\":\"190 \",\"pages\":\"Article 106466\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Modelling & Software\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1364815225001501\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Modelling & Software","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1364815225001501","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Interpolation of missing ozone data using graph machine learning and parameter analysis through eXplainable artificial intelligence comparison
Ozone (O3), a short-lived climate pollutant, continues to increase despite policies aimed at suppressing its precursors in South Korea. The government operates approximately 500 observatories to monitor O3 and trace gases. Researchers use these data to address the ongoing issue of increasing O3 levels. However, challenges in data retrieval from observatories may introduce biases in O3 studies. In this study, we developed a graph-based machine learning model to simulate missing O3 concentrations for mitigate bias. The model incorporates spatiotemporal distribution characteristics using a merged observation dataset from South Korea in 2021. Regardless of region or length of missing data, the model effectively simulates O3 variations with R2 of up to 0.9 and RMSE of 3.6. To determine the influence of input parameters on O3 interpolation, we used eXplainable AI methods. The results indicated that NO2 is the most important factor in cities, while photochemical indicators are more influential in provinces.
期刊介绍:
Environmental Modelling & Software publishes contributions, in the form of research articles, reviews and short communications, on recent advances in environmental modelling and/or software. The aim is to improve our capacity to represent, understand, predict or manage the behaviour of environmental systems at all practical scales, and to communicate those improvements to a wide scientific and professional audience.