Xutong Liu , Xuhui Kong , Wufan Xuan , Jialin Li , Andrew Nyakundi , Yuxuan Zhang , Lina Zheng , Fubao Zhou
{"title":"基于机器学习的后向散射图像粉尘浓度分布预测方法","authors":"Xutong Liu , Xuhui Kong , Wufan Xuan , Jialin Li , Andrew Nyakundi , Yuxuan Zhang , Lina Zheng , Fubao Zhou","doi":"10.1016/j.envsoft.2025.106620","DOIUrl":null,"url":null,"abstract":"<div><div>Dust is a significant environmental pollutant that poses serious risks to human health, so its proper monitoring is quite significant. However, traditional light scattering single-point measurements are limited to reflecting the dust concentration distribution in a large-scale environment. In this paper, we introduced the light intensity distance and obtained a large-scale dust concentration distribution prediction model by analyzing the relationship between dust concentration and the attenuation of light intensity caused by dust scattering and absorption in the optical path. Fifteen classical machine learning algorithms were applied, which proved the importance of light intensity distance in predicting dust concentration. When only fitting the relationship between light intensity and concentration, each algorithm gave the result with R2 of about 0.9000, while Kolmogorov–Arnold Networks (KAN) had the best prediction result 0.9076. When light intensity distance was added, the prediction accuracy of each algorithm was improved correspondingly. Light Gradient Boosting Machine (LightGBM) methods have the best performance (R2: 0.9500), and KAN followed (R2: 0.9472). We added another set of <span><math><msub><mrow><mi>SiO</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span> experiments and sensitivity analyses for light intensity distance to demonstrate the applicability of the model. Finally, LightGBM was used to predict the dust concentration distribution in the whole process.</div></div>","PeriodicalId":310,"journal":{"name":"Environmental Modelling & Software","volume":"193 ","pages":"Article 106620"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning-based methods for dust concentration distribution prediction by utilizing back scattering images\",\"authors\":\"Xutong Liu , Xuhui Kong , Wufan Xuan , Jialin Li , Andrew Nyakundi , Yuxuan Zhang , Lina Zheng , Fubao Zhou\",\"doi\":\"10.1016/j.envsoft.2025.106620\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Dust is a significant environmental pollutant that poses serious risks to human health, so its proper monitoring is quite significant. However, traditional light scattering single-point measurements are limited to reflecting the dust concentration distribution in a large-scale environment. In this paper, we introduced the light intensity distance and obtained a large-scale dust concentration distribution prediction model by analyzing the relationship between dust concentration and the attenuation of light intensity caused by dust scattering and absorption in the optical path. Fifteen classical machine learning algorithms were applied, which proved the importance of light intensity distance in predicting dust concentration. When only fitting the relationship between light intensity and concentration, each algorithm gave the result with R2 of about 0.9000, while Kolmogorov–Arnold Networks (KAN) had the best prediction result 0.9076. When light intensity distance was added, the prediction accuracy of each algorithm was improved correspondingly. Light Gradient Boosting Machine (LightGBM) methods have the best performance (R2: 0.9500), and KAN followed (R2: 0.9472). We added another set of <span><math><msub><mrow><mi>SiO</mi></mrow><mrow><mn>2</mn></mrow></msub></math></span> experiments and sensitivity analyses for light intensity distance to demonstrate the applicability of the model. Finally, LightGBM was used to predict the dust concentration distribution in the whole process.</div></div>\",\"PeriodicalId\":310,\"journal\":{\"name\":\"Environmental Modelling & Software\",\"volume\":\"193 \",\"pages\":\"Article 106620\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-07-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Modelling & Software\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1364815225003044\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Modelling & Software","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1364815225003044","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Machine learning-based methods for dust concentration distribution prediction by utilizing back scattering images
Dust is a significant environmental pollutant that poses serious risks to human health, so its proper monitoring is quite significant. However, traditional light scattering single-point measurements are limited to reflecting the dust concentration distribution in a large-scale environment. In this paper, we introduced the light intensity distance and obtained a large-scale dust concentration distribution prediction model by analyzing the relationship between dust concentration and the attenuation of light intensity caused by dust scattering and absorption in the optical path. Fifteen classical machine learning algorithms were applied, which proved the importance of light intensity distance in predicting dust concentration. When only fitting the relationship between light intensity and concentration, each algorithm gave the result with R2 of about 0.9000, while Kolmogorov–Arnold Networks (KAN) had the best prediction result 0.9076. When light intensity distance was added, the prediction accuracy of each algorithm was improved correspondingly. Light Gradient Boosting Machine (LightGBM) methods have the best performance (R2: 0.9500), and KAN followed (R2: 0.9472). We added another set of experiments and sensitivity analyses for light intensity distance to demonstrate the applicability of the model. Finally, LightGBM was used to predict the dust concentration distribution in the whole process.
期刊介绍:
Environmental Modelling & Software publishes contributions, in the form of research articles, reviews and short communications, on recent advances in environmental modelling and/or software. The aim is to improve our capacity to represent, understand, predict or manage the behaviour of environmental systems at all practical scales, and to communicate those improvements to a wide scientific and professional audience.