{"title":"通过机器学习分析环境因素与灰尘积聚之间的关系","authors":"Komiljon Yakubov, R. Bazarbayev, Davron Qurbanov, Maksud Sharipov, Jamshid Masharipov, Smagul Karazhanov","doi":"10.1515/zpch-2023-0479","DOIUrl":null,"url":null,"abstract":"\n This study aims to explore the relationship between dust accumulation on a glass and various environmental factors including temperature, humidity, atmospheric pressure, and wind speed. The data was analyzed using Python, a popular language for data science and artificial intelligence, and regression algorithms from the scikit-learn library. The data was divided into training (80 %) and test (20 %) sets and different models were used, such as linear regression, decision tree, K-neighbor regression, random forest regression, and decision tree regression. The accuracy of the models was determined using R\n 2 scores, where a score of 1.0 indicates a perfect fit and negative values suggest that the model is worse than predicting the mean value. The accuracy of the selected models was calculated as a percentage by multiplying the obtained R\n 2 scores by 100. Graphs were used to visualise the data and determine the appropriate analysis model. The study found that the amount of dust is directly proportional to temperature and humidity. The accuracy levels of the linear models were suboptimal, leading to the use of nonlinear models like random forest regressor, decision tree regressor, and gradient boosting regressor, which showed improved performance.","PeriodicalId":506520,"journal":{"name":"Zeitschrift für Physikalische Chemie","volume":"136 26","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The relationship between environmental factors and dust accumulation by machine learning\",\"authors\":\"Komiljon Yakubov, R. Bazarbayev, Davron Qurbanov, Maksud Sharipov, Jamshid Masharipov, Smagul Karazhanov\",\"doi\":\"10.1515/zpch-2023-0479\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n This study aims to explore the relationship between dust accumulation on a glass and various environmental factors including temperature, humidity, atmospheric pressure, and wind speed. The data was analyzed using Python, a popular language for data science and artificial intelligence, and regression algorithms from the scikit-learn library. The data was divided into training (80 %) and test (20 %) sets and different models were used, such as linear regression, decision tree, K-neighbor regression, random forest regression, and decision tree regression. The accuracy of the models was determined using R\\n 2 scores, where a score of 1.0 indicates a perfect fit and negative values suggest that the model is worse than predicting the mean value. The accuracy of the selected models was calculated as a percentage by multiplying the obtained R\\n 2 scores by 100. Graphs were used to visualise the data and determine the appropriate analysis model. The study found that the amount of dust is directly proportional to temperature and humidity. The accuracy levels of the linear models were suboptimal, leading to the use of nonlinear models like random forest regressor, decision tree regressor, and gradient boosting regressor, which showed improved performance.\",\"PeriodicalId\":506520,\"journal\":{\"name\":\"Zeitschrift für Physikalische Chemie\",\"volume\":\"136 26\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Zeitschrift für Physikalische Chemie\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1515/zpch-2023-0479\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Zeitschrift für Physikalische Chemie","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/zpch-2023-0479","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本研究旨在探讨玻璃上的灰尘积累与温度、湿度、气压和风速等各种环境因素之间的关系。数据分析使用了数据科学和人工智能领域的流行语言 Python 和 scikit-learn 库中的回归算法。数据被分为训练集(80%)和测试集(20%),并使用了不同的模型,如线性回归、决策树、K-邻居回归、随机森林回归和决策树回归。模型的准确度使用 R 2 分数来确定,1.0 分表示完全拟合,负值表示模型比预测平均值差。所选模型的准确度是用 R 2 分数乘以 100 计算得出的百分比。图表用于直观显示数据并确定合适的分析模型。研究发现,灰尘量与温度和湿度成正比。线性模型的准确度不理想,因此使用了非线性模型,如随机森林回归模型、决策树回归模型和梯度提升回归模型,这些模型的性能有所提高。
The relationship between environmental factors and dust accumulation by machine learning
This study aims to explore the relationship between dust accumulation on a glass and various environmental factors including temperature, humidity, atmospheric pressure, and wind speed. The data was analyzed using Python, a popular language for data science and artificial intelligence, and regression algorithms from the scikit-learn library. The data was divided into training (80 %) and test (20 %) sets and different models were used, such as linear regression, decision tree, K-neighbor regression, random forest regression, and decision tree regression. The accuracy of the models was determined using R
2 scores, where a score of 1.0 indicates a perfect fit and negative values suggest that the model is worse than predicting the mean value. The accuracy of the selected models was calculated as a percentage by multiplying the obtained R
2 scores by 100. Graphs were used to visualise the data and determine the appropriate analysis model. The study found that the amount of dust is directly proportional to temperature and humidity. The accuracy levels of the linear models were suboptimal, leading to the use of nonlinear models like random forest regressor, decision tree regressor, and gradient boosting regressor, which showed improved performance.