{"title":"Logistic回归与支持向量机在空气污染数据集预测中的比较","authors":"S. Mohammad, O. Hannon","doi":"10.33899/IQJOSS.2020.165445","DOIUrl":null,"url":null,"abstract":"Particular matter (PM10) studying and forecasting is necessary to control and reduce the damage of environment and human health. There are many pollutants as sources of air pollution (Co, So2, O 3, Nox, No, Wind Speed, and Ambient Temperature) may effect on PM10 variable. PM10 and the pollutant variables have been taken from the meteorological station in Kuala Lumpur, Malaysia. All of these variables classified as nonlinear data. Logistic regression (LR) model can be used for modeling and forecasting these multivariable datasets. LR is one of linear statistical methods, therefore it may reflect inaccurate results when used with nonlinear datasets. To improve the results of forecasting, support vector machine (SVM) method has been suggested in this study. The results in this study reflect outperforming for SVM method comparing to LR. In conclusion, SVM forecasting can be used for more accuracy with nonlinear multivariate datasets when PM10 is as dependent variable.","PeriodicalId":351789,"journal":{"name":"IRAQI JOURNAL OF STATISTICAL SCIENCES","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Comparisons between Logistic Regression and Support Vector Machine for Air Pollution Datasets Forecasting\",\"authors\":\"S. Mohammad, O. Hannon\",\"doi\":\"10.33899/IQJOSS.2020.165445\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Particular matter (PM10) studying and forecasting is necessary to control and reduce the damage of environment and human health. There are many pollutants as sources of air pollution (Co, So2, O 3, Nox, No, Wind Speed, and Ambient Temperature) may effect on PM10 variable. PM10 and the pollutant variables have been taken from the meteorological station in Kuala Lumpur, Malaysia. All of these variables classified as nonlinear data. Logistic regression (LR) model can be used for modeling and forecasting these multivariable datasets. LR is one of linear statistical methods, therefore it may reflect inaccurate results when used with nonlinear datasets. To improve the results of forecasting, support vector machine (SVM) method has been suggested in this study. The results in this study reflect outperforming for SVM method comparing to LR. In conclusion, SVM forecasting can be used for more accuracy with nonlinear multivariate datasets when PM10 is as dependent variable.\",\"PeriodicalId\":351789,\"journal\":{\"name\":\"IRAQI JOURNAL OF STATISTICAL SCIENCES\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IRAQI JOURNAL OF STATISTICAL SCIENCES\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.33899/IQJOSS.2020.165445\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IRAQI JOURNAL OF STATISTICAL SCIENCES","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33899/IQJOSS.2020.165445","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparisons between Logistic Regression and Support Vector Machine for Air Pollution Datasets Forecasting
Particular matter (PM10) studying and forecasting is necessary to control and reduce the damage of environment and human health. There are many pollutants as sources of air pollution (Co, So2, O 3, Nox, No, Wind Speed, and Ambient Temperature) may effect on PM10 variable. PM10 and the pollutant variables have been taken from the meteorological station in Kuala Lumpur, Malaysia. All of these variables classified as nonlinear data. Logistic regression (LR) model can be used for modeling and forecasting these multivariable datasets. LR is one of linear statistical methods, therefore it may reflect inaccurate results when used with nonlinear datasets. To improve the results of forecasting, support vector machine (SVM) method has been suggested in this study. The results in this study reflect outperforming for SVM method comparing to LR. In conclusion, SVM forecasting can be used for more accuracy with nonlinear multivariate datasets when PM10 is as dependent variable.