{"title":"望加锡市空气污染预测的数据挖掘方法","authors":"Nur Aini, M. S. Mustafa","doi":"10.1109/ICORIS50180.2020.9320800","DOIUrl":null,"url":null,"abstract":"Air pollution level in Makassar has increased based on data from 2018 to 2019. There were 646 data obtained from the Ministry of Environment and Forestry data archive through the official site, there as five variables in data training, particulate matter (PM10), sulfur dioxide (SO2), carbon monoxide (CO), nitrogen dioxide (NO2), and ozone (O3). Lack of information on air pollution causes the people become unaware on their personal health. There is an effective analysis method for exploring data. This research used knowledge discovery technique in databases in data mining to facilitate decision making. Finally, continuing from the results of previous studies where the prediction of air pollution levels used Naïve Bayes algorithm, this research predicts the level of air pollution using the K-Nearest Neighbor Algorithm to classification data testing and data training with an accuracy rate of 96%, a precision value of 97% and also a recall value of 100%.","PeriodicalId":280589,"journal":{"name":"2020 2nd International Conference on Cybernetics and Intelligent System (ICORIS)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Data Mining Approach to Predict Air Pollution in Makassar\",\"authors\":\"Nur Aini, M. S. Mustafa\",\"doi\":\"10.1109/ICORIS50180.2020.9320800\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Air pollution level in Makassar has increased based on data from 2018 to 2019. There were 646 data obtained from the Ministry of Environment and Forestry data archive through the official site, there as five variables in data training, particulate matter (PM10), sulfur dioxide (SO2), carbon monoxide (CO), nitrogen dioxide (NO2), and ozone (O3). Lack of information on air pollution causes the people become unaware on their personal health. There is an effective analysis method for exploring data. This research used knowledge discovery technique in databases in data mining to facilitate decision making. Finally, continuing from the results of previous studies where the prediction of air pollution levels used Naïve Bayes algorithm, this research predicts the level of air pollution using the K-Nearest Neighbor Algorithm to classification data testing and data training with an accuracy rate of 96%, a precision value of 97% and also a recall value of 100%.\",\"PeriodicalId\":280589,\"journal\":{\"name\":\"2020 2nd International Conference on Cybernetics and Intelligent System (ICORIS)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 2nd International Conference on Cybernetics and Intelligent System (ICORIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICORIS50180.2020.9320800\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd International Conference on Cybernetics and Intelligent System (ICORIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICORIS50180.2020.9320800","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Data Mining Approach to Predict Air Pollution in Makassar
Air pollution level in Makassar has increased based on data from 2018 to 2019. There were 646 data obtained from the Ministry of Environment and Forestry data archive through the official site, there as five variables in data training, particulate matter (PM10), sulfur dioxide (SO2), carbon monoxide (CO), nitrogen dioxide (NO2), and ozone (O3). Lack of information on air pollution causes the people become unaware on their personal health. There is an effective analysis method for exploring data. This research used knowledge discovery technique in databases in data mining to facilitate decision making. Finally, continuing from the results of previous studies where the prediction of air pollution levels used Naïve Bayes algorithm, this research predicts the level of air pollution using the K-Nearest Neighbor Algorithm to classification data testing and data training with an accuracy rate of 96%, a precision value of 97% and also a recall value of 100%.