N. Masseran, Razik Ridzuan Mohd Tajuddin, Mohd Talib Latif
{"title":"马来西亚不健康空气污染事件严重程度分类:决策树模型","authors":"N. Masseran, Razik Ridzuan Mohd Tajuddin, Mohd Talib Latif","doi":"10.17576/jsm-2023-5210-18","DOIUrl":null,"url":null,"abstract":"The application of data mining technique in dealing with real problems is popular and ubiquitous in various knowledge domains. This study proposes the concept of severity measures correspond to the characteristics of duration and intensity size for evaluating unhealthy air pollution events. In parallel with that, the present study also proposes a decision tree as a predictive model to deal with a binary classification corresponding to extreme and non-extreme unhealthy air pollution events, which is established based on threshold of the power-law behavior. In a similar vein, other characteristics, such as duration and intensity size, were also determined as important related features. A case study was conducted using the air pollution index data of Klang, Malaysia, from January 1st, 1997 to August 31st, 2020. The results found that the decision tree model can provide a high degree of precision and generalization with 100% accuracy in classifying a class for extreme and non-extreme events for the air pollution severity in the Klang area. In addition, a duration size is the most influential feature that leads to the occurrence of an extreme air pollution event. Thus, this study also suggests that authorities should exercise some vigilance precautions with respect to pollution incidents with a consecutive duration exceeding 11 hours.","PeriodicalId":21366,"journal":{"name":"Sains Malaysiana","volume":"193 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2023-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Classifying Severity of Unhealthy Air Pollution Events in Malaysia: A Decision Tree Model\",\"authors\":\"N. Masseran, Razik Ridzuan Mohd Tajuddin, Mohd Talib Latif\",\"doi\":\"10.17576/jsm-2023-5210-18\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The application of data mining technique in dealing with real problems is popular and ubiquitous in various knowledge domains. This study proposes the concept of severity measures correspond to the characteristics of duration and intensity size for evaluating unhealthy air pollution events. In parallel with that, the present study also proposes a decision tree as a predictive model to deal with a binary classification corresponding to extreme and non-extreme unhealthy air pollution events, which is established based on threshold of the power-law behavior. In a similar vein, other characteristics, such as duration and intensity size, were also determined as important related features. A case study was conducted using the air pollution index data of Klang, Malaysia, from January 1st, 1997 to August 31st, 2020. The results found that the decision tree model can provide a high degree of precision and generalization with 100% accuracy in classifying a class for extreme and non-extreme events for the air pollution severity in the Klang area. In addition, a duration size is the most influential feature that leads to the occurrence of an extreme air pollution event. Thus, this study also suggests that authorities should exercise some vigilance precautions with respect to pollution incidents with a consecutive duration exceeding 11 hours.\",\"PeriodicalId\":21366,\"journal\":{\"name\":\"Sains Malaysiana\",\"volume\":\"193 1\",\"pages\":\"\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2023-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Sains Malaysiana\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.17576/jsm-2023-5210-18\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sains Malaysiana","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.17576/jsm-2023-5210-18","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Classifying Severity of Unhealthy Air Pollution Events in Malaysia: A Decision Tree Model
The application of data mining technique in dealing with real problems is popular and ubiquitous in various knowledge domains. This study proposes the concept of severity measures correspond to the characteristics of duration and intensity size for evaluating unhealthy air pollution events. In parallel with that, the present study also proposes a decision tree as a predictive model to deal with a binary classification corresponding to extreme and non-extreme unhealthy air pollution events, which is established based on threshold of the power-law behavior. In a similar vein, other characteristics, such as duration and intensity size, were also determined as important related features. A case study was conducted using the air pollution index data of Klang, Malaysia, from January 1st, 1997 to August 31st, 2020. The results found that the decision tree model can provide a high degree of precision and generalization with 100% accuracy in classifying a class for extreme and non-extreme events for the air pollution severity in the Klang area. In addition, a duration size is the most influential feature that leads to the occurrence of an extreme air pollution event. Thus, this study also suggests that authorities should exercise some vigilance precautions with respect to pollution incidents with a consecutive duration exceeding 11 hours.
期刊介绍:
Sains Malaysiana is a refereed journal committed to the advancement of scholarly knowledge and research findings of the several branches of science and technology. It contains articles on Earth Sciences, Health Sciences, Life Sciences, Mathematical Sciences and Physical Sciences. The journal publishes articles, reviews, and research notes whose content and approach are of interest to a wide range of scholars. Sains Malaysiana is published by the UKM Press an its autonomous Editorial Board are drawn from the Faculty of Science and Technology, Universiti Kebangsaan Malaysia. In addition, distinguished scholars from local and foreign universities are appointed to serve as advisory board members and referees.