T. Abidin, Nur Ratna Sari, Ahmad Zuhri Ramadhan, Irvanizam, R. P. F. Afidh
{"title":"Performance Evaluation of n-Grams Ratio Features in Solving Multi-Classes Classification Problems","authors":"T. Abidin, Nur Ratna Sari, Ahmad Zuhri Ramadhan, Irvanizam, R. P. F. Afidh","doi":"10.1109/ICITEED.2018.8534773","DOIUrl":null,"url":null,"abstract":"We present experimental results that compare k-Nearest Neighbors (k-NN) and Support Vector Machines (SVM) algorithms to classifythe natural disasters multi-classes problem when n-grams ratio is used as the numerical features and compare three SVM approaches to classify the transportation accidents multi-classes problem when the same n-grams ratio is used as the features. In the former problem, we would like to investigate which of the two prominent algorithms have a better accuracy, while in thelatter problem, we would like to compare which of the three well-known SVM approaches for solving multi-classes problem performs best.In the natural disasters problem, the class labels are earthquakes, volcanic eruptions, flooding, landslides, and others, while in the transportation accidents problem, the categories are traffic collisions, maritime accidents, aviation accidents, and others. n-grams dictionaries of each category are used as the references in creating numerical features of the news articles. The results show that for the natural disasters problem, k-NN performs better than SVM and for the transportation accidents problem, DAGSVMoutperforms the other two SVM binary classification approaches.","PeriodicalId":142523,"journal":{"name":"2018 10th International Conference on Information Technology and Electrical Engineering (ICITEE)","volume":"755 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 10th International Conference on Information Technology and Electrical Engineering (ICITEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITEED.2018.8534773","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
We present experimental results that compare k-Nearest Neighbors (k-NN) and Support Vector Machines (SVM) algorithms to classifythe natural disasters multi-classes problem when n-grams ratio is used as the numerical features and compare three SVM approaches to classify the transportation accidents multi-classes problem when the same n-grams ratio is used as the features. In the former problem, we would like to investigate which of the two prominent algorithms have a better accuracy, while in thelatter problem, we would like to compare which of the three well-known SVM approaches for solving multi-classes problem performs best.In the natural disasters problem, the class labels are earthquakes, volcanic eruptions, flooding, landslides, and others, while in the transportation accidents problem, the categories are traffic collisions, maritime accidents, aviation accidents, and others. n-grams dictionaries of each category are used as the references in creating numerical features of the news articles. The results show that for the natural disasters problem, k-NN performs better than SVM and for the transportation accidents problem, DAGSVMoutperforms the other two SVM binary classification approaches.