Performance Evaluation of n-Grams Ratio Features in Solving Multi-Classes Classification Problems

T. Abidin, Nur Ratna Sari, Ahmad Zuhri Ramadhan, Irvanizam, R. P. F. Afidh
{"title":"Performance Evaluation of n-Grams Ratio Features in Solving Multi-Classes Classification Problems","authors":"T. Abidin, Nur Ratna Sari, Ahmad Zuhri Ramadhan, Irvanizam, R. P. F. Afidh","doi":"10.1109/ICITEED.2018.8534773","DOIUrl":null,"url":null,"abstract":"We present experimental results that compare k-Nearest Neighbors (k-NN) and Support Vector Machines (SVM) algorithms to classifythe natural disasters multi-classes problem when n-grams ratio is used as the numerical features and compare three SVM approaches to classify the transportation accidents multi-classes problem when the same n-grams ratio is used as the features. In the former problem, we would like to investigate which of the two prominent algorithms have a better accuracy, while in thelatter problem, we would like to compare which of the three well-known SVM approaches for solving multi-classes problem performs best.In the natural disasters problem, the class labels are earthquakes, volcanic eruptions, flooding, landslides, and others, while in the transportation accidents problem, the categories are traffic collisions, maritime accidents, aviation accidents, and others. n-grams dictionaries of each category are used as the references in creating numerical features of the news articles. The results show that for the natural disasters problem, k-NN performs better than SVM and for the transportation accidents problem, DAGSVMoutperforms the other two SVM binary classification approaches.","PeriodicalId":142523,"journal":{"name":"2018 10th International Conference on Information Technology and Electrical Engineering (ICITEE)","volume":"755 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 10th International Conference on Information Technology and Electrical Engineering (ICITEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITEED.2018.8534773","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

We present experimental results that compare k-Nearest Neighbors (k-NN) and Support Vector Machines (SVM) algorithms to classifythe natural disasters multi-classes problem when n-grams ratio is used as the numerical features and compare three SVM approaches to classify the transportation accidents multi-classes problem when the same n-grams ratio is used as the features. In the former problem, we would like to investigate which of the two prominent algorithms have a better accuracy, while in thelatter problem, we would like to compare which of the three well-known SVM approaches for solving multi-classes problem performs best.In the natural disasters problem, the class labels are earthquakes, volcanic eruptions, flooding, landslides, and others, while in the transportation accidents problem, the categories are traffic collisions, maritime accidents, aviation accidents, and others. n-grams dictionaries of each category are used as the references in creating numerical features of the news articles. The results show that for the natural disasters problem, k-NN performs better than SVM and for the transportation accidents problem, DAGSVMoutperforms the other two SVM binary classification approaches.
n-Grams比率特征在求解多类分类问题中的性能评价
实验结果比较了k-最近邻(k-NN)算法和支持向量机(SVM)算法在以n-grams比率作为数值特征时对自然灾害多类问题的分类,并比较了三种支持向量机算法在以相同n-grams比率作为特征时对交通事故多类问题的分类。在前一个问题中,我们想研究两种突出的算法中哪一种具有更好的准确性,而在后一个问题中,我们想比较三种众所周知的支持向量机方法中哪一种用于解决多类问题表现最好。在自然灾害问题中,类标签是地震、火山爆发、洪水、山体滑坡等,而在交通事故问题中,类标签是交通碰撞、海上事故、航空事故等。在创建新闻文章的数字特征时,使用每个类别的N-grams字典作为参考。结果表明,对于自然灾害问题,k-NN优于SVM;对于交通事故问题,dagsvm优于其他两种SVM二分类方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信