{"title":"Comparison of Machine Learning Classification Method on Text-based Case in Twitter","authors":"P. Telnoni, Reza Budiawan, Mutia Qana’a","doi":"10.1109/ICISS48059.2019.8969850","DOIUrl":null,"url":null,"abstract":"As Artificial Intelligence (AI) and Machine Learning (ML) gaining momentum on industry and academic field, a deeper understanding for AI and ML are highly required. One of the most popular sub-field in this field is text analysis. This paper will discuss the performance of classification methods for text-based data and give the best choices of classification method in term of accuracy and training time, so that will help ML enthusiast to build ML project that does not require high computational cost. This paper aimed to give recommendation to practitioner and academic about which classifier best for text classification. This paper will limit its study in supervised learning only. The tested algorithm will be Support Vector Machine, Logistic Regression, Naive Bayes, Random Forest, and K-Nearest Neighbor. To simplify the project, text will be labelled into single-label data, not multi-label. The test shows that SVM gives best result, in term of accuracy and training time among other methods.","PeriodicalId":125643,"journal":{"name":"2019 International Conference on ICT for Smart Society (ICISS)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on ICT for Smart Society (ICISS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISS48059.2019.8969850","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
As Artificial Intelligence (AI) and Machine Learning (ML) gaining momentum on industry and academic field, a deeper understanding for AI and ML are highly required. One of the most popular sub-field in this field is text analysis. This paper will discuss the performance of classification methods for text-based data and give the best choices of classification method in term of accuracy and training time, so that will help ML enthusiast to build ML project that does not require high computational cost. This paper aimed to give recommendation to practitioner and academic about which classifier best for text classification. This paper will limit its study in supervised learning only. The tested algorithm will be Support Vector Machine, Logistic Regression, Naive Bayes, Random Forest, and K-Nearest Neighbor. To simplify the project, text will be labelled into single-label data, not multi-label. The test shows that SVM gives best result, in term of accuracy and training time among other methods.