Sheila Maulida Intani, B. I. Nasution, M. E. Aminanto, Y. Nugraha, Nurhaya Muchtar, J. Kanggrawan
{"title":"Automating Public Complaint Classification Through JakLapor Channel: A Case Study of Jakarta, Indonesia","authors":"Sheila Maulida Intani, B. I. Nasution, M. E. Aminanto, Y. Nugraha, Nurhaya Muchtar, J. Kanggrawan","doi":"10.1109/ISC255366.2022.9922346","DOIUrl":null,"url":null,"abstract":"The DKI Jakarta provincial government is ready to support the digital transformation program with a series of digitally integrated policies. Residents of DKI Jakarta can now easily submit complaints about problems in their surrounding environment through the JakLapor service feature on the JAKI application. However, incoming reports are still manually classified. As a result, many citizens still report unsuitable complaints based on their category. This research aims to compare and find the best complaint classification model by applying multiple machine learning models to classify texts automatically. We also use feature extraction to see which model performs the best. This study employed Support Vector Machine (SVM), Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Adaptive Boosting (AdaBoost) algorithms as the machine learning model. Meanwhile, we use Count Vectorizer, Terms Frequency-Inverse Document Frequency (TF-IDF), N-Gram, and Latent Semantic Analysis (LSA) as the feature extraction algorithms. The classification results show that the Random Forest method model with TFIDF feature extraction is the most accurate and optimal model among the others, with a 90% accuracy rate.","PeriodicalId":277015,"journal":{"name":"2022 IEEE International Smart Cities Conference (ISC2)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Smart Cities Conference (ISC2)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISC255366.2022.9922346","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The DKI Jakarta provincial government is ready to support the digital transformation program with a series of digitally integrated policies. Residents of DKI Jakarta can now easily submit complaints about problems in their surrounding environment through the JakLapor service feature on the JAKI application. However, incoming reports are still manually classified. As a result, many citizens still report unsuitable complaints based on their category. This research aims to compare and find the best complaint classification model by applying multiple machine learning models to classify texts automatically. We also use feature extraction to see which model performs the best. This study employed Support Vector Machine (SVM), Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Adaptive Boosting (AdaBoost) algorithms as the machine learning model. Meanwhile, we use Count Vectorizer, Terms Frequency-Inverse Document Frequency (TF-IDF), N-Gram, and Latent Semantic Analysis (LSA) as the feature extraction algorithms. The classification results show that the Random Forest method model with TFIDF feature extraction is the most accurate and optimal model among the others, with a 90% accuracy rate.