{"title":"印尼Twitter文本权威分类在万隆政府","authors":"Janice Laksana, A. Purwarianti","doi":"10.1109/ICAICTA.2014.7005928","DOIUrl":null,"url":null,"abstract":"Nowadays, social media based complaint management systems have been deployed in several countries and cities including Bandung. We proposed an automatic authority classification for Twitter text in Indonesian as part of the complaint management system. Our analysis showed that there are several Twitter message types raised in official account Twitter of the city government. The classification employed a statistical based multi-label text classification. Here, we compared several techniques in the classification such as the features, the algorithms and the classification schemes. In the features comparison, we examined several features such as the complaint word feature, n-gram feature, and the @username feature. In the algorithms comparison, we employed Decision Tree algorithm, Naïve Bayes algorithm, and Support Vector Machine algorithm with multi-label classification techniques of Binary Relevance and Label Power Set. In the complaint classification schemes, we compared the direct classification and two steps classification. Using 2244 twitter texts from twitter of Bandung city government and 5-fold cross validation, the best experimental result of 70.90% accuracy was achieved by the feature combination of 1-gram and complaint word, with Support Vector Machine and Label Power Set as the algorithm, in the direct scheme of text classification.","PeriodicalId":173600,"journal":{"name":"2014 International Conference of Advanced Informatics: Concept, Theory and Application (ICAICTA)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Indonesian Twitter text authority classification for government in Bandung\",\"authors\":\"Janice Laksana, A. Purwarianti\",\"doi\":\"10.1109/ICAICTA.2014.7005928\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nowadays, social media based complaint management systems have been deployed in several countries and cities including Bandung. We proposed an automatic authority classification for Twitter text in Indonesian as part of the complaint management system. Our analysis showed that there are several Twitter message types raised in official account Twitter of the city government. The classification employed a statistical based multi-label text classification. Here, we compared several techniques in the classification such as the features, the algorithms and the classification schemes. In the features comparison, we examined several features such as the complaint word feature, n-gram feature, and the @username feature. In the algorithms comparison, we employed Decision Tree algorithm, Naïve Bayes algorithm, and Support Vector Machine algorithm with multi-label classification techniques of Binary Relevance and Label Power Set. In the complaint classification schemes, we compared the direct classification and two steps classification. Using 2244 twitter texts from twitter of Bandung city government and 5-fold cross validation, the best experimental result of 70.90% accuracy was achieved by the feature combination of 1-gram and complaint word, with Support Vector Machine and Label Power Set as the algorithm, in the direct scheme of text classification.\",\"PeriodicalId\":173600,\"journal\":{\"name\":\"2014 International Conference of Advanced Informatics: Concept, Theory and Application (ICAICTA)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference of Advanced Informatics: Concept, Theory and Application (ICAICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAICTA.2014.7005928\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference of Advanced Informatics: Concept, Theory and Application (ICAICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAICTA.2014.7005928","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Indonesian Twitter text authority classification for government in Bandung
Nowadays, social media based complaint management systems have been deployed in several countries and cities including Bandung. We proposed an automatic authority classification for Twitter text in Indonesian as part of the complaint management system. Our analysis showed that there are several Twitter message types raised in official account Twitter of the city government. The classification employed a statistical based multi-label text classification. Here, we compared several techniques in the classification such as the features, the algorithms and the classification schemes. In the features comparison, we examined several features such as the complaint word feature, n-gram feature, and the @username feature. In the algorithms comparison, we employed Decision Tree algorithm, Naïve Bayes algorithm, and Support Vector Machine algorithm with multi-label classification techniques of Binary Relevance and Label Power Set. In the complaint classification schemes, we compared the direct classification and two steps classification. Using 2244 twitter texts from twitter of Bandung city government and 5-fold cross validation, the best experimental result of 70.90% accuracy was achieved by the feature combination of 1-gram and complaint word, with Support Vector Machine and Label Power Set as the algorithm, in the direct scheme of text classification.