{"title":"泰国-英语垃圾短信过滤","authors":"C. Khemapatapan","doi":"10.1109/APCC.2010.5679770","DOIUrl":null,"url":null,"abstract":"SMS spam filtering for Thai-English language has not previously been studied and implemented. Two methods of spam SMS message filtering objected to filter spam SMS messages written in Thai and English have been studied and implemented. The first method simply uses current spam English message filtering and then upgrades for Thai language support. The second one applies text normalization, word segmentation process, and analyzing/correcting the semantic of Thai words. However, both methods are applied by 2 different decision-making algorithms: Support Vector Machine (SVM) and Naive Bayesian (NB) algorithms. Finally, the results from trial applying in the real system are shown. The results show that the second filtering method has higher accuracy. Moreover, the SVM based filtering consumes more processing time than the NB based filtering about 2.5 times for both proposed methods.","PeriodicalId":402292,"journal":{"name":"2010 16th Asia-Pacific Conference on Communications (APCC)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Thai-English spam SMS filtering\",\"authors\":\"C. Khemapatapan\",\"doi\":\"10.1109/APCC.2010.5679770\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"SMS spam filtering for Thai-English language has not previously been studied and implemented. Two methods of spam SMS message filtering objected to filter spam SMS messages written in Thai and English have been studied and implemented. The first method simply uses current spam English message filtering and then upgrades for Thai language support. The second one applies text normalization, word segmentation process, and analyzing/correcting the semantic of Thai words. However, both methods are applied by 2 different decision-making algorithms: Support Vector Machine (SVM) and Naive Bayesian (NB) algorithms. Finally, the results from trial applying in the real system are shown. The results show that the second filtering method has higher accuracy. Moreover, the SVM based filtering consumes more processing time than the NB based filtering about 2.5 times for both proposed methods.\",\"PeriodicalId\":402292,\"journal\":{\"name\":\"2010 16th Asia-Pacific Conference on Communications (APCC)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 16th Asia-Pacific Conference on Communications (APCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APCC.2010.5679770\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 16th Asia-Pacific Conference on Communications (APCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APCC.2010.5679770","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
SMS spam filtering for Thai-English language has not previously been studied and implemented. Two methods of spam SMS message filtering objected to filter spam SMS messages written in Thai and English have been studied and implemented. The first method simply uses current spam English message filtering and then upgrades for Thai language support. The second one applies text normalization, word segmentation process, and analyzing/correcting the semantic of Thai words. However, both methods are applied by 2 different decision-making algorithms: Support Vector Machine (SVM) and Naive Bayesian (NB) algorithms. Finally, the results from trial applying in the real system are shown. The results show that the second filtering method has higher accuracy. Moreover, the SVM based filtering consumes more processing time than the NB based filtering about 2.5 times for both proposed methods.