{"title":"Extraction of Trend Keywords from Thai Twitters using N-Gram Word Combination","authors":"Tanatorn Tanantong, Sasitorn Kreangkriwanich, Nasith Laosen","doi":"10.1109/ecti-con49241.2020.9158061","DOIUrl":null,"url":null,"abstract":"Extracting keywords from text on social media facilitates people to update news and trends. It reduces time spent for identifying main content from huge amount of data, and it can be used to identify situations or events that most of people mention in each period of time. This paper proposes a method for extracting keywords from Thai text on social media. A N-gram-based word-combination technique is presented to segment words that are not in dictionaries and increase the precision of word segmentation. Posts on Twitter concerning universities in Thailand are used as a case study for extracting keywords and analyzing trends. The experimental results show that the proposed method yield the highest precision of 70%.","PeriodicalId":371552,"journal":{"name":"2020 17th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 17th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ecti-con49241.2020.9158061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Extracting keywords from text on social media facilitates people to update news and trends. It reduces time spent for identifying main content from huge amount of data, and it can be used to identify situations or events that most of people mention in each period of time. This paper proposes a method for extracting keywords from Thai text on social media. A N-gram-based word-combination technique is presented to segment words that are not in dictionaries and increase the precision of word segmentation. Posts on Twitter concerning universities in Thailand are used as a case study for extracting keywords and analyzing trends. The experimental results show that the proposed method yield the highest precision of 70%.