Kris Capao, Ken Gorro, Kim D. Gorro, M. J. Sabellano, C. Militante, Justin Paul C. Manalili
{"title":"Aspect Analysis of Cebu Establishments' Online Reviews using k-means Clustering and word2vec","authors":"Kris Capao, Ken Gorro, Kim D. Gorro, M. J. Sabellano, C. Militante, Justin Paul C. Manalili","doi":"10.1109/CCOMS.2018.8463246","DOIUrl":null,"url":null,"abstract":"Customer reviews are important part to any business. With the development of the technology, customer reviews are usually found on the internet. In this study, online reviews from different Cebu establishments were gathered using selenium as web scraper. A total of 3776 online reviews were gathered. Word2vec and k-means clustering were utilized to analyze and discover different online review corpora. To identify the best number of clusters, a series of experiments were conducted to find for the best Silhouette coefficient. For better analysis of k-means clustering, open coding was used to understand the significant qualitative codes. Based on the k-means clustering results, the following qualitative codes were identified: time, staff, friendly, service, affordable, love, food, price, ambiance, good, great, relax. Analyses of the clusters show that quality service, tasty and affordable food and good atmosphere are the significant aspect that the online reviews are concerned. Based on the word2vec results, the researchers focused on the following words: Waiters, relax, great, ambiance, service and tasty. The results of the study provide meaningful insights on the group of words obtained using the analogy to word2vec model, as well as the subject focus of the categories.","PeriodicalId":405664,"journal":{"name":"2018 3rd International Conference on Computer and Communication Systems (ICCCS)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 3rd International Conference on Computer and Communication Systems (ICCCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCOMS.2018.8463246","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Customer reviews are important part to any business. With the development of the technology, customer reviews are usually found on the internet. In this study, online reviews from different Cebu establishments were gathered using selenium as web scraper. A total of 3776 online reviews were gathered. Word2vec and k-means clustering were utilized to analyze and discover different online review corpora. To identify the best number of clusters, a series of experiments were conducted to find for the best Silhouette coefficient. For better analysis of k-means clustering, open coding was used to understand the significant qualitative codes. Based on the k-means clustering results, the following qualitative codes were identified: time, staff, friendly, service, affordable, love, food, price, ambiance, good, great, relax. Analyses of the clusters show that quality service, tasty and affordable food and good atmosphere are the significant aspect that the online reviews are concerned. Based on the word2vec results, the researchers focused on the following words: Waiters, relax, great, ambiance, service and tasty. The results of the study provide meaningful insights on the group of words obtained using the analogy to word2vec model, as well as the subject focus of the categories.