{"title":"A Model of Discovering Customer Insights in Tourism Sector Approach to Vietnamese Reviews Analytics","authors":"T. Le, Van-Ho Nguyen, T. Ho","doi":"10.1109/NICS56915.2022.10013410","DOIUrl":null,"url":null,"abstract":"Today, customer-centric business models are more focused. In addition, businesses increasingly own more data about customers and user behavior, especially from social networking platforms. However, the effective exploitation and use of this amount of data is still very limited. Based on those objective motivations, this research proposes a model as a foundation for analyzing and understanding customers to help businesses transform and transform this data into forms of knowledge to support the business, make intelligent business decisions. A data set of 90,513 reviews in Vietnamese from Agoda e-commerce websites in the tourism sector is collected and experimented. In experimental phase, raw corpus was cleaned by text preprocessing, then, the research will use the Net Promoter Score (NPS) to separate the data into three data sets according to three groups of Detractors, Passives and Promoters. From that, Latent Dirichlet Allocation (LDA) was built to extract the hidden topics in the input dataset. The experimental results show that the extracted topics and their features from corpus of customer reviews are reliable. The novelty in this research is to identify the topics that customers are interested in and complaining about.","PeriodicalId":381028,"journal":{"name":"2022 9th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 9th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS56915.2022.10013410","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Today, customer-centric business models are more focused. In addition, businesses increasingly own more data about customers and user behavior, especially from social networking platforms. However, the effective exploitation and use of this amount of data is still very limited. Based on those objective motivations, this research proposes a model as a foundation for analyzing and understanding customers to help businesses transform and transform this data into forms of knowledge to support the business, make intelligent business decisions. A data set of 90,513 reviews in Vietnamese from Agoda e-commerce websites in the tourism sector is collected and experimented. In experimental phase, raw corpus was cleaned by text preprocessing, then, the research will use the Net Promoter Score (NPS) to separate the data into three data sets according to three groups of Detractors, Passives and Promoters. From that, Latent Dirichlet Allocation (LDA) was built to extract the hidden topics in the input dataset. The experimental results show that the extracted topics and their features from corpus of customer reviews are reliable. The novelty in this research is to identify the topics that customers are interested in and complaining about.