{"title":"基于概率潜在语义分析的Booking.com评论标题情感检测","authors":"D. Khotimah, R. Sarno","doi":"10.1109/ICOICT.2018.8528784","DOIUrl":null,"url":null,"abstract":"In a competitive and dynamic environment, many hotels compete to provide the best quality for customers. Hotel quality affects the brand image of a hotel marked from whether or not customers are satisfied. Booking.com website is chosen as the ideal data source as being able to utilize User Generated Content (UGC). UGC, serves as a data source ensuring the authenticity of data for customer reviews in Ponorogo, Indonesia. This experiment uses English text classification, to determine customer satisfaction and dissatisfaction based on the text they write on the title of customer testimony. The classification method used is PLSA and python programming language is used for preprocessing data. The data sampling is performed by data crawling using WebHarvy. The test results show that PLSA slightly outperforms previous research methods, namely LSA.","PeriodicalId":266335,"journal":{"name":"2018 6th International Conference on Information and Communication Technology (ICoICT)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"Sentiment Detection of Comment Titles in Booking.com Using Probabilistic Latent Semantic Analysis\",\"authors\":\"D. Khotimah, R. Sarno\",\"doi\":\"10.1109/ICOICT.2018.8528784\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In a competitive and dynamic environment, many hotels compete to provide the best quality for customers. Hotel quality affects the brand image of a hotel marked from whether or not customers are satisfied. Booking.com website is chosen as the ideal data source as being able to utilize User Generated Content (UGC). UGC, serves as a data source ensuring the authenticity of data for customer reviews in Ponorogo, Indonesia. This experiment uses English text classification, to determine customer satisfaction and dissatisfaction based on the text they write on the title of customer testimony. The classification method used is PLSA and python programming language is used for preprocessing data. The data sampling is performed by data crawling using WebHarvy. The test results show that PLSA slightly outperforms previous research methods, namely LSA.\",\"PeriodicalId\":266335,\"journal\":{\"name\":\"2018 6th International Conference on Information and Communication Technology (ICoICT)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 6th International Conference on Information and Communication Technology (ICoICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOICT.2018.8528784\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 6th International Conference on Information and Communication Technology (ICoICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOICT.2018.8528784","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Sentiment Detection of Comment Titles in Booking.com Using Probabilistic Latent Semantic Analysis
In a competitive and dynamic environment, many hotels compete to provide the best quality for customers. Hotel quality affects the brand image of a hotel marked from whether or not customers are satisfied. Booking.com website is chosen as the ideal data source as being able to utilize User Generated Content (UGC). UGC, serves as a data source ensuring the authenticity of data for customer reviews in Ponorogo, Indonesia. This experiment uses English text classification, to determine customer satisfaction and dissatisfaction based on the text they write on the title of customer testimony. The classification method used is PLSA and python programming language is used for preprocessing data. The data sampling is performed by data crawling using WebHarvy. The test results show that PLSA slightly outperforms previous research methods, namely LSA.