Wen-Jie Jia, Shu Zhang, Yingju Xia, Jie Zhang, Hao Yu
{"title":"A Novel Product Features Categorize Method Based on Twice-Clustering","authors":"Wen-Jie Jia, Shu Zhang, Yingju Xia, Jie Zhang, Hao Yu","doi":"10.1109/WISM.2010.71","DOIUrl":null,"url":null,"abstract":"Recently, the number of freely available online reviews is increasing in a high speed. More and more aspect base dopinion mining technique has been employed to find out customers' opinions. In this paper, we only focus on categorize product features that the customers have commented on. An unsupervised twice-clustering based product features categorization method is proposed. Opinion words in context of product features are chosen to represent the interrelationship among product features instead of full context information. The cluster result of active product features is used as constraints to improve the whole categorization quality. Our experimental results show that opinion words in context and their group information are very important features in measuring the semantic similarity of their associated product features. The twice-clustering strategy achieves better performance than single-clustering method.","PeriodicalId":119569,"journal":{"name":"2010 International Conference on Web Information Systems and Mining","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Web Information Systems and Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WISM.2010.71","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 16
Abstract
Recently, the number of freely available online reviews is increasing in a high speed. More and more aspect base dopinion mining technique has been employed to find out customers' opinions. In this paper, we only focus on categorize product features that the customers have commented on. An unsupervised twice-clustering based product features categorization method is proposed. Opinion words in context of product features are chosen to represent the interrelationship among product features instead of full context information. The cluster result of active product features is used as constraints to improve the whole categorization quality. Our experimental results show that opinion words in context and their group information are very important features in measuring the semantic similarity of their associated product features. The twice-clustering strategy achieves better performance than single-clustering method.