{"title":"Semi-Automatic Category Estimation and Data Augmentation for Opinion Extraction of Product Components","authors":"Shogo Anda, Masato Kikuchi, Tadachika Ozono","doi":"10.52731/ijskm.v7.i2.807","DOIUrl":null,"url":null,"abstract":"When customers purchase a product online, they use reviews to gather information about that product to help them make a purchase decision. Aspect-based Sentiment Analysis is a task that analyzes the review content from various perspectives, including the product itself, its components, and its retail outlets. We focus on comparing the characteristics of each component in a product with those of other products at the time of purchase. We define a task called component-based sentiment analysis (CBSA), which analyzes the review content from the perspective of only each component in the product. The CBSA task consists of opinion target extraction and polarity analysis. We approach that task with a classifier. We describe a semi-automatic category determination method for creating classification labels for CBSA and a data augmentation method to improve its classification performance. In experiments, we show that our category determination method can generate categories that cover 95% of the existing categories on e-commerce sites and that our data augmentation method improves the macro-F1-measure for uncommon opinions by 10%.","PeriodicalId":487422,"journal":{"name":"International journal of service and knowledge management","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of service and knowledge management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52731/ijskm.v7.i2.807","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
When customers purchase a product online, they use reviews to gather information about that product to help them make a purchase decision. Aspect-based Sentiment Analysis is a task that analyzes the review content from various perspectives, including the product itself, its components, and its retail outlets. We focus on comparing the characteristics of each component in a product with those of other products at the time of purchase. We define a task called component-based sentiment analysis (CBSA), which analyzes the review content from the perspective of only each component in the product. The CBSA task consists of opinion target extraction and polarity analysis. We approach that task with a classifier. We describe a semi-automatic category determination method for creating classification labels for CBSA and a data augmentation method to improve its classification performance. In experiments, we show that our category determination method can generate categories that cover 95% of the existing categories on e-commerce sites and that our data augmentation method improves the macro-F1-measure for uncommon opinions by 10%.