Aspect-Sentiment-Guided Opinion Summarization for User Need Elicitation From Online Reviews

Volume 2: 42nd Computers and Information in Engineering Conference (CIE) Pub Date : 2022-08-14 DOI:10.1115/detc2022-90108

Yi Han, Mohsen Moghaddam, Meet Tusharbhai Suthar, Gaurav Nanda

{"title":"Aspect-Sentiment-Guided Opinion Summarization for User Need Elicitation From Online Reviews","authors":"Yi Han, Mohsen Moghaddam, Meet Tusharbhai Suthar, Gaurav Nanda","doi":"10.1115/detc2022-90108","DOIUrl":null,"url":null,"abstract":"\n Extracting and analyzing informative user opinion from large-scale online reviews is a key success factor in product design processes. However, user reviews are naturally unstructured, noisy, and verbose. Recent advances in abstractive text summrization provide an unprecedented opportunity to systematically generate summaries of user opinions to facilitate need finding for designers. Yet, two main gaps in the state-of-the-art opinion summarization methods limit their applicability to the product design domain. First is the lack of capabilities to guide the generative process with respect to various product aspects and user sentiments (e.g., polarity, subjectivity), and the second gap is the lack of annotated training datasets for supervised learning. This paper tackles these gaps by (1) devising an efficient and scalable methodology for abstractive opinion summarization from online reviews guided by aspects terms and sentiment polarities, and (2) automatically generating a reusable synthetic training dataset that captures various degrees of granularity and polarity. The methodology contributes a multi-instance pooling model with aspect and sentiment information integrated (MAS), a synthetic data assembled using the results of the MAS model, and a fine-tuned pretrained sequence-to-sequence model “T5” for summary generation. Numerical experiments are conducted on a large dataset scraped from a major e-commerce retail store for sneakers to demonstrate the performance, feasibility, and potentials of the developed methodology. Several directions are provided for future exploration in the area of automated opinion summarization for user-centered product design.","PeriodicalId":382970,"journal":{"name":"Volume 2: 42nd Computers and Information in Engineering Conference (CIE)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 2: 42nd Computers and Information in Engineering Conference (CIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2022-90108","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Extracting and analyzing informative user opinion from large-scale online reviews is a key success factor in product design processes. However, user reviews are naturally unstructured, noisy, and verbose. Recent advances in abstractive text summrization provide an unprecedented opportunity to systematically generate summaries of user opinions to facilitate need finding for designers. Yet, two main gaps in the state-of-the-art opinion summarization methods limit their applicability to the product design domain. First is the lack of capabilities to guide the generative process with respect to various product aspects and user sentiments (e.g., polarity, subjectivity), and the second gap is the lack of annotated training datasets for supervised learning. This paper tackles these gaps by (1) devising an efficient and scalable methodology for abstractive opinion summarization from online reviews guided by aspects terms and sentiment polarities, and (2) automatically generating a reusable synthetic training dataset that captures various degrees of granularity and polarity. The methodology contributes a multi-instance pooling model with aspect and sentiment information integrated (MAS), a synthetic data assembled using the results of the MAS model, and a fine-tuned pretrained sequence-to-sequence model “T5” for summary generation. Numerical experiments are conducted on a large dataset scraped from a major e-commerce retail store for sneakers to demonstrate the performance, feasibility, and potentials of the developed methodology. Several directions are provided for future exploration in the area of automated opinion summarization for user-centered product design.

查看原文本刊更多论文

面向在线评论用户需求的面向情感导向的意见总结

从大规模在线评论中提取和分析信息丰富的用户意见是产品设计过程中成功的关键因素。然而，用户评论自然是非结构化的、嘈杂的和冗长的。抽象文本摘要的最新进展为系统地生成用户意见摘要提供了前所未有的机会，以促进设计师的需求发现。然而，在最先进的意见总结方法的两个主要差距限制了它们在产品设计领域的适用性。首先是缺乏针对各种产品方面和用户情感(例如极性、主观性)指导生成过程的能力，第二个差距是缺乏用于监督学习的带注释的训练数据集。本文通过(1)设计一种高效且可扩展的方法，以方面术语和情感极性为指导，从在线评论中抽象地总结意见，以及(2)自动生成捕获不同粒度和极性程度的可重用合成训练数据集来解决这些差距。该方法提供了一个集成了方面和情感信息(MAS)的多实例池化模型，一个使用MAS模型结果组装的合成数据，以及一个用于摘要生成的微调预训练序列到序列模型“T5”。数值实验是在一个大型数据集上进行的，该数据集来自一个主要的电子商务零售商店的运动鞋，以证明所开发的方法的性能，可行性和潜力。在以用户为中心的产品设计的自动意见总结领域，提出了未来探索的几个方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Volume 2: 42nd Computers and Information in Engineering Conference (CIE)

自引率

0.00%

发文量