Exploring User and Item Representation, Justification Generation, and Data Augmentation for Conversational Recommender Systems

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval Pub Date : 2023-07-18 DOI:10.1145/3539618.3591795

Sergey Volokhin

{"title":"Exploring User and Item Representation, Justification Generation, and Data Augmentation for Conversational Recommender Systems","authors":"Sergey Volokhin","doi":"10.1145/3539618.3591795","DOIUrl":null,"url":null,"abstract":"Conversational Recommender Systems (CRS) aim to provide personalized and contextualized recommendations through natural language conversations with users. The objective of my proposed dissertation is to capitalize on the recent developments in conversational interfaces to advance the field of Recommender Systems in several directions. I aim to address several problems in recommender systems: user and item representation, justification generation, and data sparsity. A critical challenge in CRS is learning effective representations of users and items that capture their preferences and characteristics. First, we focus on user representation, where we use a separate corpus of reviews to learn user representation. We attempt to map conversational users into the space of reviewers using semantic similarity between the conversation and the texts of reviews. Second, we improve item representation by incorporating textual features such as item descriptions into the user-item interaction graph, which captures a great deal of semantic and behavioral information unavailable from the purely topological structure of the interaction graph. Justifications for recommendations enhance the explainability and transparency of CRS; however, existing approaches, such as rule-based and template-based methods, have limitations. In this work, we propose an extractive method using a corpus of reviews to identify relevant information for generating concise and coherent justifications. We address the challenge of data scarcity for CRS by generating synthetic conversations using SOTA generative pre trained transformers (GPT). These synthetic conversations are used to augment the data used for training the CRS. In addition, we also evaluate if the GPTs exhibit emerging abilities of CRS (or a non-conversational RecSys) due to the large amount of data they are trained on, which potentially includes the reviews and opinions of users.","PeriodicalId":425056,"journal":{"name":"Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3539618.3591795","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Conversational Recommender Systems (CRS) aim to provide personalized and contextualized recommendations through natural language conversations with users. The objective of my proposed dissertation is to capitalize on the recent developments in conversational interfaces to advance the field of Recommender Systems in several directions. I aim to address several problems in recommender systems: user and item representation, justification generation, and data sparsity. A critical challenge in CRS is learning effective representations of users and items that capture their preferences and characteristics. First, we focus on user representation, where we use a separate corpus of reviews to learn user representation. We attempt to map conversational users into the space of reviewers using semantic similarity between the conversation and the texts of reviews. Second, we improve item representation by incorporating textual features such as item descriptions into the user-item interaction graph, which captures a great deal of semantic and behavioral information unavailable from the purely topological structure of the interaction graph. Justifications for recommendations enhance the explainability and transparency of CRS; however, existing approaches, such as rule-based and template-based methods, have limitations. In this work, we propose an extractive method using a corpus of reviews to identify relevant information for generating concise and coherent justifications. We address the challenge of data scarcity for CRS by generating synthetic conversations using SOTA generative pre trained transformers (GPT). These synthetic conversations are used to augment the data used for training the CRS. In addition, we also evaluate if the GPTs exhibit emerging abilities of CRS (or a non-conversational RecSys) due to the large amount of data they are trained on, which potentially includes the reviews and opinions of users.

查看原文本刊更多论文

探索会话推荐系统的用户和项目表示，证明生成和数据增强

会话推荐系统(CRS)旨在通过与用户的自然语言对话提供个性化和情境化的推荐。我提出的论文的目的是利用对话界面的最新发展，在几个方向上推进推荐系统领域。我的目标是解决推荐系统中的几个问题:用户和项目表示、证明生成和数据稀疏性。CRS中的一个关键挑战是学习用户和项目的有效表示，以捕获他们的偏好和特征。首先，我们关注用户表示，我们使用一个单独的评论语料库来学习用户表示。我们尝试使用会话和评论文本之间的语义相似性将会话用户映射到评论者的空间。其次，我们通过将项目描述等文本特征整合到用户-项目交互图中来改进项目表示，从而捕获了大量从纯拓扑结构的交互图中无法获得的语义和行为信息。提出合理化建议可提高CRS的可解释性和透明度;然而，现有的方法，如基于规则和基于模板的方法，都有局限性。在这项工作中，我们提出了一种使用评论语料库来识别相关信息的提取方法，以生成简洁连贯的理由。我们通过使用SOTA生成预训练转换器(GPT)生成合成会话来解决CRS数据稀缺的挑战。这些合成对话用于增强用于训练CRS的数据。此外，我们还评估了gpt是否表现出CRS(或非会话RecSys)的新兴能力，因为他们接受了大量的数据训练，其中可能包括用户的评论和意见。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

自引率

0.00%

发文量