Product return prediction in live streaming e-commerce with cross-modal contrastive transformer

IF 6.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Decision Support Systems Pub Date : 2025-05-05 DOI:10.1016/j.dss.2025.114470

Wen Zhang , Rui Xie , Pei Quan , Zhenzhong Ma

{"title":"Product return prediction in live streaming e-commerce with cross-modal contrastive transformer","authors":"Wen Zhang , Rui Xie , Pei Quan , Zhenzhong Ma","doi":"10.1016/j.dss.2025.114470","DOIUrl":null,"url":null,"abstract":"<div><div>The live-streaming e-commerce industry is suffering heavy economic losses due to the high product return rate, which leads to rising logistics costs, greater inventory pressure, and unsatisfactory consumer experiences. Accurate product return prediction is highly desirable for the vendors to optimize their business operations in advance to reduce return-related costs. This paper proposes a novel approach, called Contraformer (Contrastive transformer), to predict product returns in live streaming e-commerce by leveraging fine-grained streamer behavior features extracted from three modalities (i.e., visual, acoustic, and language). The primary contribution lies in that we adopt Transformer with the encoder-decoder architecture with a novel class-supervised contrastive learning (CSCL) to fuse streamer behavior for multimodal representation alignment and inter-modal interaction characterization. By using a real-world dataset with 2584 product streamers and 864 items collected from Tiktok China live streaming platform, we demonstrate that the proposed Contrasformer approach outperforms the baseline methods in predicting product return rate with a 25 % reduction in terms of mean absolute error. This study offers great managerial implications for vendors to manage their practice in live streaming commerce.</div></div>","PeriodicalId":55181,"journal":{"name":"Decision Support Systems","volume":"194 ","pages":"Article 114470"},"PeriodicalIF":6.8000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Decision Support Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167923625000715","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The live-streaming e-commerce industry is suffering heavy economic losses due to the high product return rate, which leads to rising logistics costs, greater inventory pressure, and unsatisfactory consumer experiences. Accurate product return prediction is highly desirable for the vendors to optimize their business operations in advance to reduce return-related costs. This paper proposes a novel approach, called Contraformer (Contrastive transformer), to predict product returns in live streaming e-commerce by leveraging fine-grained streamer behavior features extracted from three modalities (i.e., visual, acoustic, and language). The primary contribution lies in that we adopt Transformer with the encoder-decoder architecture with a novel class-supervised contrastive learning (CSCL) to fuse streamer behavior for multimodal representation alignment and inter-modal interaction characterization. By using a real-world dataset with 2584 product streamers and 864 items collected from Tiktok China live streaming platform, we demonstrate that the proposed Contrasformer approach outperforms the baseline methods in predicting product return rate with a 25 % reduction in terms of mean absolute error. This study offers great managerial implications for vendors to manage their practice in live streaming commerce.

查看原文本刊更多论文

基于跨模态对比变压器的电商直播产品退货预测

由于产品退货率高，导致物流成本上升，库存压力加大，消费者体验不满意，直播电商行业遭受了严重的经济损失。准确的产品退货预测是供应商提前优化业务运营，降低退货相关成本的重要手段。本文提出了一种新的方法，称为contrasformer（对比变压器），通过利用从三种模式（即视觉、听觉和语言）中提取的细粒度流媒体行为特征，来预测实时流媒体电子商务中的产品回报。主要贡献在于我们采用了具有编码器-解码器架构的Transformer和一种新颖的类监督对比学习（CSCL），以融合多模态表示对齐和多模态交互表征的流媒体行为。通过使用从Tiktok中国直播平台收集的2584个产品流媒体和864个项目的真实数据集，我们证明了所提出的Contrasformer方法在预测产品退货率方面优于基线方法，平均绝对误差降低了25%。这项研究为供应商管理他们在直播商业中的实践提供了重要的管理启示。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Decision Support Systems 工程技术-计算机：人工智能

CiteScore

14.70

自引率

6.70%

发文量

119

审稿时长

13 months

期刊介绍： The common thread of articles published in Decision Support Systems is their relevance to theoretical and technical issues in the support of enhanced decision making. The areas addressed may include foundations, functionality, interfaces, implementation, impacts, and evaluation of decision support systems (DSSs).