An Adaptive Entire-Space Multi-Scenario Multi-Task Transfer Learning Model for Recommendations

IF 8.9 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Transactions on Knowledge and Data Engineering Pub Date : 2025-01-30 DOI:10.1109/TKDE.2025.3536334

Qingqing Yi;Jingjing Tang;Xiangyu Zhao;Yujian Zeng;Zengchun Song;Jia Wu

{"title":"An Adaptive Entire-Space Multi-Scenario Multi-Task Transfer Learning Model for Recommendations","authors":"Qingqing Yi;Jingjing Tang;Xiangyu Zhao;Yujian Zeng;Zengchun Song;Jia Wu","doi":"10.1109/TKDE.2025.3536334","DOIUrl":null,"url":null,"abstract":"Multi-scenario and multi-task recommendation systems efficiently facilitate knowledge transfer across different scenarios and tasks. However, many existing approaches inadequately incorporate personalized information across users and scenarios. Moreover, the conversion rate (CVR) task in multi-task learning often encounters challenges like sample selection bias, resulting from systematic differences between the training and inference sample spaces, and data sparsity due to infrequent clicks. To address these issues, we propose Adaptive Entire-space Multi-scenario Multi-task Transfer Learning model (AEM<inline-formula><tex-math>$^{2}$</tex-math></inline-formula>TL) with four key modules: 1) Scenario-CGC (Scenario-Customized Gate Control), 2) Task-CGC (Task-Customized Gate Control), 3) Personalized Gating Network, and 4) Entire-space Supervised Multi-Task Module. AEM<inline-formula><tex-math>$^{2}$</tex-math></inline-formula>TL employs a multi-gate mechanism to effectively integrate shared and specific information across scenarios and tasks, enhancing prediction adaptability. To further improve task-specific personalization, it incorporates personalized prior features and applies a gating mechanism that dynamically scales the top-layer neural units. A novel post-impression behavior decomposition technique is designed to leverage all impression samples across the entire space, mitigating sample selection bias and data sparsity. Furthermore, an adaptive weighting mechanism dynamically allocates attention to tasks based on their relative importance, ensuring optimal task prioritization. Extensive experiments on one industrial and two real-world public datasets indicate the superiority of AEM<inline-formula><tex-math>$^{2}$</tex-math></inline-formula>TL over state-of-the-art methods.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"37 4","pages":"1585-1598"},"PeriodicalIF":8.9000,"publicationDate":"2025-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Knowledge and Data Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10858412/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Multi-scenario and multi-task recommendation systems efficiently facilitate knowledge transfer across different scenarios and tasks. However, many existing approaches inadequately incorporate personalized information across users and scenarios. Moreover, the conversion rate (CVR) task in multi-task learning often encounters challenges like sample selection bias, resulting from systematic differences between the training and inference sample spaces, and data sparsity due to infrequent clicks. To address these issues, we propose Adaptive Entire-space Multi-scenario Multi-task Transfer Learning model (AEM

$^{2}$

TL) with four key modules: 1) Scenario-CGC (Scenario-Customized Gate Control), 2) Task-CGC (Task-Customized Gate Control), 3) Personalized Gating Network, and 4) Entire-space Supervised Multi-Task Module. AEM

$^{2}$

TL employs a multi-gate mechanism to effectively integrate shared and specific information across scenarios and tasks, enhancing prediction adaptability. To further improve task-specific personalization, it incorporates personalized prior features and applies a gating mechanism that dynamically scales the top-layer neural units. A novel post-impression behavior decomposition technique is designed to leverage all impression samples across the entire space, mitigating sample selection bias and data sparsity. Furthermore, an adaptive weighting mechanism dynamically allocates attention to tasks based on their relative importance, ensuring optimal task prioritization. Extensive experiments on one industrial and two real-world public datasets indicate the superiority of AEM

$^{2}$

TL over state-of-the-art methods.

查看原文本刊更多论文

自适应全空间多场景多任务推荐迁移学习模型

多场景多任务推荐系统可以有效地促进不同场景和任务之间的知识转移。然而，许多现有的方法不能充分地整合跨用户和场景的个性化信息。此外，在多任务学习中，转化率（CVR）任务经常会遇到样本选择偏差（由于训练样本空间和推理样本空间的系统性差异）以及由于不频繁点击导致的数据稀疏性等挑战。为了解决这些问题，我们提出了自适应全空间多场景多任务迁移学习模型（AEM$^{2}$TL），该模型具有四个关键模块：1)场景自定义门控制（Scenario-CGC）， 2)任务自定义门控制（Task-CGC）， 3)个性化门控网络和4)全空间监督多任务模块。AEM$^{2}$TL采用多门机制，有效地集成了跨场景和任务的共享和特定信息，增强了预测的适应性。为了进一步提高特定任务的个性化，它结合了个性化的先验特征，并应用了一种动态扩展顶层神经单元的门控机制。一种新的印象后行为分解技术旨在利用整个空间的所有印象样本，减轻样本选择偏差和数据稀疏性。此外，自适应加权机制根据任务的相对重要性动态分配注意力，确保任务的最优优先级。在一个工业和两个现实世界的公共数据集上进行的大量实验表明，AEM$^{2}$TL优于最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Knowledge and Data Engineering 工程技术-工程：电子与电气

CiteScore

11.70

自引率

3.40%

发文量

515

审稿时长

6 months

期刊介绍： The IEEE Transactions on Knowledge and Data Engineering encompasses knowledge and data engineering aspects within computer science, artificial intelligence, electrical engineering, computer engineering, and related fields. It provides an interdisciplinary platform for disseminating new developments in knowledge and data engineering and explores the practicality of these concepts in both hardware and software. Specific areas covered include knowledge-based and expert systems, AI techniques for knowledge and data management, tools, and methodologies, distributed processing, real-time systems, architectures, data management practices, database design, query languages, security, fault tolerance, statistical databases, algorithms, performance evaluation, and applications.