Weikang He, Yunpeng Xiao, Tun Li, Rong Wang, Qian Li
{"title":"通过多源数据融合构建行为意向预测的基础模型","authors":"Weikang He, Yunpeng Xiao, Tun Li, Rong Wang, Qian Li","doi":"10.1016/j.inffus.2025.103799","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, some studies have attempted to build foundation models to predict user behavioral intentions across various scenarios. Existing methods typically involve two steps: first, using Pre-trained Language Models (PLMs) to convert user historical behaviors into semantic vectors; second, capturing latent user intentions through a behavioral sequence encoder. However, limitations still exist in this pipeline. Firstly, most studies directly fix the parameters of PLMs, making it difficult to optimize the generated semantic vectors for specific application scenarios. Even with fine-tuning PLMs considered, the huge model size brings computational burdens. To address this challenge, we introduce a Low-Rank Adaptation technique, which enhances the quality of semantic vectors by training two lightweight low-rank matrices while keeping the original parameters of PLMs frozen. Additionally, the semantic vectors generated by PLMs often exhibit anisotropy, which weakens the expressive power. To address this, we design a parameter whitening expert and a mixed observation expert module to achieve more isotropic semantic representations. Finally, there are significant differences in user behavior patterns across different domains, and such behavioral conflicts can limit the model’s generalization ability. Therefore, during the pre-training phase, we introduce multi-domain negative sample fusion and design two contrastive learning tasks. By employing a multi-task learning strategy, we enhance the model’s ability to handle multi-domain data. With these improvements, our model achieves stronger adaptability and accuracy in predicting user behavior across different domains.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"127 ","pages":"Article 103799"},"PeriodicalIF":15.5000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards a foundation model for behavioral intention prediction through multi-source data fusion\",\"authors\":\"Weikang He, Yunpeng Xiao, Tun Li, Rong Wang, Qian Li\",\"doi\":\"10.1016/j.inffus.2025.103799\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In recent years, some studies have attempted to build foundation models to predict user behavioral intentions across various scenarios. Existing methods typically involve two steps: first, using Pre-trained Language Models (PLMs) to convert user historical behaviors into semantic vectors; second, capturing latent user intentions through a behavioral sequence encoder. However, limitations still exist in this pipeline. Firstly, most studies directly fix the parameters of PLMs, making it difficult to optimize the generated semantic vectors for specific application scenarios. Even with fine-tuning PLMs considered, the huge model size brings computational burdens. To address this challenge, we introduce a Low-Rank Adaptation technique, which enhances the quality of semantic vectors by training two lightweight low-rank matrices while keeping the original parameters of PLMs frozen. Additionally, the semantic vectors generated by PLMs often exhibit anisotropy, which weakens the expressive power. 
To address this, we design a parameter whitening expert and a mixed observation expert module to achieve more isotropic semantic representations. Finally, there are significant differences in user behavior patterns across different domains, and such behavioral conflicts can limit the model’s generalization ability. Therefore, during the pre-training phase, we introduce multi-domain negative sample fusion and design two contrastive learning tasks. By employing a multi-task learning strategy, we enhance the model’s ability to handle multi-domain data. With these improvements, our model achieves stronger adaptability and accuracy in predicting user behavior across different domains.</div></div>\",\"PeriodicalId\":50367,\"journal\":{\"name\":\"Information Fusion\",\"volume\":\"127 \",\"pages\":\"Article 103799\"},\"PeriodicalIF\":15.5000,\"publicationDate\":\"2025-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Fusion\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1566253525008619\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525008619","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Towards a foundation model for behavioral intention prediction through multi-source data fusion
In recent years, some studies have attempted to build foundation models that predict user behavioral intentions across various scenarios. Existing methods typically involve two steps: first, using Pre-trained Language Models (PLMs) to convert users' historical behaviors into semantic vectors; second, capturing latent user intentions with a behavioral sequence encoder. However, this pipeline still has limitations. First, most studies freeze the PLM parameters outright, making it difficult to optimize the generated semantic vectors for specific application scenarios; even when fine-tuning the PLM is considered, its large size imposes a heavy computational burden. To address this challenge, we introduce a Low-Rank Adaptation technique, which improves the quality of the semantic vectors by training two lightweight low-rank matrices while keeping the original PLM parameters frozen. Additionally, the semantic vectors produced by PLMs often exhibit anisotropy, which weakens their expressive power. To address this, we design a parameter whitening expert and a mixed observation expert module to obtain more isotropic semantic representations. Finally, user behavior patterns differ significantly across domains, and such behavioral conflicts can limit the model's generalization ability. Therefore, during the pre-training phase, we introduce multi-domain negative sample fusion and design two contrastive learning tasks, using a multi-task learning strategy to enhance the model's ability to handle multi-domain data. With these improvements, our model achieves stronger adaptability and accuracy in predicting user behavior across different domains.
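Of the techniques named in the abstract, Low-Rank Adaptation is the most concrete: a frozen weight matrix is augmented with the product of two small trainable matrices. The following PyTorch snippet is a minimal, illustrative sketch of that general idea, not the authors' implementation; the class name LoRALinear and the hyperparameters r and alpha are assumptions for illustration.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Wraps a frozen linear layer with a trainable low-rank update: W x + (alpha/r) * B A x."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # the original PLM weights stay frozen
        # two lightweight low-rank matrices; B is zero-initialized so training starts from the base model
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # frozen projection plus the lightweight low-rank correction
        return self.base(x) + (x @ self.A.t() @ self.B.t()) * self.scaling

# illustrative usage: adapt a 768-dimensional projection inside a frozen PLM
layer = LoRALinear(nn.Linear(768, 768), r=8, alpha=16.0)
```

Only A and B receive gradients, which is why such adapters sidestep the computational burden of fine-tuning the full PLM.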
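The pre-training stage combines multi-domain negative sample fusion with contrastive learning. One common way to realize such an objective is an in-batch InfoNCE loss in which the negatives for each anchor are drawn from sequences of every domain present in the batch; the sketch below illustrates that generic pattern under this assumption and is not the paper's exact loss (the function name multi_domain_info_nce and the temperature value are illustrative).

```python
import torch
import torch.nn.functional as F

def multi_domain_info_nce(anchor: torch.Tensor, positive: torch.Tensor,
                          temperature: float = 0.07) -> torch.Tensor:
    """In-batch InfoNCE: row i of `positive` is the positive for row i of `anchor`;
    all other rows of the (multi-domain) batch act as negatives, whatever domain they came from."""
    anchor = F.normalize(anchor, dim=-1)
    positive = F.normalize(positive, dim=-1)
    logits = anchor @ positive.t() / temperature                 # (B, B) similarity matrix
    labels = torch.arange(anchor.size(0), device=anchor.device)  # diagonal entries are the positives
    return F.cross_entropy(logits, labels)
```

Because the batch mixes sequences from several domains, every anchor is contrasted against behaviors from other domains as well as its own, which is the intuition behind fusing multi-domain negative samples.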
Journal Introduction:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among the diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers presenting fundamental theoretical analyses, as well as those demonstrating their application to real-world problems, are welcome.