CMAAN：下一个POI推荐的跨模态聚合关注网络

IF 4.5 2区计算机科学 Q1 COMPUTER SCIENCE, CYBERNETICS

IEEE Transactions on Computational Social Systems Pub Date : 2024-12-27 DOI:10.1109/TCSS.2024.3513947

Zhuang Zhuang;Lingbo Liu;Heng Qi;Yanming Shen;Baocai Yin

{"title":"CMAAN：下一个POI推荐的跨模态聚合关注网络","authors":"Zhuang Zhuang;Lingbo Liu;Heng Qi;Yanming Shen;Baocai Yin","doi":"10.1109/TCSS.2024.3513947","DOIUrl":null,"url":null,"abstract":"Next point-of-interest (POI) recommendation is to explore the historical check-in sequence information in location-based social networks (LBSNs) to recommend the next location that he/she might be interested in. However, most previous methods used only limited information of unimodal data (i.e., check-in sequences), while some recent methods have attempted to explore multimodal data (e.g., textual content) but lacked sufficient interactions between geographic behavior patterns and content behavior patterns. In this work, we argue that users usually consider geographical trajectories and textual content interdependently to determine the next location to visit. To this end, we propose a novel cross-modal aggregation attention network (CMAAN), which interactively learns multiview representations from POI sequence and content sequence for predicting the next POI. Our approach models inter-modal interaction correlations, intra-modal sequence correlations, and intra-modal semantic correlations simultaneously to fully discover contextual potential relations along the trajectories. Specifically, the intra-modal semantic correlations are able to capture the variable location functionalities under different contextual relationships of cross-modal interaction information. Moreover, we apply the aggregation attention to adaptively aggregate multiview representations which represent the comprehensive hidden state of the next POI. Extensive experiments on two large-scale datasets clearly demonstrate that our CMAAN achieves state-of-the-art performance.","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"12 3","pages":"1025-1037"},"PeriodicalIF":4.5000,"publicationDate":"2024-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CMAAN: Cross-Modal Aggregation Attention Network for Next POI Recommendation\",\"authors\":\"Zhuang Zhuang;Lingbo Liu;Heng Qi;Yanming Shen;Baocai Yin\",\"doi\":\"10.1109/TCSS.2024.3513947\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Next point-of-interest (POI) recommendation is to explore the historical check-in sequence information in location-based social networks (LBSNs) to recommend the next location that he/she might be interested in. However, most previous methods used only limited information of unimodal data (i.e., check-in sequences), while some recent methods have attempted to explore multimodal data (e.g., textual content) but lacked sufficient interactions between geographic behavior patterns and content behavior patterns. In this work, we argue that users usually consider geographical trajectories and textual content interdependently to determine the next location to visit. To this end, we propose a novel cross-modal aggregation attention network (CMAAN), which interactively learns multiview representations from POI sequence and content sequence for predicting the next POI. Our approach models inter-modal interaction correlations, intra-modal sequence correlations, and intra-modal semantic correlations simultaneously to fully discover contextual potential relations along the trajectories. Specifically, the intra-modal semantic correlations are able to capture the variable location functionalities under different contextual relationships of cross-modal interaction information. Moreover, we apply the aggregation attention to adaptively aggregate multiview representations which represent the comprehensive hidden state of the next POI. Extensive experiments on two large-scale datasets clearly demonstrate that our CMAAN achieves state-of-the-art performance.\",\"PeriodicalId\":13044,\"journal\":{\"name\":\"IEEE Transactions on Computational Social Systems\",\"volume\":\"12 3\",\"pages\":\"1025-1037\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2024-12-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Computational Social Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10817117/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, CYBERNETICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Computational Social Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10817117/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, CYBERNETICS","Score":null,"Total":0}

引用次数: 0

摘要

下一个兴趣点（POI）推荐是探索基于位置的社交网络（LBSNs）中的历史登记序列信息，以推荐他/她可能感兴趣的下一个位置。然而，以往的方法大多只利用了有限的单模态数据信息（如登记序列），而最近的一些方法试图探索多模态数据（如文本内容），但缺乏地理行为模式和内容行为模式之间的充分相互作用。在这项工作中，我们认为用户通常会相互依赖地考虑地理轨迹和文本内容来确定下一个访问的位置。为此，我们提出了一种新的跨模态聚合注意网络（CMAAN），该网络可以交互式地从POI序列和内容序列中学习多视图表示，以预测下一个POI。我们的方法同时建立了模态间交互关联、模态内序列关联和模态内语义关联的模型，以充分发现沿轨迹的上下文潜在关系。具体而言，模态内语义关联能够捕捉跨模态交互信息在不同语境关系下的可变定位功能。此外，我们将聚合注意力应用于自适应聚合多视图表示，这些表示表示下一个POI的综合隐藏状态。在两个大规模数据集上的大量实验清楚地表明，我们的CMAAN达到了最先进的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

CMAAN: Cross-Modal Aggregation Attention Network for Next POI Recommendation

Next point-of-interest (POI) recommendation is to explore the historical check-in sequence information in location-based social networks (LBSNs) to recommend the next location that he/she might be interested in. However, most previous methods used only limited information of unimodal data (i.e., check-in sequences), while some recent methods have attempted to explore multimodal data (e.g., textual content) but lacked sufficient interactions between geographic behavior patterns and content behavior patterns. In this work, we argue that users usually consider geographical trajectories and textual content interdependently to determine the next location to visit. To this end, we propose a novel cross-modal aggregation attention network (CMAAN), which interactively learns multiview representations from POI sequence and content sequence for predicting the next POI. Our approach models inter-modal interaction correlations, intra-modal sequence correlations, and intra-modal semantic correlations simultaneously to fully discover contextual potential relations along the trajectories. Specifically, the intra-modal semantic correlations are able to capture the variable location functionalities under different contextual relationships of cross-modal interaction information. Moreover, we apply the aggregation attention to adaptively aggregate multiview representations which represent the comprehensive hidden state of the next POI. Extensive experiments on two large-scale datasets clearly demonstrate that our CMAAN achieves state-of-the-art performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Computational Social Systems Social Sciences-Social Sciences (miscellaneous)

CiteScore

10.00

自引率

20.00%

发文量

316

期刊介绍： IEEE Transactions on Computational Social Systems focuses on such topics as modeling, simulation, analysis and understanding of social systems from the quantitative and/or computational perspective. "Systems" include man-man, man-machine and machine-machine organizations and adversarial situations as well as social media structures and their dynamics. More specifically, the proposed transactions publishes articles on modeling the dynamics of social systems, methodologies for incorporating and representing socio-cultural and behavioral aspects in computational modeling, analysis of social system behavior and structure, and paradigms for social systems modeling and simulation. The journal also features articles on social network dynamics, social intelligence and cognition, social systems design and architectures, socio-cultural modeling and representation, and computational behavior modeling, and their applications.