Training-Free Graph-Based Imputation of Missing Modalities in Multimodal Recommendation

IF 10.4 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Transactions on Knowledge and Data Engineering Pub Date : 2026-03-01 Epub Date: 2026-03-03 DOI:10.1109/TKDE.2026.3667005

Daniele Malitesta;Emanuele Rossi;Claudio Pomo;Tommaso Di Noia;Fragkiskos D. Malliaros

{"title":"Training-Free Graph-Based Imputation of Missing Modalities in Multimodal Recommendation","authors":"Daniele Malitesta;Emanuele Rossi;Claudio Pomo;Tommaso Di Noia;Fragkiskos D. Malliaros","doi":"10.1109/TKDE.2026.3667005","DOIUrl":null,"url":null,"abstract":"Multimodal recommender systems (RSs) represent items in the catalog through multimodal data (e.g., product images and descriptions) that, in some cases, might be noisy or (even worse) missing. In those scenarios, the common practice is to drop items with missing modalities and train the multimodal RSs on a subsample of the original dataset. To date, the problem of missing modalities in multimodal recommendation has still received limited attention in the literature, lacking a precise formalisation as done with missing information in traditional machine learning. In this work, we first provide a problem formalisation for missing modalities in multimodal recommendation. Second, by leveraging the user-item graph structure, we re-cast the problem of missing multimodal information as a problem of graph features interpolation on the item-item co-purchase graph. On this basis, we propose four training-free approaches that propagate the available multimodal features throughout the item-item graph to impute the missing features. Extensive experiments on popular multimodal recommendation datasets demonstrate that our solutions can be seamlessly plugged into any existing multimodal RS and benchmarking framework while still preserving (or even widen) the performance gap between multimodal and traditional RSs. Moreover, we show that our graph-based techniques can perform better than traditional imputations in machine learning under different missing modalities settings. Finally, we analyse (for the first time in multimodal RSs) how feature homophily calculated on the item-item graph can influence our graph-based imputations.","PeriodicalId":13496,"journal":{"name":"IEEE Transactions on Knowledge and Data Engineering","volume":"38 5","pages":"3250-3263"},"PeriodicalIF":10.4000,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Knowledge and Data Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11420250/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/3/3 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Multimodal recommender systems (RSs) represent items in the catalog through multimodal data (e.g., product images and descriptions) that, in some cases, might be noisy or (even worse) missing. In those scenarios, the common practice is to drop items with missing modalities and train the multimodal RSs on a subsample of the original dataset. To date, the problem of missing modalities in multimodal recommendation has still received limited attention in the literature, lacking a precise formalisation as done with missing information in traditional machine learning. In this work, we first provide a problem formalisation for missing modalities in multimodal recommendation. Second, by leveraging the user-item graph structure, we re-cast the problem of missing multimodal information as a problem of graph features interpolation on the item-item co-purchase graph. On this basis, we propose four training-free approaches that propagate the available multimodal features throughout the item-item graph to impute the missing features. Extensive experiments on popular multimodal recommendation datasets demonstrate that our solutions can be seamlessly plugged into any existing multimodal RS and benchmarking framework while still preserving (or even widen) the performance gap between multimodal and traditional RSs. Moreover, we show that our graph-based techniques can perform better than traditional imputations in machine learning under different missing modalities settings. Finally, we analyse (for the first time in multimodal RSs) how feature homophily calculated on the item-item graph can influence our graph-based imputations.

查看原文本刊更多论文

多模态推荐中缺失模态的无训练图插值

多模式推荐系统（RSs）通过多模式数据（例如，产品图像和描述）表示目录中的项目，在某些情况下，这些数据可能有噪声或（更糟糕的是）丢失。在这些场景中，通常的做法是删除缺少模态的项，并在原始数据集的子样本上训练多模态RSs。迄今为止，多模态推荐中缺失模态的问题在文献中仍然受到有限的关注，缺乏传统机器学习中缺失信息的精确形式化。在这项工作中，我们首先提供了多模式推荐中缺失模式的问题形式化。其次，利用用户-物品图结构，将多模态信息缺失问题重新转化为物品-物品共同购买图上的图形特征插值问题。在此基础上，我们提出了四种无需训练的方法，将可用的多模态特征传播到整个item-item图中，以估算缺失的特征。在流行的多模态推荐数据集上进行的大量实验表明，我们的解决方案可以无缝地插入任何现有的多模态RSs和基准测试框架，同时仍然保持（甚至扩大）多模态RSs和传统RSs之间的性能差距。此外，我们表明，在不同的缺失模态设置下，我们的基于图的技术在机器学习中可以比传统的imputations表现得更好。最后，我们分析了（在多模态RSs中第一次）在项目-项目图上计算的特征同质性如何影响我们基于图的插值。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Knowledge and Data Engineering 工程技术-工程：电子与电气

CiteScore

11.70

自引率

3.40%

发文量

515

审稿时长

6 months

期刊介绍： The IEEE Transactions on Knowledge and Data Engineering encompasses knowledge and data engineering aspects within computer science, artificial intelligence, electrical engineering, computer engineering, and related fields. It provides an interdisciplinary platform for disseminating new developments in knowledge and data engineering and explores the practicality of these concepts in both hardware and software. Specific areas covered include knowledge-based and expert systems, AI techniques for knowledge and data management, tools, and methodologies, distributed processing, real-time systems, architectures, data management practices, database design, query languages, security, fault tolerance, statistical databases, algorithms, performance evaluation, and applications.