{"title":"MESN: A multimodal knowledge graph embedding framework with expert fusion and relational attention","authors":"Ban Tran , Thanh Le","doi":"10.1016/j.knosys.2025.113541","DOIUrl":null,"url":null,"abstract":"<div><div>Knowledge graph embedding is essential for knowledge graph completion and downstream applications. However, in multimodal knowledge graphs, this task is particularly challenging due to incomplete and noisy multimodal data, which often fails to capture semantic relationships between entities. While existing methods attempt to integrate multimodal features, they frequently overlook relational semantics and cross-modal dependencies, leading to suboptimal entity representations. To address these limitations, we propose MESN, a novel multimodal embedding framework that integrates relational and multimodal signals through semantic aggregation and neighbor-aware attention mechanisms. MESN selectively extracts informative multimodal features via adaptive attention and expert-driven learning, ensuring more expressive entity embeddings. Additionally, we introduce an enhanced ComplEx-based scoring function, which effectively combines structured graph interactions with multimodal information, capturing both relational and feature diversity. Extensive experiments on standard multimodal datasets confirm that MESN significantly outperforms baselines across multiple evaluation metrics. Our findings highlight the importance of relational guidance in multimodal embedding tasks, paving the way for more robust and semantically-aware knowledge representations.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"318 ","pages":"Article 113541"},"PeriodicalIF":7.2000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125005878","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Knowledge graph embedding is essential for knowledge graph completion and downstream applications. However, in multimodal knowledge graphs, this task is particularly challenging due to incomplete and noisy multimodal data, which often fails to capture semantic relationships between entities. While existing methods attempt to integrate multimodal features, they frequently overlook relational semantics and cross-modal dependencies, leading to suboptimal entity representations. To address these limitations, we propose MESN, a novel multimodal embedding framework that integrates relational and multimodal signals through semantic aggregation and neighbor-aware attention mechanisms. MESN selectively extracts informative multimodal features via adaptive attention and expert-driven learning, ensuring more expressive entity embeddings. Additionally, we introduce an enhanced ComplEx-based scoring function, which effectively combines structured graph interactions with multimodal information, capturing both relational and feature diversity. Extensive experiments on standard multimodal datasets confirm that MESN significantly outperforms baselines across multiple evaluation metrics. Our findings highlight the importance of relational guidance in multimodal embedding tasks, paving the way for more robust and semantically-aware knowledge representations.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.