Multimodal Named Entity Recognition and Relation Extraction with Retrieval-Augmented Strategy

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval Pub Date : 2023-07-18 DOI:10.1145/3539618.3591790

Xuming Hu

引用次数: 0

Abstract

Multimodal Named Entity Recognition (MNER) and Multimodal Relation Extraction (MRE) are tasks in information retrieval that aim to recognize entities and extract relations among them using information from multiple modalities, such as text and images. Although current methods have attempted a variety of modality fusion approaches to enhance the information in text, a large amount of readily available internet retrieval data has not been considered. Therefore, we attempt to retrieve real-world text related to images, objects, and entire sentences from the internet and use this retrieved text as input for cross-modal fusion to improve the performance of entity and relation extraction tasks in the text.

查看原文本刊更多论文

基于检索增强策略的多模态命名实体识别与关系提取

多模态命名实体识别(MNER)和多模态关系提取(MRE)是利用文本和图像等多模态信息识别实体并提取实体之间关系的信息检索任务。虽然目前的方法尝试了多种情态融合方法来增强文本中的信息，但没有考虑到大量现成的互联网检索数据。因此，我们尝试从互联网上检索与图像、对象和整个句子相关的现实世界文本，并将这些检索到的文本作为跨模态融合的输入，以提高文本中实体和关系提取任务的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

自引率

0.00%

发文量