跨模态信息检索的图卷积网络

Wirel. Commun. Mob. Comput. Pub Date : 2022-01-06 DOI:10.1155/2022/6133142

Xianben Yang, Wei Zhang

{"title":"跨模态信息检索的图卷积网络","authors":"Xianben Yang, Wei Zhang","doi":"10.1155/2022/6133142","DOIUrl":null,"url":null,"abstract":"In recent years, due to the wide application of deep learning and more modal research, the corresponding image retrieval system has gradually extended from traditional text retrieval to visual retrieval combined with images and has become the field of computer vision and natural language understanding and one of the important cross-research hotspots. This paper focuses on the research of graph convolutional networks for cross-modal information retrieval and has a general understanding of cross-modal information retrieval and the related theories of convolutional networks on the basis of literature data. Modal information retrieval is designed to combine high-level semantics with low-level visual capabilities in cross-modal information retrieval to improve the accuracy of information retrieval and then use experiments to verify the designed network model, and the result is that the model designed in this paper is more accurate than the traditional retrieval model, which is up to 90%.","PeriodicalId":23995,"journal":{"name":"Wirel. Commun. Mob. Comput.","volume":"202 1","pages":"6133142:1-6133142:8"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Graph Convolutional Networks for Cross-Modal Information Retrieval\",\"authors\":\"Xianben Yang, Wei Zhang\",\"doi\":\"10.1155/2022/6133142\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, due to the wide application of deep learning and more modal research, the corresponding image retrieval system has gradually extended from traditional text retrieval to visual retrieval combined with images and has become the field of computer vision and natural language understanding and one of the important cross-research hotspots. This paper focuses on the research of graph convolutional networks for cross-modal information retrieval and has a general understanding of cross-modal information retrieval and the related theories of convolutional networks on the basis of literature data. Modal information retrieval is designed to combine high-level semantics with low-level visual capabilities in cross-modal information retrieval to improve the accuracy of information retrieval and then use experiments to verify the designed network model, and the result is that the model designed in this paper is more accurate than the traditional retrieval model, which is up to 90%.\",\"PeriodicalId\":23995,\"journal\":{\"name\":\"Wirel. Commun. Mob. Comput.\",\"volume\":\"202 1\",\"pages\":\"6133142:1-6133142:8\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Wirel. Commun. Mob. Comput.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1155/2022/6133142\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wirel. Commun. Mob. Comput.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1155/2022/6133142","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

近年来，由于深度学习的广泛应用和模态研究的增多，相应的图像检索系统逐渐从传统的文本检索扩展到结合图像的视觉检索，成为计算机视觉和自然语言理解领域的重要交叉研究热点之一。本文主要研究面向跨模态信息检索的图卷积网络，在文献数据的基础上对跨模态信息检索和卷积网络的相关理论有一个大致的了解。模态信息检索旨在将跨模态信息检索中的高级语义与低级视觉能力相结合，以提高信息检索的准确性，然后通过实验验证所设计的网络模型，结果表明本文设计的模型比传统的检索模型准确率更高，达到90%以上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Graph Convolutional Networks for Cross-Modal Information Retrieval

In recent years, due to the wide application of deep learning and more modal research, the corresponding image retrieval system has gradually extended from traditional text retrieval to visual retrieval combined with images and has become the field of computer vision and natural language understanding and one of the important cross-research hotspots. This paper focuses on the research of graph convolutional networks for cross-modal information retrieval and has a general understanding of cross-modal information retrieval and the related theories of convolutional networks on the basis of literature data. Modal information retrieval is designed to combine high-level semantics with low-level visual capabilities in cross-modal information retrieval to improve the accuracy of information retrieval and then use experiments to verify the designed network model, and the result is that the model designed in this paper is more accurate than the traditional retrieval model, which is up to 90%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Wirel. Commun. Mob. Comput.

自引率

0.00%

发文量