Graph Embeddings for One-pass Processing of Heterogeneous Queries

Chi Thang Duong, Hongzhi Yin, Dung Hoang, Minn Hung Nguyen, M. Weidlich, Quoc Viet Hung Nguyen, K. Aberer
{"title":"Graph Embeddings for One-pass Processing of Heterogeneous Queries","authors":"Chi Thang Duong, Hongzhi Yin, Dung Hoang, Minn Hung Nguyen, M. Weidlich, Quoc Viet Hung Nguyen, K. Aberer","doi":"10.1109/ICDE48307.2020.00222","DOIUrl":null,"url":null,"abstract":"Effective information retrieval (IR) relies on the ability to comprehensively capture a user’s information needs. Traditional IR systems are limited to homogeneous queries that define the information to retrieve by a single modality. Support for heterogeneous queries that combine different modalities has been proposed recently. Yet, existing approaches for heterogeneous querying are computationally expensive, as they require several passes over the data to construct a query answer.In this paper, we propose an IR system that overcomes the computational challenges imposed by heterogeneous queries by adopting graph embeddings. Specifically, we propose graph-based models in which both, data and queries, incorporate information of different modalities. Then, we show how either representation is transformed into a graph embedding in the same space, capturing relations between information of different modalities. By grounding query processing in graph embeddings, we enable processing of heterogeneous queries with a single pass over the data representation. Our experiments on several real-world and synthetic datasets illustrate that our technique is able to return twice the amount of relevant information in comparison with several baselines, while being scalable to large-scale data.","PeriodicalId":6709,"journal":{"name":"2020 IEEE 36th International Conference on Data Engineering (ICDE)","volume":"6 1","pages":"1994-1997"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 36th International Conference on Data Engineering (ICDE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDE48307.2020.00222","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

Effective information retrieval (IR) relies on the ability to comprehensively capture a user’s information needs. Traditional IR systems are limited to homogeneous queries that define the information to retrieve by a single modality. Support for heterogeneous queries that combine different modalities has been proposed recently. Yet, existing approaches for heterogeneous querying are computationally expensive, as they require several passes over the data to construct a query answer.In this paper, we propose an IR system that overcomes the computational challenges imposed by heterogeneous queries by adopting graph embeddings. Specifically, we propose graph-based models in which both, data and queries, incorporate information of different modalities. Then, we show how either representation is transformed into a graph embedding in the same space, capturing relations between information of different modalities. By grounding query processing in graph embeddings, we enable processing of heterogeneous queries with a single pass over the data representation. Our experiments on several real-world and synthetic datasets illustrate that our technique is able to return twice the amount of relevant information in comparison with several baselines, while being scalable to large-scale data.
异构查询一次处理的图嵌入
有效的信息检索依赖于全面捕获用户信息需求的能力。传统的IR系统仅限于同构查询,这些查询定义了要通过单一模式检索的信息。最近提出了对结合不同模式的异构查询的支持。然而,现有的异构查询方法在计算上很昂贵,因为它们需要多次传递数据来构造查询答案。在本文中,我们提出了一种红外系统,通过采用图嵌入来克服异构查询带来的计算挑战。具体来说,我们提出了基于图的模型,其中数据和查询都包含不同模式的信息。然后,我们展示了如何将这两种表示转换为嵌入在同一空间中的图,捕获不同模态信息之间的关系。通过在图嵌入中进行查询处理,我们可以通过对数据表示的一次传递来处理异构查询。我们在几个真实世界和合成数据集上的实验表明,与几个基线相比,我们的技术能够返回两倍的相关信息,同时可扩展到大规模数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信