RTA: A Framework for the Integration of Local and Relational Open Data

Yusuke Kosaka, Shu Murakami, Thomas Laurent, Kento Goto, Motomichi Toyama
Proceedings of the 21st International Database Engineering & Applications Symposium, published 2017-07-12
DOI: 10.1145/3105831.3105852 (https://doi.org/10.1145/3105831.3105852)

Abstract

There are currently massive amounts of public data, also referred to as open data, for example stock price data or weather data. However, such data is distributed in a variety of ways, such as downloadable CSV or XML files, or API calls to web services. Each data source thus requires a specific workflow, which makes processing and using the data a burden for users. This barrier to use diminishes the openness of the data. We therefore propose the Remote Table Access (RTA) system, a simple and safe architecture for publishing relational data, i.e. giving open read-only access to it, and for easily integrating it with the user's local data. RTA enables users to query relational open data and their own local data seamlessly through a single SQL query. To allow this, we designed a three-party architecture featuring a client-side application, an optional server-side module, and a "Public Table Library" (PTL). The client-side application processes the RTA query and fetches the necessary data; the server-side system acts as an agent between the remote database and the client, offering added security as well as scalability in terms of connections; and the PTL lists all the published data and stores its access information. We implemented an early prototype of this architecture as a proof of concept, validated it against two datasets, including data from the TPC-C benchmark, and made it available. Our results show the feasibility of RTA and a potentially significant reduction in query processing time, mainly because condition pushing and semijoins reduce the volume of data transmitted.
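The abstract credits the speed-up to condition pushing and semijoin, both of which cut the number of rows shipped from the remote database to the client. The sketch below illustrates that general idea only; it is not the RTA prototype. It assumes two in-memory SQLite databases standing in for the remote and local sides, and the table names (`stock_price`, `portfolio`), columns, and function names are all hypothetical.

```python
import sqlite3


def make_remote():
    """Stand-in for a published remote table (hypothetical schema and data)."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE stock_price (ticker TEXT, day TEXT, close REAL)")
    rows = [
        ("AAA", "2017-07-10", 10.0), ("AAA", "2017-07-11", 11.0),
        ("BBB", "2017-07-10", 20.0), ("BBB", "2017-07-11", 21.0),
        ("CCC", "2017-07-10", 30.0), ("CCC", "2017-07-11", 31.0),
    ]
    db.executemany("INSERT INTO stock_price VALUES (?, ?, ?)", rows)
    return db


def make_local():
    """Stand-in for the user's local database (hypothetical schema and data)."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE portfolio (ticker TEXT, shares INTEGER)")
    db.executemany("INSERT INTO portfolio VALUES (?, ?)", [("AAA", 5), ("CCC", 2)])
    return db


def fetch_naive(remote):
    """Baseline: ship the whole remote table to the client."""
    return remote.execute("SELECT ticker, day, close FROM stock_price").fetchall()


def fetch_reduced(remote, local, day):
    """Ship only rows that can contribute to the join result."""
    # Semijoin: collect the local join keys and send them to the remote side.
    keys = [r[0] for r in local.execute("SELECT DISTINCT ticker FROM portfolio")]
    if not keys:
        return []
    placeholders = ",".join("?" for _ in keys)
    # Condition pushing: the filter on `day` is evaluated remotely as well.
    sql = (
        "SELECT ticker, day, close FROM stock_price "
        f"WHERE day = ? AND ticker IN ({placeholders})"
    )
    return remote.execute(sql, [day, *keys]).fetchall()


def join_locally(local, fetched):
    """Finish the join on the client using the rows that were fetched."""
    local.execute("CREATE TEMP TABLE fetched (ticker TEXT, day TEXT, close REAL)")
    local.executemany("INSERT INTO fetched VALUES (?, ?, ?)", fetched)
    result = local.execute(
        "SELECT p.ticker, p.shares * f.close FROM portfolio p "
        "JOIN fetched f ON f.ticker = p.ticker ORDER BY p.ticker"
    ).fetchall()
    local.execute("DROP TABLE fetched")
    return result


if __name__ == "__main__":
    remote, local = make_remote(), make_local()
    print(len(fetch_naive(remote)), "rows transferred without reduction")
    reduced = fetch_reduced(remote, local, "2017-07-11")
    print(len(reduced), "rows transferred with condition pushing and semijoin")
    print(join_locally(local, reduced))
```

In this toy setup the reduced fetch transfers only the rows for the requested day and for tickers present locally, while the final join result is the same as it would be after fetching everything and filtering on the client.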