使用TripleBit +的高性能RDF更新

Pingpeng Yuan, Lijian Fan, Hai Jin
{"title":"使用TripleBit +的高性能RDF更新","authors":"Pingpeng Yuan, Lijian Fan, Hai Jin","doi":"10.1109/ICDIM.2018.8847004","DOIUrl":null,"url":null,"abstract":"The volume of RDF data continues to grow over the past decade and many known RDF datasets have billions of triples. A grant challenge of managing this huge RDF data is how to access this big RDF data efficiently. A popular approach to addressing the problem is to build a full set of permutations of (S, P, O) indexes. Although this approach has shown to accelerate joins by orders of magnitude, the large space overhead limits the scalability of this approach and makes it heavyweight. In this paper, we present TripleBit +, a fast and compact system for updating RDF data. The design of TripleBit + has two salient features. First, the efficient maintenance strategies of TripleBit + reduces both the overhead to update data and indexes. Second, effective maintenance technologies to handle online updates over RDF repositories are proposed. Our experiments show that TripleBit + outperforms RDF-3X, MonetDB, BitMat on LUBM, UniProt, and BTC 2012 benchmark queries and it offers orders of mangnitude performance improvement for some complex join queries. Our design also yields high task rates as high as 660,000 per second and fast average response time of task which is faster than x-RDF-3X and PostgreSQL.","PeriodicalId":120884,"journal":{"name":"2018 Thirteenth International Conference on Digital Information Management (ICDIM)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"High Performance RDF Updates with TripleBit +\",\"authors\":\"Pingpeng Yuan, Lijian Fan, Hai Jin\",\"doi\":\"10.1109/ICDIM.2018.8847004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The volume of RDF data continues to grow over the past decade and many known RDF datasets have billions of triples. A grant challenge of managing this huge RDF data is how to access this big RDF data efficiently. A popular approach to addressing the problem is to build a full set of permutations of (S, P, O) indexes. Although this approach has shown to accelerate joins by orders of magnitude, the large space overhead limits the scalability of this approach and makes it heavyweight. In this paper, we present TripleBit +, a fast and compact system for updating RDF data. The design of TripleBit + has two salient features. First, the efficient maintenance strategies of TripleBit + reduces both the overhead to update data and indexes. Second, effective maintenance technologies to handle online updates over RDF repositories are proposed. Our experiments show that TripleBit + outperforms RDF-3X, MonetDB, BitMat on LUBM, UniProt, and BTC 2012 benchmark queries and it offers orders of mangnitude performance improvement for some complex join queries. Our design also yields high task rates as high as 660,000 per second and fast average response time of task which is faster than x-RDF-3X and PostgreSQL.\",\"PeriodicalId\":120884,\"journal\":{\"name\":\"2018 Thirteenth International Conference on Digital Information Management (ICDIM)\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Thirteenth International Conference on Digital Information Management (ICDIM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDIM.2018.8847004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Thirteenth International Conference on Digital Information Management (ICDIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDIM.2018.8847004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

RDF数据量在过去十年中持续增长,许多已知的RDF数据集有数十亿个三元组。管理这些庞大的RDF数据的最大挑战是如何有效地访问这些庞大的RDF数据。解决这个问题的一种流行方法是构建(S, P, O)索引的全套排列。尽管这种方法已经证明可以将连接的速度提高几个数量级,但是巨大的空间开销限制了这种方法的可伸缩性,并使其变得重量级。在本文中,我们提出了TripleBit +,一个快速和紧凑的RDF数据更新系统。TripleBit +的设计有两个显著特点。首先,TripleBit +的高效维护策略降低了更新数据和索引的开销。其次,提出了通过RDF存储库处理在线更新的有效维护技术。我们的实验表明,TripleBit +在LUBM, UniProt和BTC 2012基准查询上优于RDF-3X, MonetDB, BitMat,并且它为一些复杂的连接查询提供了数量级的性能改进。我们的设计还产生了高达每秒66万个的高任务率和比x-RDF-3X和PostgreSQL更快的任务平均响应时间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
High Performance RDF Updates with TripleBit +
The volume of RDF data continues to grow over the past decade and many known RDF datasets have billions of triples. A grant challenge of managing this huge RDF data is how to access this big RDF data efficiently. A popular approach to addressing the problem is to build a full set of permutations of (S, P, O) indexes. Although this approach has shown to accelerate joins by orders of magnitude, the large space overhead limits the scalability of this approach and makes it heavyweight. In this paper, we present TripleBit +, a fast and compact system for updating RDF data. The design of TripleBit + has two salient features. First, the efficient maintenance strategies of TripleBit + reduces both the overhead to update data and indexes. Second, effective maintenance technologies to handle online updates over RDF repositories are proposed. Our experiments show that TripleBit + outperforms RDF-3X, MonetDB, BitMat on LUBM, UniProt, and BTC 2012 benchmark queries and it offers orders of mangnitude performance improvement for some complex join queries. Our design also yields high task rates as high as 660,000 per second and fast average response time of task which is faster than x-RDF-3X and PostgreSQL.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信