{"title":"Large RDF representation framework for GPUs case study key-value storage and binary triple pattern","authors":"Chidchanok Choksuchat, C. Chantrapornchai","doi":"10.1109/ICSEC.2013.6694745","DOIUrl":null,"url":null,"abstract":"In this paper, we present the experimental framework which operates the search of RDF data in GPUs. We explore the use of triple storages for query processing in GPUs. From the user query, the SPARQL query is generated by the Middleware Jena. Then, the corresponding triples are searched. The triple search space is large and impossible to load into GPU memory to perform the parallel search. Proper representations are studied to storage data in GPUs for an effective search. We compare the key-value storage in Cassandra and binary triple storage using HDT. The experiments are done using the dataset, DBpedia and Freebase dataset. It is found that HDT compression can compress the data over ten times and the HDT format can be used to store large triples passed through the GPUs. The paper discusses qualitative and quantitative results from both manners. The pros and cons of them can further be combined properly in the further.","PeriodicalId":191620,"journal":{"name":"2013 International Computer Science and Engineering Conference (ICSEC)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Computer Science and Engineering Conference (ICSEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSEC.2013.6694745","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In this paper, we present the experimental framework which operates the search of RDF data in GPUs. We explore the use of triple storages for query processing in GPUs. From the user query, the SPARQL query is generated by the Middleware Jena. Then, the corresponding triples are searched. The triple search space is large and impossible to load into GPU memory to perform the parallel search. Proper representations are studied to storage data in GPUs for an effective search. We compare the key-value storage in Cassandra and binary triple storage using HDT. The experiments are done using the dataset, DBpedia and Freebase dataset. It is found that HDT compression can compress the data over ten times and the HDT format can be used to store large triples passed through the GPUs. The paper discusses qualitative and quantitative results from both manners. The pros and cons of them can further be combined properly in the further.