Amélie Gheerbrant, Leonid Libkin, Liat Peterfreund, Alexandra Rogova
{"title":"GQL and SQL/PGQ: Theoretical Models and Expressive Power","authors":"Amélie Gheerbrant, Leonid Libkin, Liat Peterfreund, Alexandra Rogova","doi":"arxiv-2409.01102","DOIUrl":null,"url":null,"abstract":"SQL/PGQ and GQL are very recent international standards for querying property\ngraphs: SQL/PGQ specifies how to query relational representations of property\ngraphs in SQL, while GQL is a standalone language for graph databases. The\nrapid industrial development of these standards left the academic community\ntrailing in its wake. While digests of the languages have appeared, we do not\nyet have concise foundational models like relational algebra and calculus for\nrelational databases that enable the formal study of languages, including their\nexpressiveness and limitations. At the same time, work on the next versions of\nthe standards has already begun, to address the perceived limitations of their\nfirst versions. Motivated by this, we initiate a formal study of SQL/PGQ and GQL,\nconcentrating on their concise formal model and expressiveness. For the former,\nwe define simple core languages -- Core GQL and Core PGQ -- that capture the\nessence of the new standards, are amenable to theoretical analysis, and fully\nclarify the difference between PGQ's bottom up evaluation versus GQL's linear,\nor pipelined approach. Equipped with these models, we both confirm the\nnecessity to extend the language to fill in the expressiveness gaps and\nidentify the source of these deficiencies. We complement our theoretical\nanalysis with an experimental study, demonstrating that existing workarounds in\nfull GQL and PGQ are impractical which further underscores the necessity to\ncorrect deficiencies in the language design.","PeriodicalId":501123,"journal":{"name":"arXiv - CS - Databases","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
SQL/PGQ and GQL are very recent international standards for querying property
graphs: SQL/PGQ specifies how to query relational representations of property
graphs in SQL, while GQL is a standalone language for graph databases. The
rapid industrial development of these standards left the academic community
trailing in its wake. While digests of the languages have appeared, we do not
yet have concise foundational models like relational algebra and calculus for
relational databases that enable the formal study of languages, including their
expressiveness and limitations. At the same time, work on the next versions of
the standards has already begun, to address the perceived limitations of their
first versions. Motivated by this, we initiate a formal study of SQL/PGQ and GQL,
concentrating on their concise formal model and expressiveness. For the former,
we define simple core languages -- Core GQL and Core PGQ -- that capture the
essence of the new standards, are amenable to theoretical analysis, and fully
clarify the difference between PGQ's bottom up evaluation versus GQL's linear,
or pipelined approach. Equipped with these models, we both confirm the
necessity to extend the language to fill in the expressiveness gaps and
identify the source of these deficiencies. We complement our theoretical
analysis with an experimental study, demonstrating that existing workarounds in
full GQL and PGQ are impractical which further underscores the necessity to
correct deficiencies in the language design.