{"title":"表达图模式高效匹配的强包容","authors":"Houari Mahfoud","doi":"10.1145/3508397.3564851","DOIUrl":null,"url":null,"abstract":"We consider conditional graph patterns (CGPs) that allow querying data graphs with complex features like quantification, negation and predicates. To overcome the prohibitive cost of subgraph isomorphism, we consider matching of CGPs under simulation semantics which can be conducted in quadratic time. In emerging applications that deal with large data graphs, one would like to reduce more this matching time. We consider the optimization problem that checks whether match results of some pattern P1, over any data graph, can be extracted from those of another pattern P2. This allows an efficient matching of P1 using only match results of P2 which may be much smaller than the underlying data graph. We show that when patterns are very simple then the traditional containment can be used to meet our optimization goal. However, in case of complex pattern features, the containment semantics does not suffice since containment between two patterns does not imply a possible extraction between their match results. Hence, we propose an enhanced semantics, called strong containment, that consists in checking match results extraction between two patterns. We show that strong containment can be decided in cubic time for CGPs by providing such an algorithm. This new semantics can be applied in many emerging applications such as view-based answering, query optimization and caching systems maintenance.","PeriodicalId":266269,"journal":{"name":"Proceedings of the 14th International Conference on Management of Digital EcoSystems","volume":"118 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards a Strong Containment for Efficient Matching of Expressive Graph Patterns\",\"authors\":\"Houari Mahfoud\",\"doi\":\"10.1145/3508397.3564851\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider conditional graph patterns (CGPs) that allow querying data graphs with complex features like quantification, negation and predicates. To overcome the prohibitive cost of subgraph isomorphism, we consider matching of CGPs under simulation semantics which can be conducted in quadratic time. In emerging applications that deal with large data graphs, one would like to reduce more this matching time. We consider the optimization problem that checks whether match results of some pattern P1, over any data graph, can be extracted from those of another pattern P2. This allows an efficient matching of P1 using only match results of P2 which may be much smaller than the underlying data graph. We show that when patterns are very simple then the traditional containment can be used to meet our optimization goal. However, in case of complex pattern features, the containment semantics does not suffice since containment between two patterns does not imply a possible extraction between their match results. Hence, we propose an enhanced semantics, called strong containment, that consists in checking match results extraction between two patterns. We show that strong containment can be decided in cubic time for CGPs by providing such an algorithm. This new semantics can be applied in many emerging applications such as view-based answering, query optimization and caching systems maintenance.\",\"PeriodicalId\":266269,\"journal\":{\"name\":\"Proceedings of the 14th International Conference on Management of Digital EcoSystems\",\"volume\":\"118 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 14th International Conference on Management of Digital EcoSystems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3508397.3564851\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 14th International Conference on Management of Digital EcoSystems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3508397.3564851","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Towards a Strong Containment for Efficient Matching of Expressive Graph Patterns
We consider conditional graph patterns (CGPs) that allow querying data graphs with complex features like quantification, negation and predicates. To overcome the prohibitive cost of subgraph isomorphism, we consider matching of CGPs under simulation semantics which can be conducted in quadratic time. In emerging applications that deal with large data graphs, one would like to reduce more this matching time. We consider the optimization problem that checks whether match results of some pattern P1, over any data graph, can be extracted from those of another pattern P2. This allows an efficient matching of P1 using only match results of P2 which may be much smaller than the underlying data graph. We show that when patterns are very simple then the traditional containment can be used to meet our optimization goal. However, in case of complex pattern features, the containment semantics does not suffice since containment between two patterns does not imply a possible extraction between their match results. Hence, we propose an enhanced semantics, called strong containment, that consists in checking match results extraction between two patterns. We show that strong containment can be decided in cubic time for CGPs by providing such an algorithm. This new semantics can be applied in many emerging applications such as view-based answering, query optimization and caching systems maintenance.