{"title":"核心标注:压缩传递闭包的一种新方法","authors":"Yangjun Chen, Yibin Chen","doi":"10.1109/SITIS.2008.31","DOIUrl":null,"url":null,"abstract":"A graph reachability query, as one of the primary tasks in numerous applications, is to find whether two given data objects, u and v, are related in any way in a large and complex dataset. Formally, the query is about to find if v is reachable from u in a directed graph which is large in size. In this paper, we focus ourselves on building a reachability labeling for large directed graphs, in order to process reachability queries efficiently. A new approach is proposed to compress transitive closure to support reachability checkings. The approach consists of two schemes, called Core-I labeling and Core-II labeling, respectively. For a graph G with n nodes and e edges, the labeling time of Core-I is bounded by O(n + e + t¿min{b, s}), where b is the number of the leaf nodes of a spanning tree of G, t is the number of non-tree edges (edges that do not appear in the spanning tree) and s is the number of the start nodes of all non-tree edges in G. The space overhead is bounded by O(n + s¿min{b, s}) and the querying time is O(log(min{b, s})). Core-II needs O(n + e + t¿min{b, s} + d¿s¿logmin{b, s}) labeling time and O(n + d¿s) space, where d is the number of the end nodes of all non-tree edges in G. But the query time is reduced to O(1). Experiments have been performed, showing that our method is promising.","PeriodicalId":202698,"journal":{"name":"2008 IEEE International Conference on Signal Image Technology and Internet Based Systems","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Core Labeling: A New Way to Compress Transitive Closure\",\"authors\":\"Yangjun Chen, Yibin Chen\",\"doi\":\"10.1109/SITIS.2008.31\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A graph reachability query, as one of the primary tasks in numerous applications, is to find whether two given data objects, u and v, are related in any way in a large and complex dataset. Formally, the query is about to find if v is reachable from u in a directed graph which is large in size. In this paper, we focus ourselves on building a reachability labeling for large directed graphs, in order to process reachability queries efficiently. A new approach is proposed to compress transitive closure to support reachability checkings. The approach consists of two schemes, called Core-I labeling and Core-II labeling, respectively. For a graph G with n nodes and e edges, the labeling time of Core-I is bounded by O(n + e + t¿min{b, s}), where b is the number of the leaf nodes of a spanning tree of G, t is the number of non-tree edges (edges that do not appear in the spanning tree) and s is the number of the start nodes of all non-tree edges in G. The space overhead is bounded by O(n + s¿min{b, s}) and the querying time is O(log(min{b, s})). Core-II needs O(n + e + t¿min{b, s} + d¿s¿logmin{b, s}) labeling time and O(n + d¿s) space, where d is the number of the end nodes of all non-tree edges in G. But the query time is reduced to O(1). Experiments have been performed, showing that our method is promising.\",\"PeriodicalId\":202698,\"journal\":{\"name\":\"2008 IEEE International Conference on Signal Image Technology and Internet Based Systems\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 IEEE International Conference on Signal Image Technology and Internet Based Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SITIS.2008.31\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE International Conference on Signal Image Technology and Internet Based Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SITIS.2008.31","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
图可达性查询,作为许多应用程序中的主要任务之一,是查找两个给定的数据对象,u和v,在一个大型和复杂的数据集中是否以任何方式相关。形式上,查询是要找出在一个大的有向图中v是否可以从u到达。在本文中,我们致力于为大型有向图构建可达性标记,以有效地处理可达性查询。提出了一种压缩传递闭包以支持可达性检查的新方法。该方法包括两种方案,分别称为Core-I标记和Core-II标记。图G n e节点和边的标签时间Core-I界是O (n + e + t害怕敏{b}), b在哪里生成树的叶节点的数量的G, t非树木的数量的边缘(边缘,不出现在生成树)和s是所有非树木的数量开始节点的边在G的空间开销,O (n + s害怕敏{b})和查询时间是O (log (min {b}))。Core-II需要O(n + e + t¿min{b, s} + d¿s¿logmin{b, s})标注时间和O(n + d´s)空间,其中d为g中所有非树边的末端节点数,查询时间减少到O(1)。实验结果表明,该方法是可行的。
Core Labeling: A New Way to Compress Transitive Closure
A graph reachability query, as one of the primary tasks in numerous applications, is to find whether two given data objects, u and v, are related in any way in a large and complex dataset. Formally, the query is about to find if v is reachable from u in a directed graph which is large in size. In this paper, we focus ourselves on building a reachability labeling for large directed graphs, in order to process reachability queries efficiently. A new approach is proposed to compress transitive closure to support reachability checkings. The approach consists of two schemes, called Core-I labeling and Core-II labeling, respectively. For a graph G with n nodes and e edges, the labeling time of Core-I is bounded by O(n + e + t¿min{b, s}), where b is the number of the leaf nodes of a spanning tree of G, t is the number of non-tree edges (edges that do not appear in the spanning tree) and s is the number of the start nodes of all non-tree edges in G. The space overhead is bounded by O(n + s¿min{b, s}) and the querying time is O(log(min{b, s})). Core-II needs O(n + e + t¿min{b, s} + d¿s¿logmin{b, s}) labeling time and O(n + d¿s) space, where d is the number of the end nodes of all non-tree edges in G. But the query time is reduced to O(1). Experiments have been performed, showing that our method is promising.