{"title":"Mining Path Association Rules in Large Property Graphs (with Appendix)","authors":"Yuya Sasaki, Panagiotis Karras","doi":"arxiv-2408.02029","DOIUrl":null,"url":null,"abstract":"How can we mine frequent path regularities from a graph with edge labels and\nvertex attributes? The task of association rule mining successfully discovers\nregular patterns in item sets and substructures. Still, to our best knowledge,\nthis concept has not yet been extended to path patterns in large property\ngraphs. In this paper, we introduce the problem of path association rule mining\n(PARM). Applied to any \\emph{reachability path} between two vertices within a\nlarge graph, PARM discovers regular ways in which path patterns, identified by\nvertex attributes and edge labels, co-occur with each other. We develop an\nefficient and scalable algorithm PIONEER that exploits an anti-monotonicity\nproperty to effectively prune the search space. Further, we devise\napproximation techniques and employ parallelization to achieve scalable path\nassociation rule mining. Our experimental study using real-world graph data\nverifies the significance of path association rules and the efficiency of our\nsolutions.","PeriodicalId":501123,"journal":{"name":"arXiv - CS - Databases","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.02029","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
How can we mine frequent path regularities from a graph with edge labels and
vertex attributes? The task of association rule mining successfully discovers
regular patterns in item sets and substructures. Still, to our best knowledge,
this concept has not yet been extended to path patterns in large property
graphs. In this paper, we introduce the problem of path association rule mining
(PARM). Applied to any \emph{reachability path} between two vertices within a
large graph, PARM discovers regular ways in which path patterns, identified by
vertex attributes and edge labels, co-occur with each other. We develop an
efficient and scalable algorithm PIONEER that exploits an anti-monotonicity
property to effectively prune the search space. Further, we devise
approximation techniques and employ parallelization to achieve scalable path
association rule mining. Our experimental study using real-world graph data
verifies the significance of path association rules and the efficiency of our
solutions.