Fengcai Qiao, Yuanfa Zhang, Jinsheng Deng, Zhaoyun Ding, Aiping Li
{"title":"基于频繁子图挖掘的图事务并行算法","authors":"Fengcai Qiao, Yuanfa Zhang, Jinsheng Deng, Zhaoyun Ding, Aiping Li","doi":"10.1109/DSC50466.2020.00061","DOIUrl":null,"url":null,"abstract":"Frequent subgraph patterns play an important role in feature mining for graph data. The problem of mining these patterns is defined as finding subgraphs that appear frequently according to a given frequency threshold. Usually, frequent subgraph mining (FSM) is conducted in graph transaction setting, in which graph database contains many small graphs. Since multicore processors are quite popular this day, many algorithms can be accelerated with multi-thread technique. This paper proposed a multi-thread frequent subgraph mining algorithm and achieved considerable acceleration in the experiments. In this paper, a parallel frequent subgraph mining algorithm named PTRGRAM (Parallel Transaction based Graph Mining) which can take full advantage of the multi-core performance of current processors was proposed. In the algorithm, the data synchronization between multiple threads is based on the producer-consumer model. In addition, to speed the support computing, the embedding node list is introduced for optimization. Finally, experimental performance evaluations were conducted with two graph datasets, demonstrating that the proposed algorithm outperforms the single-threaded gSpan and FFSM algorithm.","PeriodicalId":423182,"journal":{"name":"2020 IEEE Fifth International Conference on Data Science in Cyberspace (DSC)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Parallel Algorithm for Graph Transaction Based Frequent Subgraph Mining\",\"authors\":\"Fengcai Qiao, Yuanfa Zhang, Jinsheng Deng, Zhaoyun Ding, Aiping Li\",\"doi\":\"10.1109/DSC50466.2020.00061\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Frequent subgraph patterns play an important role in feature mining for graph data. The problem of mining these patterns is defined as finding subgraphs that appear frequently according to a given frequency threshold. Usually, frequent subgraph mining (FSM) is conducted in graph transaction setting, in which graph database contains many small graphs. Since multicore processors are quite popular this day, many algorithms can be accelerated with multi-thread technique. This paper proposed a multi-thread frequent subgraph mining algorithm and achieved considerable acceleration in the experiments. In this paper, a parallel frequent subgraph mining algorithm named PTRGRAM (Parallel Transaction based Graph Mining) which can take full advantage of the multi-core performance of current processors was proposed. In the algorithm, the data synchronization between multiple threads is based on the producer-consumer model. In addition, to speed the support computing, the embedding node list is introduced for optimization. Finally, experimental performance evaluations were conducted with two graph datasets, demonstrating that the proposed algorithm outperforms the single-threaded gSpan and FFSM algorithm.\",\"PeriodicalId\":423182,\"journal\":{\"name\":\"2020 IEEE Fifth International Conference on Data Science in Cyberspace (DSC)\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Fifth International Conference on Data Science in Cyberspace (DSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DSC50466.2020.00061\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Fifth International Conference on Data Science in Cyberspace (DSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSC50466.2020.00061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
频繁子图模式在图数据特征挖掘中起着重要的作用。挖掘这些模式的问题被定义为根据给定的频率阈值查找频繁出现的子图。通常在图事务设置中进行频繁子图挖掘(FSM),其中图数据库包含许多小图。由于如今多核处理器非常流行,许多算法都可以使用多线程技术来加速。本文提出了一种多线程频繁子图挖掘算法,并在实验中取得了较好的加速效果。本文提出了一种能够充分利用当前处理器多核性能的并行频繁子图挖掘算法PTRGRAM (parallel Transaction based Graph mining)。在该算法中,多线程之间的数据同步是基于生产者-消费者模型的。此外,为了提高支持度计算速度,引入嵌入节点列表进行优化。最后,在两个图数据集上进行了实验性能评估,结果表明该算法优于单线程gSpan和FFSM算法。
A Parallel Algorithm for Graph Transaction Based Frequent Subgraph Mining
Frequent subgraph patterns play an important role in feature mining for graph data. The problem of mining these patterns is defined as finding subgraphs that appear frequently according to a given frequency threshold. Usually, frequent subgraph mining (FSM) is conducted in graph transaction setting, in which graph database contains many small graphs. Since multicore processors are quite popular this day, many algorithms can be accelerated with multi-thread technique. This paper proposed a multi-thread frequent subgraph mining algorithm and achieved considerable acceleration in the experiments. In this paper, a parallel frequent subgraph mining algorithm named PTRGRAM (Parallel Transaction based Graph Mining) which can take full advantage of the multi-core performance of current processors was proposed. In the algorithm, the data synchronization between multiple threads is based on the producer-consumer model. In addition, to speed the support computing, the embedding node list is introduced for optimization. Finally, experimental performance evaluations were conducted with two graph datasets, demonstrating that the proposed algorithm outperforms the single-threaded gSpan and FFSM algorithm.