并行性的选择:多gpu驱动的管道用于庞大的学术骨干网络

IF 0.7 Q4 COMPUTER SCIENCE, THEORY & METHODS

International Journal of Parallel Emergent and Distributed Systems Pub Date : 2021-06-24 DOI:10.1080/17445760.2021.1941009

R. Ando, Y. Kadobayashi, H. Takakura

{"title":"并行性的选择:多gpu驱动的管道用于庞大的学术骨干网络","authors":"R. Ando, Y. Kadobayashi, H. Takakura","doi":"10.1080/17445760.2021.1941009","DOIUrl":null,"url":null,"abstract":"Science Information Network (SINET) is a Japanese academic backbone network for more than 800 research institutions and universities. In this paper, we present a multi-GPU-driven pipeline for handling huge session data of SINET. Our pipeline consists of ELK stack, multi-GPU server, and Splunk. A multi-GPU server is responsible for two procedures: discrimination and histogramming. Discrimination is dividing session data into ingoing/outgoing with subnet mask calculation and network address matching. Histogramming is grouping ingoing/outgoing session data into bins with map-reduce. In our architecture, we use GPU for the acceleration of ingress/egress discrimination of session data. Also, we use a tiling design pattern for building a two-stage map-reduce of CPU and GPU. Our multi-GPU-driven pipeline has succeeded in processing huge workloads of about 1.2–1.6 billion session streams (500–650 GB) within 24 hours. GRAPHICAL ABSTRACT","PeriodicalId":45411,"journal":{"name":"International Journal of Parallel Emergent and Distributed Systems","volume":"36 1","pages":"609 - 622"},"PeriodicalIF":0.7000,"publicationDate":"2021-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/17445760.2021.1941009","citationCount":"2","resultStr":"{\"title\":\"Choice of parallelism: multi-GPU driven pipeline for huge academic backbone network\",\"authors\":\"R. Ando, Y. Kadobayashi, H. Takakura\",\"doi\":\"10.1080/17445760.2021.1941009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Science Information Network (SINET) is a Japanese academic backbone network for more than 800 research institutions and universities. In this paper, we present a multi-GPU-driven pipeline for handling huge session data of SINET. Our pipeline consists of ELK stack, multi-GPU server, and Splunk. A multi-GPU server is responsible for two procedures: discrimination and histogramming. Discrimination is dividing session data into ingoing/outgoing with subnet mask calculation and network address matching. Histogramming is grouping ingoing/outgoing session data into bins with map-reduce. In our architecture, we use GPU for the acceleration of ingress/egress discrimination of session data. Also, we use a tiling design pattern for building a two-stage map-reduce of CPU and GPU. Our multi-GPU-driven pipeline has succeeded in processing huge workloads of about 1.2–1.6 billion session streams (500–650 GB) within 24 hours. GRAPHICAL ABSTRACT\",\"PeriodicalId\":45411,\"journal\":{\"name\":\"International Journal of Parallel Emergent and Distributed Systems\",\"volume\":\"36 1\",\"pages\":\"609 - 622\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2021-06-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1080/17445760.2021.1941009\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Parallel Emergent and Distributed Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/17445760.2021.1941009\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Parallel Emergent and Distributed Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/17445760.2021.1941009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}

引用次数: 2

摘要

科学信息网(SINET)是日本800多所研究机构和大学的学术骨干网络。在本文中，我们提出了一个多gpu驱动的管道来处理SINET的海量会话数据。我们的流水线由ELK堆栈、多gpu服务器和Splunk组成。一个多gpu服务器负责两个程序:判别和直方图。判别是通过子网掩码计算和网络地址匹配将会话数据划分为入/出。直方图是使用map-reduce将入/出会话数据分组到bin中。在我们的架构中，我们使用GPU来加速会话数据的入口/出口识别。此外，我们使用平铺设计模式来构建CPU和GPU的两阶段映射缩减。我们的多gpu驱动管道已经成功地在24小时内处理了大约12 - 16亿个会话流(500-650 GB)的巨大工作负载。图形抽象

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Choice of parallelism: multi-GPU driven pipeline for huge academic backbone network

Science Information Network (SINET) is a Japanese academic backbone network for more than 800 research institutions and universities. In this paper, we present a multi-GPU-driven pipeline for handling huge session data of SINET. Our pipeline consists of ELK stack, multi-GPU server, and Splunk. A multi-GPU server is responsible for two procedures: discrimination and histogramming. Discrimination is dividing session data into ingoing/outgoing with subnet mask calculation and network address matching. Histogramming is grouping ingoing/outgoing session data into bins with map-reduce. In our architecture, we use GPU for the acceleration of ingress/egress discrimination of session data. Also, we use a tiling design pattern for building a two-stage map-reduce of CPU and GPU. Our multi-GPU-driven pipeline has succeeded in processing huge workloads of about 1.2–1.6 billion session streams (500–650 GB) within 24 hours. GRAPHICAL ABSTRACT

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Parallel Emergent and Distributed Systems COMPUTER SCIENCE, THEORY & METHODS-

CiteScore

2.30

自引率

0.00%

发文量