Seunghwa Kang, Alexandre Fender, Joe Eaton, Brad Rees
{"title":"使用DGX A100集群计算网页抓取数据的PageRank分数","authors":"Seunghwa Kang, Alexandre Fender, Joe Eaton, Brad Rees","doi":"10.1109/HPEC43674.2020.9286216","DOIUrl":null,"url":null,"abstract":"PageRank is a widely used graph analytics algorithm to rank vertices using relationship data. Large-scale Page Rank is challenging due to its irregular and communication intensive computational characteristics. We implemented Page Rank on NVIDIA's newly released DGX A100 cluster and compared the performance with two recent notable large-scale Page Rank computations using the Common Crawl dataset. The ShenTu framework computed Page Rank scores using a large number of custom microprocessors connected with an HPC class interconnect. The Hronos framework reported the state-of-the-art performance using 3000 commodity CPU nodes and 10 Gbps Ethernet. The Common Crawl dataset captures link relationships between web pages in a graph with 3.563 billion vertices and 128.736 billion edges. Our implementation demonstrated 13x faster PageRank iteration time than the Hronos framework using a cluster with NVLink connected 32 A100 GPUs.","PeriodicalId":168544,"journal":{"name":"2020 IEEE High Performance Extreme Computing Conference (HPEC)","volume":"433 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Computing PageRank Scores of Web Crawl Data Using DGX A100 Clusters\",\"authors\":\"Seunghwa Kang, Alexandre Fender, Joe Eaton, Brad Rees\",\"doi\":\"10.1109/HPEC43674.2020.9286216\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"PageRank is a widely used graph analytics algorithm to rank vertices using relationship data. Large-scale Page Rank is challenging due to its irregular and communication intensive computational characteristics. We implemented Page Rank on NVIDIA's newly released DGX A100 cluster and compared the performance with two recent notable large-scale Page Rank computations using the Common Crawl dataset. The ShenTu framework computed Page Rank scores using a large number of custom microprocessors connected with an HPC class interconnect. The Hronos framework reported the state-of-the-art performance using 3000 commodity CPU nodes and 10 Gbps Ethernet. The Common Crawl dataset captures link relationships between web pages in a graph with 3.563 billion vertices and 128.736 billion edges. Our implementation demonstrated 13x faster PageRank iteration time than the Hronos framework using a cluster with NVLink connected 32 A100 GPUs.\",\"PeriodicalId\":168544,\"journal\":{\"name\":\"2020 IEEE High Performance Extreme Computing Conference (HPEC)\",\"volume\":\"433 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE High Performance Extreme Computing Conference (HPEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HPEC43674.2020.9286216\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE High Performance Extreme Computing Conference (HPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPEC43674.2020.9286216","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Computing PageRank Scores of Web Crawl Data Using DGX A100 Clusters
PageRank is a widely used graph analytics algorithm to rank vertices using relationship data. Large-scale Page Rank is challenging due to its irregular and communication intensive computational characteristics. We implemented Page Rank on NVIDIA's newly released DGX A100 cluster and compared the performance with two recent notable large-scale Page Rank computations using the Common Crawl dataset. The ShenTu framework computed Page Rank scores using a large number of custom microprocessors connected with an HPC class interconnect. The Hronos framework reported the state-of-the-art performance using 3000 commodity CPU nodes and 10 Gbps Ethernet. The Common Crawl dataset captures link relationships between web pages in a graph with 3.563 billion vertices and 128.736 billion edges. Our implementation demonstrated 13x faster PageRank iteration time than the Hronos framework using a cluster with NVLink connected 32 A100 GPUs.