Kai Huang, Mehmet Güngör, Stratis Ioannidis, M. Leeser
{"title":"Optimizing Use of Different Types of Memory for FPGAs in High Performance Computing","authors":"Kai Huang, Mehmet Güngör, Stratis Ioannidis, M. Leeser","doi":"10.1109/HPEC43674.2020.9286144","DOIUrl":null,"url":null,"abstract":"Accelerators such as Field Programmable Gate Arrays (FPGAs) are increasingly used in high performance computing, and the problems they are applied to process larger and larger amounts of data. FPGA manufacturers have added new types of memory on chip to help ease the memory bottleneck; however, the burden is on the designer to determine how data is allocated to different memory types. We study the use of ultraRAM for a graph application running on Amazon Web Services (AWS) that generates a large amount of intermediate data that is not subsequently accessed sequentially. We investigate different algorithms for mapping data to ultraRAM. Our results show that use of ultraRAM can speed up overall application run time by a factor of 3 or more. Maximizing the amount of ultraRAM used produces the best results, and as problem size grows, judiciously assigning data to ultraRAM vs. DDR results in better performance.","PeriodicalId":168544,"journal":{"name":"2020 IEEE High Performance Extreme Computing Conference (HPEC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE High Performance Extreme Computing Conference (HPEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HPEC43674.2020.9286144","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Accelerators such as Field Programmable Gate Arrays (FPGAs) are increasingly used in high performance computing, and the problems they are applied to process larger and larger amounts of data. FPGA manufacturers have added new types of memory on chip to help ease the memory bottleneck; however, the burden is on the designer to determine how data is allocated to different memory types. We study the use of ultraRAM for a graph application running on Amazon Web Services (AWS) that generates a large amount of intermediate data that is not subsequently accessed sequentially. We investigate different algorithms for mapping data to ultraRAM. Our results show that use of ultraRAM can speed up overall application run time by a factor of 3 or more. Maximizing the amount of ultraRAM used produces the best results, and as problem size grows, judiciously assigning data to ultraRAM vs. DDR results in better performance.
像现场可编程门阵列(fpga)这样的加速器越来越多地用于高性能计算,它们被应用于处理越来越多的数据量。FPGA制造商已经在芯片上添加了新型存储器,以帮助缓解存储器瓶颈;然而,设计人员的负担在于确定如何将数据分配给不同的内存类型。我们研究了在Amazon Web Services (AWS)上运行的图形应用程序中使用ultraRAM,该应用程序生成大量中间数据,这些数据随后不会被顺序访问。我们研究了将数据映射到ultraRAM的不同算法。我们的结果表明,使用ultraRAM可以将整个应用程序的运行时间提高3倍或更多。最大限度地使用ultraRAM可以产生最佳结果,并且随着问题规模的增长,明智地将数据分配给ultraRAM和DDR可以获得更好的性能。