有效地址转换的矢量集合指令页地址合并

2022 IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms (IA3) Pub Date : 2022-11-01 DOI:10.1109/IA356718.2022.00007

Hikaru Takayashiki, Masayuki Sato, K. Komatsu, Hiroaki Kobayashi

{"title":"有效地址转换的矢量集合指令页地址合并","authors":"Hikaru Takayashiki, Masayuki Sato, K. Komatsu, Hiroaki Kobayashi","doi":"10.1109/IA356718.2022.00007","DOIUrl":null,"url":null,"abstract":"Vector gather instructions are available in various processors, which are essential for handling irregular memory accesses. Additionally, the processors support virtual memory that allows programmers not to consider the limitation of the physical memory space. To realize the virtual memory, the processors require address translation between virtual and physical addresses. When a vector gather instruction loads data elements distributed over the physical memory space, all virtual addresses must be translated one by one, causing many translations by accessing a Translation Lookaside Buffer (TLB). Hence, the TLB easily becomes a bottleneck in handling vector gather instructions. To relieve the bottleneck, this paper proposes an address coalescing method for the address translations of vector gather instructions by utilizing vector arithmetic units in the processor. The evaluation results show that the proposed method can achieve a 2x performance improvement in numerical and 1.08x in graph applications, which contain many vector gather instructions.","PeriodicalId":144759,"journal":{"name":"2022 IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms (IA3)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Page-Address Coalescing of Vector Gather Instructions for Efficient Address Translation\",\"authors\":\"Hikaru Takayashiki, Masayuki Sato, K. Komatsu, Hiroaki Kobayashi\",\"doi\":\"10.1109/IA356718.2022.00007\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Vector gather instructions are available in various processors, which are essential for handling irregular memory accesses. Additionally, the processors support virtual memory that allows programmers not to consider the limitation of the physical memory space. To realize the virtual memory, the processors require address translation between virtual and physical addresses. When a vector gather instruction loads data elements distributed over the physical memory space, all virtual addresses must be translated one by one, causing many translations by accessing a Translation Lookaside Buffer (TLB). Hence, the TLB easily becomes a bottleneck in handling vector gather instructions. To relieve the bottleneck, this paper proposes an address coalescing method for the address translations of vector gather instructions by utilizing vector arithmetic units in the processor. The evaluation results show that the proposed method can achieve a 2x performance improvement in numerical and 1.08x in graph applications, which contain many vector gather instructions.\",\"PeriodicalId\":144759,\"journal\":{\"name\":\"2022 IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms (IA3)\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms (IA3)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IA356718.2022.00007\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms (IA3)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IA356718.2022.00007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

矢量收集指令可用于各种处理器，这对于处理不规则的内存访问是必不可少的。此外，处理器支持虚拟内存，使程序员不必考虑物理内存空间的限制。为了实现虚拟内存，处理器需要在虚拟地址和物理地址之间进行地址转换。当一个矢量收集指令加载分布在物理内存空间上的数据元素时，所有的虚拟地址必须一个接一个地进行转换，通过访问转换Lookaside缓冲区(Translation Lookaside Buffer, TLB)导致许多转换。因此，TLB很容易成为处理矢量采集指令的瓶颈。为了解决这一瓶颈问题，本文提出了一种利用处理器中的矢量运算单元对矢量采集指令进行地址转换的地址合并方法。评估结果表明，该方法在包含大量矢量采集指令的图形应用中性能提高了1.08倍，在数值应用中性能提高了2倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Page-Address Coalescing of Vector Gather Instructions for Efficient Address Translation

Vector gather instructions are available in various processors, which are essential for handling irregular memory accesses. Additionally, the processors support virtual memory that allows programmers not to consider the limitation of the physical memory space. To realize the virtual memory, the processors require address translation between virtual and physical addresses. When a vector gather instruction loads data elements distributed over the physical memory space, all virtual addresses must be translated one by one, causing many translations by accessing a Translation Lookaside Buffer (TLB). Hence, the TLB easily becomes a bottleneck in handling vector gather instructions. To relieve the bottleneck, this paper proposes an address coalescing method for the address translations of vector gather instructions by utilizing vector arithmetic units in the processor. The evaluation results show that the proposed method can achieve a 2x performance improvement in numerical and 1.08x in graph applications, which contain many vector gather instructions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms (IA3)

自引率

0.00%

发文量