Buffer Allocation for Exposed Datapath Architectures

2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC) Pub Date : 2022-12-01 DOI:10.1109/MCSoC57363.2022.00013

Anoop Bhagyanath, K. Schneider

{"title":"Buffer Allocation for Exposed Datapath Architectures","authors":"Anoop Bhagyanath, K. Schneider","doi":"10.1109/MCSoC57363.2022.00013","DOIUrl":null,"url":null,"abstract":"Concurrent access to a given number of registers limits the instruction-level parallelism (ILP) used by conventional processors despite the use of many processing units (PUs). Many recent architectures expose their internal datapaths to compilers, allowing the compiler to move intermediate values from program execution directly between PUs, thus bypassing the use of registers. Buffered exposed datapath (BED) architectures additionally implement these inter-PU communication paths with scalable first-in-first-out (FIFO) buffers to avoid the use of registers and to prevent unnecessary synchronization between PUs. However, the BED compiler must ensure that the creation order of intermediate values in a buffer matches their consumption order so that the next executing instructions always find their operands at the heads of the corresponding buffers. In this paper, we present a novel buffer interference analysis that determines a criterion for allocating multiple program variables to the same buffer based on a given instruction schedule that specifies an access order for those variables. We then use the well-known dataflow analysis framework to compute a buffer interference graph whose coloring yields a valid buffer allocation for programs by considering the instructions in the given order. Preliminary experimental results show the effectiveness of our code generation approach compared to traditional register-based compilation. More importantly, the buffer interference graph should serve as the basis for future buffer allocation schemes that maximize ILP usage.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MCSoC57363.2022.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Concurrent access to a given number of registers limits the instruction-level parallelism (ILP) used by conventional processors despite the use of many processing units (PUs). Many recent architectures expose their internal datapaths to compilers, allowing the compiler to move intermediate values from program execution directly between PUs, thus bypassing the use of registers. Buffered exposed datapath (BED) architectures additionally implement these inter-PU communication paths with scalable first-in-first-out (FIFO) buffers to avoid the use of registers and to prevent unnecessary synchronization between PUs. However, the BED compiler must ensure that the creation order of intermediate values in a buffer matches their consumption order so that the next executing instructions always find their operands at the heads of the corresponding buffers. In this paper, we present a novel buffer interference analysis that determines a criterion for allocating multiple program variables to the same buffer based on a given instruction schedule that specifies an access order for those variables. We then use the well-known dataflow analysis framework to compute a buffer interference graph whose coloring yields a valid buffer allocation for programs by considering the instructions in the given order. Preliminary experimental results show the effectiveness of our code generation approach compared to traditional register-based compilation. More importantly, the buffer interference graph should serve as the basis for future buffer allocation schemes that maximize ILP usage.

查看原文本刊更多论文

公开数据路径体系结构的缓冲区分配

对给定数量的寄存器的并发访问限制了传统处理器使用的指令级并行性(ILP)，尽管使用了许多处理单元(pu)。许多最新的体系结构向编译器公开了它们的内部数据路径，允许编译器直接在pu之间移动程序执行的中间值，从而绕过了寄存器的使用。缓冲暴露数据路径(BED)架构还使用可扩展的先进先出(FIFO)缓冲区实现这些pu间通信路径，以避免使用寄存器并防止pu之间不必要的同步。但是，BED编译器必须确保缓冲区中中间值的创建顺序与它们的消费顺序相匹配，以便下一个执行指令总是在相应缓冲区的头部找到它们的操作数。在本文中，我们提出了一种新的缓冲区干扰分析方法，该方法根据给定的指令时间表确定将多个程序变量分配到同一缓冲区的标准，该指令时间表指定了这些变量的访问顺序。然后，我们使用众所周知的数据流分析框架来计算缓冲区干扰图，该图的着色通过考虑给定顺序的指令为程序提供有效的缓冲区分配。初步的实验结果表明，与传统的基于寄存器的编译相比，我们的代码生成方法是有效的。更重要的是，缓冲区干扰图应该作为未来缓冲区分配方案的基础，以最大限度地利用ILP。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)

自引率

0.00%

发文量