Multi armed bandit based resource allocation in Near Memory Processing architectures

Memories - Materials, Devices, Circuits and Systems Pub Date : 2025-07-19 DOI:10.1016/j.memori.2025.100132

Shubhang Pandey, T.G. Venkatesh

{"title":"Multi armed bandit based resource allocation in Near Memory Processing architectures","authors":"Shubhang Pandey, T.G. Venkatesh","doi":"10.1016/j.memori.2025.100132","DOIUrl":null,"url":null,"abstract":"<div><div>Recent advances in 3D fabrication have allowed handling the memory bottlenecks for modern data-intensive applications by bringing the computation closer to the memory, enabling Near Memory Processing (NMP). Memory Centric Networks (MCN) are advanced memory architectures that use NMP architectures, where multiple stacks of the 3D memory units are equipped with simple processing cores, allowing numerous threads to execute concurrently. The performance of the NMP is crucially dependent upon the efficient task offloading and task-to-NMP allocation. Our work presents a multi-armed bandit (MAB) based approach in formulating an efficient resource allocation strategy for MCN. Most existing literature concentrates only on one application domain and optimizing only one metric, i.e., either execution time or power. However, our solution is more generic and can be applied to diverse application domains. In our approach, we deploy Upper Confidence Bound (UCB) policy to collect rewards and eventually use it for regret optimization. We study the following metrics-instructions per cycle, execution times, NMP core cache misses, packet latencies, and power consumption. Our study covers various applications from PARSEC and SPLASH2 benchmarks suite. The evaluation shows that the system’s performance improves by <span><math><mrow><mo>∼</mo><mn>11</mn><mtext>%</mtext></mrow></math></span> on average and an average reduction in total power consumption by <span><math><mrow><mo>∼</mo><mn>12</mn><mtext>%</mtext></mrow></math></span>.</div></div>","PeriodicalId":100915,"journal":{"name":"Memories - Materials, Devices, Circuits and Systems","volume":"11 ","pages":"Article 100132"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Memories - Materials, Devices, Circuits and Systems","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S277306462500012X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Recent advances in 3D fabrication have allowed handling the memory bottlenecks for modern data-intensive applications by bringing the computation closer to the memory, enabling Near Memory Processing (NMP). Memory Centric Networks (MCN) are advanced memory architectures that use NMP architectures, where multiple stacks of the 3D memory units are equipped with simple processing cores, allowing numerous threads to execute concurrently. The performance of the NMP is crucially dependent upon the efficient task offloading and task-to-NMP allocation. Our work presents a multi-armed bandit (MAB) based approach in formulating an efficient resource allocation strategy for MCN. Most existing literature concentrates only on one application domain and optimizing only one metric, i.e., either execution time or power. However, our solution is more generic and can be applied to diverse application domains. In our approach, we deploy Upper Confidence Bound (UCB) policy to collect rewards and eventually use it for regret optimization. We study the following metrics-instructions per cycle, execution times, NMP core cache misses, packet latencies, and power consumption. Our study covers various applications from PARSEC and SPLASH2 benchmarks suite. The evaluation shows that the system’s performance improves by

\sim 11 %

on average and an average reduction in total power consumption by

\sim 12 %

查看原文本刊更多论文

基于多武装强盗的近内存处理体系结构资源分配

3D制造的最新进展使计算更接近内存，从而实现近内存处理（NMP），从而可以处理现代数据密集型应用的内存瓶颈。内存中心网络（MCN）是使用NMP架构的高级内存架构，其中多个3D存储单元堆栈配备了简单的处理核心，允许多个线程并发执行。NMP的性能在很大程度上取决于有效的任务卸载和任务到NMP的分配。我们的工作提出了一种基于多武装强盗（MAB）的方法来制定MCN的有效资源分配策略。大多数现有文献只关注一个应用领域，只优化一个指标，即执行时间或功率。然而，我们的解决方案更加通用，可以应用于不同的应用程序领域。在我们的方法中，我们部署了上限置信度（UCB）策略来收集奖励，并最终将其用于后悔优化。我们研究了以下指标：每个周期的指令、执行时间、NMP核心缓存丢失、数据包延迟和功耗。我们的研究涵盖了PARSEC和SPLASH2基准测试套件的各种应用程序。评估结果表明，该系统的性能平均提高了~ 11%，总功耗平均降低了~ 12%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Memories - Materials, Devices, Circuits and Systems

自引率

0.00%

发文量