Multi armed bandit based resource allocation in Near Memory Processing architectures

Shubhang Pandey, T.G. Venkatesh
{"title":"Multi armed bandit based resource allocation in Near Memory Processing architectures","authors":"Shubhang Pandey,&nbsp;T.G. Venkatesh","doi":"10.1016/j.memori.2025.100132","DOIUrl":null,"url":null,"abstract":"<div><div>Recent advances in 3D fabrication have allowed handling the memory bottlenecks for modern data-intensive applications by bringing the computation closer to the memory, enabling Near Memory Processing (NMP). Memory Centric Networks (MCN) are advanced memory architectures that use NMP architectures, where multiple stacks of the 3D memory units are equipped with simple processing cores, allowing numerous threads to execute concurrently. The performance of the NMP is crucially dependent upon the efficient task offloading and task-to-NMP allocation. Our work presents a multi-armed bandit (MAB) based approach in formulating an efficient resource allocation strategy for MCN. Most existing literature concentrates only on one application domain and optimizing only one metric, i.e., either execution time or power. However, our solution is more generic and can be applied to diverse application domains. In our approach, we deploy Upper Confidence Bound (UCB) policy to collect rewards and eventually use it for regret optimization. We study the following metrics-instructions per cycle, execution times, NMP core cache misses, packet latencies, and power consumption. Our study covers various applications from PARSEC and SPLASH2 benchmarks suite. The evaluation shows that the system’s performance improves by <span><math><mrow><mo>∼</mo><mn>11</mn><mtext>%</mtext></mrow></math></span> on average and an average reduction in total power consumption by <span><math><mrow><mo>∼</mo><mn>12</mn><mtext>%</mtext></mrow></math></span>.</div></div>","PeriodicalId":100915,"journal":{"name":"Memories - Materials, Devices, Circuits and Systems","volume":"11 ","pages":"Article 100132"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Memories - Materials, Devices, Circuits and Systems","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S277306462500012X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Recent advances in 3D fabrication have allowed handling the memory bottlenecks for modern data-intensive applications by bringing the computation closer to the memory, enabling Near Memory Processing (NMP). Memory Centric Networks (MCN) are advanced memory architectures that use NMP architectures, where multiple stacks of the 3D memory units are equipped with simple processing cores, allowing numerous threads to execute concurrently. The performance of the NMP is crucially dependent upon the efficient task offloading and task-to-NMP allocation. Our work presents a multi-armed bandit (MAB) based approach in formulating an efficient resource allocation strategy for MCN. Most existing literature concentrates only on one application domain and optimizing only one metric, i.e., either execution time or power. However, our solution is more generic and can be applied to diverse application domains. In our approach, we deploy Upper Confidence Bound (UCB) policy to collect rewards and eventually use it for regret optimization. We study the following metrics-instructions per cycle, execution times, NMP core cache misses, packet latencies, and power consumption. Our study covers various applications from PARSEC and SPLASH2 benchmarks suite. The evaluation shows that the system’s performance improves by 11% on average and an average reduction in total power consumption by 12%.
基于多武装强盗的近内存处理体系结构资源分配
3D制造的最新进展使计算更接近内存,从而实现近内存处理(NMP),从而可以处理现代数据密集型应用的内存瓶颈。内存中心网络(MCN)是使用NMP架构的高级内存架构,其中多个3D存储单元堆栈配备了简单的处理核心,允许多个线程并发执行。NMP的性能在很大程度上取决于有效的任务卸载和任务到NMP的分配。我们的工作提出了一种基于多武装强盗(MAB)的方法来制定MCN的有效资源分配策略。大多数现有文献只关注一个应用领域,只优化一个指标,即执行时间或功率。然而,我们的解决方案更加通用,可以应用于不同的应用程序领域。在我们的方法中,我们部署了上限置信度(UCB)策略来收集奖励,并最终将其用于后悔优化。我们研究了以下指标:每个周期的指令、执行时间、NMP核心缓存丢失、数据包延迟和功耗。我们的研究涵盖了PARSEC和SPLASH2基准测试套件的各种应用程序。评估结果表明,该系统的性能平均提高了~ 11%,总功耗平均降低了~ 12%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信