NBBS: A Non-Blocking Buddy System for Multi-core Machines

2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) Pub Date : 2019-05-01 DOI:10.1109/CCGRID.2019.00011

Romolo Marotta, Mauro Ianni, Andrea Scarselli, Alessandro Pellegrini, F. Quaglia

{"title":"NBBS: A Non-Blocking Buddy System for Multi-core Machines","authors":"Romolo Marotta, Mauro Ianni, Andrea Scarselli, Alessandro Pellegrini, F. Quaglia","doi":"10.1109/CCGRID.2019.00011","DOIUrl":null,"url":null,"abstract":"Common implementations of core memory allocation components, like the Linux buddy system, handle concurrent allocation/release requests by synchronizing threads via spin-locks. This approach is not prone to scale, a problem that has been addressed in the literature by introducing layered allocation services or replicating the core allocators—the bottom most ones within the layered architecture. Both these solutions tend to reduce the pressure of actual concurrent accesses to each individual core allocator. In this article we explore an alternative approach to scalability of memory allocation/release, which can be still combined with those literature proposals. We present a fully non-blocking buddy-system, where threads performing concurrent allocations/releases do not undergo any spin-lock based synchronization. Our solution allows threads to proceed in parallel, and commit their allocations/releases unless a conflict is materialized while handling the allocator metadata. Conflict detection relies on atomic Read-Modify-Write (RMW) machine instructions. Beyond improving scalability and performance, our solution can also avoid wasting clock cycles for spin-lock operations by threads that could in principle carry out their memory allocations/releases in full concurrency.","PeriodicalId":234571,"journal":{"name":"2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)","volume":"119 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCGRID.2019.00011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Common implementations of core memory allocation components, like the Linux buddy system, handle concurrent allocation/release requests by synchronizing threads via spin-locks. This approach is not prone to scale, a problem that has been addressed in the literature by introducing layered allocation services or replicating the core allocators—the bottom most ones within the layered architecture. Both these solutions tend to reduce the pressure of actual concurrent accesses to each individual core allocator. In this article we explore an alternative approach to scalability of memory allocation/release, which can be still combined with those literature proposals. We present a fully non-blocking buddy-system, where threads performing concurrent allocations/releases do not undergo any spin-lock based synchronization. Our solution allows threads to proceed in parallel, and commit their allocations/releases unless a conflict is materialized while handling the allocator metadata. Conflict detection relies on atomic Read-Modify-Write (RMW) machine instructions. Beyond improving scalability and performance, our solution can also avoid wasting clock cycles for spin-lock operations by threads that could in principle carry out their memory allocations/releases in full concurrency.

查看原文本刊更多论文

NBBS:用于多核机器的非阻塞伙伴系统

核心内存分配组件的常见实现(如Linux伙伴系统)通过自旋锁同步线程来处理并发分配/释放请求。这种方法不容易扩展，这个问题已经在文献中通过引入分层分配服务或复制核心分配器(分层体系结构中最底层的分配器)来解决。这两种解决方案都倾向于减少对每个单独核心分配器的实际并发访问的压力。在本文中，我们将探讨内存分配/释放可伸缩性的另一种方法，该方法仍然可以与那些文献建议结合使用。我们提出了一个完全无阻塞的伙伴系统，其中执行并发分配/释放的线程不经历任何基于自旋锁的同步。我们的解决方案允许线程并行进行，并提交它们的分配/释放，除非在处理分配器元数据时发生冲突。冲突检测依赖于原子的读-修改-写(RMW)机器指令。除了提高可伸缩性和性能之外，我们的解决方案还可以避免线程在旋转锁定操作上浪费时钟周期，这些线程原则上可以完全并发地执行内存分配/释放。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)

自引率

0.00%

发文量