Minhyeok Kweun, Goeun Kim, Byungsoo Oh, Seong-In Jung, Taegeon Um, Woo-Yeon Lee
PokéMem: Taming Wild Memory Consumers in Apache Spark
2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS), May 2022
DOI: 10.1109/ipdps53621.2022.00015
Citations: 2
Abstract
Apache Spark is a widely used in-memory processing system due to its high performance. For fast data processing, Spark manages in-memory data, such as cached or shuffle (aggregation and sorting) data, in its own managed memory pools. However, despite this sophisticated memory management scheme, we found that Spark still suffers from out-of-memory (OOM) exceptions and high garbage collection (GC) overheads when wild memory consumers, which are not tracked by Spark and execute external code, use large amounts of memory. To resolve these problems, we propose PokéMem, an enhanced Spark that incorporates wild memory consumers into the managed ones, preventing them from stealthily consuming excessive memory. Our main idea is to open the black box of unmanaged memory regions in external code by providing customized data collections. PokéMem enables fine-grained control of the objects created within running tasks by spilling and reloading the objects of these custom data collections based on memory pressure and access patterns. To further reduce memory pressure, PokéMem exploits pre-built memory estimation models to predict the external code's memory usage and proactively acquires memory before the external code executes, and it also monitors JVM heap usage to avoid critical memory pressure. With the help of these techniques, our evaluations show that PokéMem outperforms vanilla Spark, with up to 3× faster execution and 3.9× lower GC overhead, and successfully runs workloads without the OOM exceptions that cause vanilla Spark to fail.
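To illustrate the core idea of a "custom data collection" that cooperates with a memory manager, here is a minimal, hypothetical Java sketch. It is not PokéMem's or Spark's actual API: `SimpleMemoryManager` and `SpillableBuffer` are illustrative names, and a fixed per-object size estimate stands in for the paper's memory estimation models. Before buffering an object produced by external code, the collection asks the manager for space; if the request is denied, it spills its in-memory contents to disk instead of growing the heap.

```java
import java.io.File;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.ArrayList;
import java.util.List;

// Illustrative memory accountant: tracks how many bytes tracked
// collections have acquired against a fixed capacity.
class SimpleMemoryManager {
    private final long capacityBytes;
    private long used = 0;

    SimpleMemoryManager(long capacityBytes) { this.capacityBytes = capacityBytes; }

    synchronized boolean tryAcquire(long bytes) {
        if (used + bytes <= capacityBytes) { used += bytes; return true; }
        return false;  // under memory pressure: caller must spill first
    }

    synchronized void release(long bytes) { used -= bytes; }
}

// A collection that registers every appended object with the memory
// manager and spills to disk when no more memory can be acquired.
class SpillableBuffer<T extends Serializable> {
    private final SimpleMemoryManager mm;
    private final long bytesPerItem;  // coarse per-object size estimate (assumption)
    private final List<T> inMemory = new ArrayList<>();
    int spillCount = 0;

    SpillableBuffer(SimpleMemoryManager mm, long bytesPerItem) {
        this.mm = mm;
        this.bytesPerItem = bytesPerItem;
    }

    void append(T item) throws IOException {
        if (!mm.tryAcquire(bytesPerItem)) {
            spill();  // free tracked memory instead of risking an OOM
            if (!mm.tryAcquire(bytesPerItem)) {
                throw new IllegalStateException("cannot acquire memory after spill");
            }
        }
        inMemory.add(item);
    }

    // Serialize buffered objects to a temp file, then release their
    // tracked memory so other consumers can use it.
    private void spill() throws IOException {
        File f = File.createTempFile("spill", ".bin");
        f.deleteOnExit();
        try (ObjectOutputStream out = new ObjectOutputStream(new FileOutputStream(f))) {
            for (T item : inMemory) out.writeObject(item);
        }
        mm.release((long) inMemory.size() * bytesPerItem);
        inMemory.clear();
        spillCount++;
    }

    int inMemoryCount() { return inMemory.size(); }
}
```

For example, with a 400-byte capacity and a 100-byte-per-item estimate, appending ten integers forces two spills, keeping the tracked footprint bounded. The real system additionally reloads spilled objects on access and predicts usage before external code runs; this sketch shows only the acquire-or-spill discipline that brings an otherwise unmanaged consumer under the memory manager's control.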