Automating Multi-level Performance Elastic Components for IBM Streams

Proceedings of the 20th International Middleware Conference Pub Date : 2019-12-09 DOI:10.1145/3361525.3361544

Xiang Ni, S. Schneider, Raju Pavuluri, Jonathan Kaus, Kun-Lung Wu

{"title":"Automating Multi-level Performance Elastic Components for IBM Streams","authors":"Xiang Ni, S. Schneider, Raju Pavuluri, Jonathan Kaus, Kun-Lung Wu","doi":"10.1145/3361525.3361544","DOIUrl":null,"url":null,"abstract":"Streaming applications exhibit abundant opportunities for pipeline parallelism, data parallelism and task parallelism. Prior work in IBM Streams introduced an elastic threading model that sought the best performance by automatically tuning the number of threads. In this paper, we introduce the ability to automatically discover where that threading model is profitable. However this introduces a new challenge: we have separate performance elastic mechanisms that are designed with different objectives, leading to potential negative interactions and unintended performance degradation. We present our experiences in overcoming these challenges by showing how to coordinate separate but interfering elasticity mechanisms to maxmize performance gains with stable and fast parallelism exploitation. We first describe an elastic performance mechanism that automatically adapts different threading models to different regions of an application. We then show a coherent ecosystem for coordinating this threading model elasticty with thread count elasticity. This system is an online, stable multi-level elastic coordination scheme that adapts different regions of a streaming application to different threading models and number of threads. We implemented this multi-level coordination scheme in IBM Streams and demonstrated that it (a) scales to over a hundred threads; (b) can improve performance by an order of magnitude on two different processor architectures when an application can benefit from multiple threading models; and (c) achieves performance comparable to hand-optimized applications but with much fewer threads.","PeriodicalId":381253,"journal":{"name":"Proceedings of the 20th International Middleware Conference","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 20th International Middleware Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3361525.3361544","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Streaming applications exhibit abundant opportunities for pipeline parallelism, data parallelism and task parallelism. Prior work in IBM Streams introduced an elastic threading model that sought the best performance by automatically tuning the number of threads. In this paper, we introduce the ability to automatically discover where that threading model is profitable. However this introduces a new challenge: we have separate performance elastic mechanisms that are designed with different objectives, leading to potential negative interactions and unintended performance degradation. We present our experiences in overcoming these challenges by showing how to coordinate separate but interfering elasticity mechanisms to maxmize performance gains with stable and fast parallelism exploitation. We first describe an elastic performance mechanism that automatically adapts different threading models to different regions of an application. We then show a coherent ecosystem for coordinating this threading model elasticty with thread count elasticity. This system is an online, stable multi-level elastic coordination scheme that adapts different regions of a streaming application to different threading models and number of threads. We implemented this multi-level coordination scheme in IBM Streams and demonstrated that it (a) scales to over a hundred threads; (b) can improve performance by an order of magnitude on two different processor architectures when an application can benefit from multiple threading models; and (c) achieves performance comparable to hand-optimized applications but with much fewer threads.

查看原文本刊更多论文

自动化IBM流的多级性能弹性组件

流应用程序展示了管道并行、数据并行和任务并行的大量机会。IBM Streams之前的工作引入了一个弹性线程模型，该模型通过自动调优线程数量来寻求最佳性能。在本文中，我们引入了自动发现线程模型在哪些地方是有益的能力。然而，这带来了一个新的挑战:我们有单独的性能弹性机制，它们被设计为不同的目标，导致潜在的负面交互和意外的性能下降。我们展示了克服这些挑战的经验，展示了如何协调独立但相互干扰的弹性机制，从而通过稳定和快速的并行性利用最大化性能收益。我们首先描述了一种弹性性能机制，它可以自动将不同的线程模型适应应用程序的不同区域。然后，我们展示了一个协调线程模型弹性和线程数弹性的连贯生态系统。该系统是一种在线的、稳定的多级弹性协调方案，能够适应流应用程序的不同区域，适应不同的线程模型和线程数。我们在IBM Streams中实现了这个多级协调方案，并证明了它(a)可以扩展到100多个线程;(b)当应用程序受益于多线程模型时，可以在两种不同的处理器架构上提高一个数量级的性能;(c)实现与手动优化应用程序相当的性能，但线程要少得多。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 20th International Middleware Conference

自引率

0.00%

发文量