协同统计模型的自动并行计算

2018 IEEE 14th International Conference on e-Science (e-Science) Pub Date : 2018-10-01 DOI:10.1109/eScience.2018.00089

E. G. P. Bos, I. Pelupessy, V. Croft, W. Verkerke, C. Burgard

{"title":"协同统计模型的自动并行计算","authors":"E. G. P. Bos, I. Pelupessy, V. Croft, W. Verkerke, C. Burgard","doi":"10.1109/eScience.2018.00089","DOIUrl":null,"url":null,"abstract":"RooFit [4], [6] is the statistical modeling and fitting package used in many big particle physics experiments to extract physical parameters from reduced particle collision data, e.g. the Higgs boson experiments at the LHC [1], [2]. RooFit aims to separate particle physics model building and fitting (the users’ goals) from their technical implementation and optimization in the back-end. In this paper, we outline our efforts to further optimize the back-end by automatically running major parts of user models in parallel on multi-core machines. A major challenge is that RooFit allows users to define many different types of models, with different types of computational bottlenecks. Our automatic parallelization framework must then be flexible, while still reducing run-time by at least an order of magnitude, preferably more. We have performed extensive benchmarks and identified at least three bottlenecks that will benefit from parallelization. To tackle these and possible future bottlenecks, we designed a parallelization layer that allows us to parallelize existing classes with minimal effort, but with high performance and retaining as much of the existing class’s interface as possible. The high-level parallelization model is a task-stealing approach. The implementation is currently based on a multi-process approach using a bi-directional memory mapped pipe for communication, which is both easy to use and highly performant. Preliminary results show speed-ups of factor 2 to 20, depending on the exact model and parallelization strategy.","PeriodicalId":6476,"journal":{"name":"2018 IEEE 14th International Conference on e-Science (e-Science)","volume":"1 1","pages":"345-346"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Automated Parallel Calculation of Collaborative Statistical Models in RooFit\",\"authors\":\"E. G. P. Bos, I. Pelupessy, V. Croft, W. Verkerke, C. Burgard\",\"doi\":\"10.1109/eScience.2018.00089\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"RooFit [4], [6] is the statistical modeling and fitting package used in many big particle physics experiments to extract physical parameters from reduced particle collision data, e.g. the Higgs boson experiments at the LHC [1], [2]. RooFit aims to separate particle physics model building and fitting (the users’ goals) from their technical implementation and optimization in the back-end. In this paper, we outline our efforts to further optimize the back-end by automatically running major parts of user models in parallel on multi-core machines. A major challenge is that RooFit allows users to define many different types of models, with different types of computational bottlenecks. Our automatic parallelization framework must then be flexible, while still reducing run-time by at least an order of magnitude, preferably more. We have performed extensive benchmarks and identified at least three bottlenecks that will benefit from parallelization. To tackle these and possible future bottlenecks, we designed a parallelization layer that allows us to parallelize existing classes with minimal effort, but with high performance and retaining as much of the existing class’s interface as possible. The high-level parallelization model is a task-stealing approach. The implementation is currently based on a multi-process approach using a bi-directional memory mapped pipe for communication, which is both easy to use and highly performant. Preliminary results show speed-ups of factor 2 to 20, depending on the exact model and parallelization strategy.\",\"PeriodicalId\":6476,\"journal\":{\"name\":\"2018 IEEE 14th International Conference on e-Science (e-Science)\",\"volume\":\"1 1\",\"pages\":\"345-346\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 14th International Conference on e-Science (e-Science)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/eScience.2018.00089\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 14th International Conference on e-Science (e-Science)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/eScience.2018.00089","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

RooFit[4]，[6]是许多大型粒子物理实验中使用的统计建模和拟合包，用于从粒子碰撞数据中提取物理参数，例如大型强子对撞机的希格斯玻色子实验[1]，[2]。RooFit旨在将粒子物理模型的构建和拟合(用户的目标)与后端技术实现和优化分开。在本文中，我们概述了通过在多核机器上自动并行运行用户模型的主要部分来进一步优化后端所做的努力。一个主要的挑战是，RooFit允许用户定义许多不同类型的模型，这些模型具有不同类型的计算瓶颈。然后，我们的自动并行化框架必须是灵活的，同时仍然至少减少一个数量级的运行时间，最好更多。我们已经执行了大量的基准测试，并确定了至少三个将从并行化中受益的瓶颈。为了解决这些和未来可能出现的瓶颈，我们设计了一个并行化层，它允许我们以最小的工作量并行化现有的类，但具有高性能并尽可能多地保留现有类的接口。高级并行化模型是一种任务窃取方法。该实现目前基于多进程方法，使用双向内存映射管道进行通信，既易于使用又高性能。初步结果显示，根据具体的模型和并行化策略，速度可以提高2到20倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automated Parallel Calculation of Collaborative Statistical Models in RooFit

RooFit [4], [6] is the statistical modeling and fitting package used in many big particle physics experiments to extract physical parameters from reduced particle collision data, e.g. the Higgs boson experiments at the LHC [1], [2]. RooFit aims to separate particle physics model building and fitting (the users’ goals) from their technical implementation and optimization in the back-end. In this paper, we outline our efforts to further optimize the back-end by automatically running major parts of user models in parallel on multi-core machines. A major challenge is that RooFit allows users to define many different types of models, with different types of computational bottlenecks. Our automatic parallelization framework must then be flexible, while still reducing run-time by at least an order of magnitude, preferably more. We have performed extensive benchmarks and identified at least three bottlenecks that will benefit from parallelization. To tackle these and possible future bottlenecks, we designed a parallelization layer that allows us to parallelize existing classes with minimal effort, but with high performance and retaining as much of the existing class’s interface as possible. The high-level parallelization model is a task-stealing approach. The implementation is currently based on a multi-process approach using a bi-directional memory mapped pipe for communication, which is both easy to use and highly performant. Preliminary results show speed-ups of factor 2 to 20, depending on the exact model and parallelization strategy.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 IEEE 14th International Conference on e-Science (e-Science)

自引率

0.00%

发文量