B. Keömley-Horváth, G. Horváth, Péter Polcz, Bálint Siklósi, Kálmán Tornai, János Juhász, G. Szederkényi, György Cserey, Attila Csikász-Nagy, I. Reguly
{"title":"The Design and Utilisation of PanSim, a Portable Pandemic Simulator","authors":"B. Keömley-Horváth, G. Horváth, Péter Polcz, Bálint Siklósi, Kálmán Tornai, János Juhász, G. Szederkényi, György Cserey, Attila Csikász-Nagy, I. Reguly","doi":"10.1109/CIW-IUS56691.2022.00006","DOIUrl":"https://doi.org/10.1109/CIW-IUS56691.2022.00006","url":null,"abstract":"The COVID-19 pandemic has presented a clear and present need for urgent decision making. Set in an environment of uncertain and unreliable data, and a diverse range of possible interventions, there is an obvious need for integrating HPC into workflows that include model calibration, and the exploration of the decision space. In this paper, we present the design of PanSim, a portable, performant, and productive agent-based simulator, which has been extensively used to model and forecast the pandemic in Hungary. We show its performance and scalability on CPUs and GPUs, then we discuss the workflows PanSim integrates into. We describe the heterogeneous, resource-constrained HPC environment available to us, and formulate a scheduling optimisation problem, as well as heuristics to solve them, to either minimise the execution time of a given number of simulations or to maximise the number of simulations executed in a given time frame.","PeriodicalId":360051,"journal":{"name":"2022 First Combined International Workshop on Interactive Urgent Supercomputing (CIW-IUS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114715327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
T. Bechtel, A. Nelson, L. Lao, Z. Xing, S. Smith, R. Nazikian, S. Flanagan, D. Schissel, L. Stephey, R. Thomas, S. Williams, O. Antepara, E. Dart, E. Koleman, W. Tang
{"title":"Accelerated Workflow for Advanced Kinetic Equilibria","authors":"T. Bechtel, A. Nelson, L. Lao, Z. Xing, S. Smith, R. Nazikian, S. Flanagan, D. Schissel, L. Stephey, R. Thomas, S. Williams, O. Antepara, E. Dart, E. Koleman, W. Tang","doi":"10.1109/CIW-IUS56691.2022.00008","DOIUrl":"https://doi.org/10.1109/CIW-IUS56691.2022.00008","url":null,"abstract":"Kinetic equilibria are a fundamental aspect of tokamak plasma analysis, but are often highly specialized and labor intensive to produce. This has become a bottleneck to both deeper physics understandings and more sophisticated exper-ment controls. This project aims to remove these barriers by developing a rapid, fully-automated workflow to produce better-than-human, high-precision whole-discharge kinetic equilibria. The required elements in this workflow now exist separately, but what is missing is the coupling of different aspects and overall performance optimization. We have designed this workflow for the DIII-D national fusion facility with the goal of producing results quickly enough to be used for experiment planning in the 15–20 minute time window between subsequent discharges. This is made possible by on-demand availability of supercomputing resources and direct access between these systems and experimental data servers. The results will also be stored in a readily available database for more detailed follow-up analysis. Beyond the between shot application, we plan to apply the workflow offline to the full historical database of the DIII-D experiment. This will both provide scientists a wider set of data than presently available for sophisticated scoping studies and also establish a strong base for training AI/ML surrogate models capable of producing equilibria or their derivatives with similar fidelity at vastly greater speed. The workflow developed here is intended to serve as a prototype that can be replicated on other plasma experiments and provide timely and essential information for the international burning plasma experiment, ITER, as well as next stage fusion power plants. Equilibrium analysis is only the tip of the iceberg in terms of the potential which can be offered by automated superfacility workflows. The foundation of this workflow has been established and tested using of a full scale mock example on the Perlmutter system at NERSC. The results suggest that it may be possible to achieve our goal within a target 10 minute window since there is potential for performance improvement.","PeriodicalId":360051,"journal":{"name":"2022 First Combined International Workshop on Interactive Urgent Supercomputing (CIW-IUS)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127277807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Message from the CIW-IUS 22 Workshop Chairs","authors":"","doi":"10.1109/ciw-ius56691.2022.00004","DOIUrl":"https://doi.org/10.1109/ciw-ius56691.2022.00004","url":null,"abstract":"","PeriodicalId":360051,"journal":{"name":"2022 First Combined International Workshop on Interactive Urgent Supercomputing (CIW-IUS)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130105904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Web-Based Volunteer Distributed Computing for Handling Time-Critical Urgent Workloads","authors":"Nick Brown, S. Newby","doi":"10.1109/CIW-IUS56691.2022.00007","DOIUrl":"https://doi.org/10.1109/CIW-IUS56691.2022.00007","url":null,"abstract":"Urgent computing workloads are time critical, unpredictable, and highly dynamic. Whilst efforts are on-going to run these on traditional HPC machines, another option is to leverage the computing power donated by volunteers. Volunteer computing, where members of the public donate some of their CPU time to large scale projects has been popular for many years because it is a powerful way of delivering compute for specific problems, with the public often eager to contribute to a good cause with societal benefits. However, traditional volunteer computing has required user installation of specialist software which is a barrier to entry, and the development of the software itself by the projects, even on-top of existing frameworks, is nontrivial. As such, the number of users donating CPU time to these volunteer computing projects has decreased in recent years, and this comes at a time when the frequency of disasters, often driven by climate change, are rising fast. We believe that an alternative approach, where visitors to websites donate some of their CPU time whilst they are browsing, has the potential to address these issues. However, web-based distributed computing is an immature field and there are numerous questions that must be answered to fully understand the viability of leveraging the large scale parallelism that website visitors represent. In this paper we describe our web-based distributed computing framework, Panther, and perform in-depth performance experiments for two benchmarks using real world hardware and real world browsing habits for the first time. By exploring the performance characteristics of our approach we demonstrate that this is viable for urgent workloads, but there are numerous caveats, not least the most appropriate visitor patterns to a website, that must be considered.","PeriodicalId":360051,"journal":{"name":"2022 First Combined International Workshop on Interactive Urgent Supercomputing (CIW-IUS)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121223351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"CIW-IUS 22 Workshop Organization","authors":"","doi":"10.1109/ciw-ius56691.2022.00005","DOIUrl":"https://doi.org/10.1109/ciw-ius56691.2022.00005","url":null,"abstract":"","PeriodicalId":360051,"journal":{"name":"2022 First Combined International Workshop on Interactive Urgent Supercomputing (CIW-IUS)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122021656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Daniel Balouek-Thomert, E. Caron, L. Lefèvre, Manish Parashar
{"title":"Towards a Methodology for Building Dynamic Urgent Applications on Continuum Computing Platforms","authors":"Daniel Balouek-Thomert, E. Caron, L. Lefèvre, Manish Parashar","doi":"10.1109/CIW-IUS56691.2022.00009","DOIUrl":"https://doi.org/10.1109/CIW-IUS56691.2022.00009","url":null,"abstract":"Advanced cyberinfrastructure aims at making the use of streaming data a common practice in the scientific community. They offer an ecosystem that links data, compute, network, and users to deliver knowledge obtained from multiple data sources using large-scale computational models. However, integrating this heterogeneous data with time-sensitive systems is difficult due to a lack of programming abstractions that can allow data-driven reactive behaviors throughout the edge-to-cloud/HPC computing continuum. Here we present a methodology for incorporating contextual information into the application logic while taking into consideration the heterogeneity of the underlying platform and the unpredictability of the data. A fire science scenario that includes sensors at the network's edge for smoke detection and computational models launched in the cloud for wildfire simulation and air quality assessment serves as the inspiration for this method. We then discuss research directions for tackling similar scenarios with a particular focus on resource management and programming models.","PeriodicalId":360051,"journal":{"name":"2022 First Combined International Workshop on Interactive Urgent Supercomputing (CIW-IUS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129573448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}