Christian Wolf, Andreas Kirmse, Maximilian Burkhalter, Max Hoffmann, Tobias Meisen
{"title":"Model to assess the Economic Profitability of Predictive Maintenance Projects","authors":"Christian Wolf, Andreas Kirmse, Maximilian Burkhalter, Max Hoffmann, Tobias Meisen","doi":"10.1109/HPCS48598.2019.9188221","DOIUrl":"https://doi.org/10.1109/HPCS48598.2019.9188221","url":null,"abstract":"Due to recent developments in data-driven technologies, predictive maintenance has become a promising alternative, especially in comparison to traditional maintenance strategies such as corrective and preventive maintenance. Even though it is currently difficult to assess if the usage of forecasting technologies in the sector of maintenance is able reduce the total cost effectively, answering this question is needed before rolling out algorithms with the aim of adapting predictive maintenance solutions on a larger scale.This paper proposes a profit and cost model that intends to realize an easy application on various processes that involve the assessment of predictive maintenance solutions. The approach divides these solutions into five steps. For each step, technological options are discussed and their costs are quantified. The resulting model can assess the profitability of a single predictive maintenance approach, but can also be applied to evaluate and compare the profitability of different predictive maintenance projects. This approach has been evaluated at a real-world industrial automotive company, where it is currently used to determine future predictive maintenance strategies.","PeriodicalId":371856,"journal":{"name":"2019 International Conference on High Performance Computing & Simulation (HPCS)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129642807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automatic Configuration of OpenFlow in Wireless Mobile Ad hoc Networks","authors":"Sachin Sharma, A. Nag, Paul Stynes, M. Nekovee","doi":"10.1109/HPCS48598.2019.9188200","DOIUrl":"https://doi.org/10.1109/HPCS48598.2019.9188200","url":null,"abstract":"A Mobile wireless Ad hoc NETwork (MANET) is a decentralized wireless network in which mobile wireless nodes either directly communicate with each other or communicate via other wireless nodes in the network. In addition, OpenFlow has disruptive potential in designing a flexible programmable network which can foster innovation, reduce complexity and deliver right economics. In recent years, there are significant interests from research communities to deploy OpenFlow in MANETs. This paper proposes a configuration method with which OpenFlow can be deployed automatically in a MANET without any manual configuration. The proposed configuration method is tested in an emulated MANET created on the Fed4FIRE testbed using Mininet-WiFi (an emulator for wireless software-defined wireless networks). Experimentation includes automatic configuration of OpenFlow in linear, sparse, and dense mobile ad hoc networks. Results show the effectiveness of the method in configuring OpenFlow in wireless mobile ad hoc networks.","PeriodicalId":371856,"journal":{"name":"2019 International Conference on High Performance Computing & Simulation (HPCS)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130637877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Andrew Anderson, Jing Su, Rozenn Dahyot, David Gregg
{"title":"Performance-Oriented Neural Architecture Search","authors":"Andrew Anderson, Jing Su, Rozenn Dahyot, David Gregg","doi":"10.1109/HPCS48598.2019.9188213","DOIUrl":"https://doi.org/10.1109/HPCS48598.2019.9188213","url":null,"abstract":"Hardware-Software Co-Design is a highly successful strategy for improving performance of domain-specific computing systems. We argue for the application of the same methodology to deep learning; specifically, we propose to extend neural architecture search with information about the hardware to ensure that the model designs produced are highly efficient in addition to the typical criteria around accuracy. Using the task of keyword spotting in audio on edge computing devices, we demonstrate that our approach results in neural architecture that is not only highly accurate, but also efficiently mapped to the computing platform which will perform the inference. Using our modified neural architecture search, we demonstrate 0.88% increase in TOP-I accuracy with $ 1.85times$ reduction in latency for keyword spotting in audio on an embedded SoC, and $ 1.59times$ on a high-end GPU.","PeriodicalId":371856,"journal":{"name":"2019 International Conference on High Performance Computing & Simulation (HPCS)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123360486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Throughput Model for Data Stream Processing on Fog Computing","authors":"Felipe Rodrigo de Souza, M. Assunção, E. Caron","doi":"10.1109/HPCS48598.2019.9188146","DOIUrl":"https://doi.org/10.1109/HPCS48598.2019.9188146","url":null,"abstract":"Today’s society faces an unprecedented deluge of data that requires processing and analysis. Data Stream Processing (DSP) applications are often employed to extract valuable information in a timely manner as they can handle data as it is generated. The typical approach for deploying these applications explores the Cloud computing paradigm, which has limitations when data sources are geographically distributed, hence introducing high latency and achieving low processing throughput. To address these problems, current work attempts to take the computation closer to the edges of the Internet, exploring Fog computing. The effective adoption of this approach is achieved with proper throughput modeling that accounts for characteristics of the DSP application and Fog infrastructure, including the location of devices, processing and bandwidth requirements of the application, as well as selectivity and parallelism level of operators. In this work, we propose a throughput model for DSP applications embracing these characteristics. Results show that the model estimates the application throughput with less than 1% error.","PeriodicalId":371856,"journal":{"name":"2019 International Conference on High Performance Computing & Simulation (HPCS)","volume":"132 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121340706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Transforming non textually aligned SPMD programs into textually aligned SPMD programs by using rewriting rules","authors":"Wadoud Bousdira","doi":"10.1109/HPCS48598.2019.9188223","DOIUrl":"https://doi.org/10.1109/HPCS48598.2019.9188223","url":null,"abstract":"The problem of analyzing parallel programs that access shared memory and use barrier synchronization is known to be hard to study. For a special case of those programs with minimal SPMD (Single Program Multiple Data) constructs, a formal definition of textually aligned barriers with an operational semantics has been proposed in previous work. Then, the textual alignement of the synchronization barriers that is defined prevents deadlocks. However, the textual alignement property is not verified by all SPMD programs. We propose a set of transformation rules using rewriting techniques which allows to turn a non-textually aligned program to be textually aligned. So, we can benefit of a simple static analysis for deadlock detection. We show that the rewrite rules form a terminating confluent system and we prove that the transformation rules preserve the semantics of the programs.","PeriodicalId":371856,"journal":{"name":"2019 International Conference on High Performance Computing & Simulation (HPCS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116270323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Extended investigation of performance-energy trade-offs under power capping in HPC environments","authors":"Adam Krzywaniak, P. Czarnul, J. Proficz","doi":"10.1109/HPCS48598.2019.9188149","DOIUrl":"https://doi.org/10.1109/HPCS48598.2019.9188149","url":null,"abstract":"In the paper we present investigation of performance-energy trade-offs under power capping using modern processors. The results are presented for systems targeted at both server and client markets and were collected from Intel Xeon E5 and Intel Xeon Phi server processors as well as from desktop and mobile Intel Core i7 processors. The results, when using power capping, show that we can find various interesting combinations of energy savings and performance drops as well as non-trivial minima of the energy-execution time product. We performed this analysis for a subset of NAS Parallel Benchmark applications: BT, CG, EP and FT and sizes of the computational problem (classes A, B, C, D). We can observe that the energy characteristics visualized by a prototype of our new tool EnergyProfiler do not depend on the size of a computational problem. Consequently, the proposed tool can potentially support quick energy/performance trade-off estimation for codes similar to the tested, well-recognized benchmarks.","PeriodicalId":371856,"journal":{"name":"2019 International Conference on High Performance Computing & Simulation (HPCS)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121660974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tohma Kawasumi, Ryota Tamura, Yuya Asada, Jixin Han, Hiroki Mikami, K. Kimura, H. Kasahara
{"title":"Fast and Highly Optimizing Separate Compilation for Automatic Parallelization","authors":"Tohma Kawasumi, Ryota Tamura, Yuya Asada, Jixin Han, Hiroki Mikami, K. Kimura, H. Kasahara","doi":"10.1109/HPCS48598.2019.9188148","DOIUrl":"https://doi.org/10.1109/HPCS48598.2019.9188148","url":null,"abstract":"Automatic parallelization by a compiler is a promising approach for fully utilizing a multicore processor. Without compiler support, a programmer must simultaneously take into account parallelism in a program and memory hierarchy utilization. However, the possibility of parallelization and optimization across multiple compilation units is limited due to the lack of interprocedural analysis information at the compile time. This is a serious challenge surrounding parallelizing practical programs because they usually consist of multiple compilation units and employ separate compilation to ensure program maintainability and reduce the recompilation time. In this paper, for automatic parallelization by a compiler, we propose a separate compilation method that enables parallelization across multiple compilation units and minimizes recompilation time by providing information about the analysis along with an object file for each compilation unit at the compile time. We also propose an automatically parallelizing compilation flow with analysis information. The experimental evaluation using large size real control system programs from industry shows the proposed technique can obtain 29% better performance than the separate compilation without the proposed method, and reduce compilation time by up to 90% with only 1% of performance loss, compared with the compilation for the fully unified source code into a single compilation unit.","PeriodicalId":371856,"journal":{"name":"2019 International Conference on High Performance Computing & Simulation (HPCS)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126563935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Oriol Tintó Prims, M. Acosta, M. Castrillo, S. P. Ticco, K. Serradell, A. Cortés, F. Doblas-Reyes
{"title":"Discriminating accurate results in nonlinear models","authors":"Oriol Tintó Prims, M. Acosta, M. Castrillo, S. P. Ticco, K. Serradell, A. Cortés, F. Doblas-Reyes","doi":"10.1109/HPCS48598.2019.9188178","DOIUrl":"https://doi.org/10.1109/HPCS48598.2019.9188178","url":null,"abstract":"Non-linear models are challenging when it is time to verify that a certain HPC optimization does not degrade the accuracy of a model. Any apparently insignificant change in the code, in the software stack, or in the HPC system used can prevent bit-to-bit reproducibility, which added to the intrinsic nonlinearities of the model can lead to differences in the results of the simulation. Being able to deduce whether the different results can be explained by the internal variability of the model can help to decide if a specific change is acceptable. This manuscript presents a method that consists in estimating the uncertainty of the model outputs by doing many almost-identical simulations slightly modifying the model inputs. The statistical information extracted from these simulations can be used to discern if the results of a given simulation are indistinguishable or instead there are significant differences. Two illustrative usage examples of the method are provided, the first one studying whether a Lorenz system model can use less numerical precision and the second one studying whether the state-of-the art ocean model NEMO can safely use certain compiler optimization flags.","PeriodicalId":371856,"journal":{"name":"2019 International Conference on High Performance Computing & Simulation (HPCS)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121909213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yi-Hong Lyu, C. Liu, Chen-Pang Lee, Chia-Heng Tu, Shih-Hao Hung
{"title":"Modeling Interprocessor Communication and Performance Scalability for Distributed Deep Learning Systems","authors":"Yi-Hong Lyu, C. Liu, Chen-Pang Lee, Chia-Heng Tu, Shih-Hao Hung","doi":"10.1109/HPCS48598.2019.9188168","DOIUrl":"https://doi.org/10.1109/HPCS48598.2019.9188168","url":null,"abstract":"While deep learning applications become popular, the design of deep learning systems is a critical task to unleash the computing power of underlying systems. Aside from the computing hardware, the computer networking is also a key factor that affects the delivered performance. When considering a large and complex model, the scalability of the system highly depends on the design of the networks, as well as the software behaviors. In this paper, we propose a profile-data-guided performance prediction method to estimate the performance of the system with desired high-speed interconnects, based on the profiling data obtained in a previous run. In particular, we leverage the open-source profiling tool, SOFA, for characterizing the software activities of deep learning software running in a computer cluster, and the characterized information is used to build the performance model for the model training process. When estimating the performance, SOFA is used to capture the performance critical factors for the model to make the predictions. To evaluate the proposed method, four popular deep learning models are adopted in our experiments, ResNet50, Inception3, Alexnet, and VGG16, where a computer cluster formed by four nodes is used to profile the training of the above models on TensorFlow. We ran the scalability analysis to analyze the size of the cluster, and the suitable computer networks for the models. By comparing the predicted data and those measured on the cluster, our model achieves up to 95% accuracy in most of the cases, with the maximum error rate of 10%.","PeriodicalId":371856,"journal":{"name":"2019 International Conference on High Performance Computing & Simulation (HPCS)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131931787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"RA3D: Reputation-based Adaptive 3D Video Delivery in Heterogeneous Wireless Networks","authors":"Ting Bi, Longhao Zou, Shengyang Chen, Zehou Zhang, R. Trestian, Gabriel-Miro Muntean","doi":"10.1109/HPCS48598.2019.9188117","DOIUrl":"https://doi.org/10.1109/HPCS48598.2019.9188117","url":null,"abstract":"The continuous innovations in both high-end mobile devices and wireless communication technologies, fueled by the growing interest of mobile device users, have driven the latest development of mobile 3D video services. One of the critical challenges for mobile 3D video delivery is the limited bandwidth provided by the wireless communications between mobile devices. Nowadays, network operators are trying to cope with significant increase of data traffic amount and adopt diverse solutions to expand their network capacity. Among long-term solutions is network convergence, which involves close interworking of existing 2.5G/3G/4G networks with the new generation networks in terms of handover, network selection and network integration with other networks (e.g., WLAN, terrestrial microwave network, satellite network, etc.). In this context, the diversification in mobile devices and heterogeneity of the wireless environment make provision of always best connectivity of mobile users a challenge for the service providers. This paper proposes RA3D, a reputation-based adaptive 3D video delivery solution in heterogeneous wireless networks, which supports Always Best Experience to the mobile users by making use of multipath content delivery technologies.","PeriodicalId":371856,"journal":{"name":"2019 International Conference on High Performance Computing & Simulation (HPCS)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132243524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}