BenchCouncil Transactions on Benchmarks, Standards and Evaluations最新文献

筛选
英文 中文
Workflow Critical Path: A data-oriented critical path metric for Holistic HPC Workflows 工作流关键路径:面向数据的整体HPC工作流关键路径度量
BenchCouncil Transactions on Benchmarks, Standards and Evaluations Pub Date : 2021-10-01 DOI: 10.1016/j.tbench.2021.100001
Daniel D. Nguyen, Karen L. Karavanic
{"title":"Workflow Critical Path: A data-oriented critical path metric for Holistic HPC Workflows","authors":"Daniel D. Nguyen,&nbsp;Karen L. Karavanic","doi":"10.1016/j.tbench.2021.100001","DOIUrl":"https://doi.org/10.1016/j.tbench.2021.100001","url":null,"abstract":"<div><p>Current trends in HPC, such as the push to exascale, convergence with Big Data, and growing complexity of HPC applications, have created gaps that traditional performance tools do not cover. One example is Holistic HPC Workflows — HPC workflows comprising multiple codes, paradigms, or platforms that are not developed using a workflow management system. To diagnose the performance of these applications, we define a new metric called Workflow Critical Path (WCP), a data-oriented metric for Holistic HPC Workflows. WCP constructs graphs that span across the workflow codes and platforms, using data states as vertices and data mutations as edges. Using cloud-based technologies, we implement a prototype called Crux, a distributed analysis tool for calculating and visualizing WCP. Our experiments with a workflow simulator on Amazon Web Services show Crux is scalable and capable of correctly calculating WCP for common Holistic HPC workflow patterns. We explore the use of WCP and discuss how Crux could be used in a production HPC environment.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"1 1","pages":"Article 100001"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485921000016/pdfft?md5=b03db3d209b242d8a2f663d25834fd8c&pid=1-s2.0-S2772485921000016-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92003797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Latency-aware automatic CNN channel pruning with GPU runtime analysis 延迟感知自动CNN频道修剪与GPU运行时分析
BenchCouncil Transactions on Benchmarks, Standards and Evaluations Pub Date : 2021-10-01 DOI: 10.1016/j.tbench.2021.100009
Jiaqiang Liu, Jingwei Sun, Zhongtian Xu, Guangzhong Sun
{"title":"Latency-aware automatic CNN channel pruning with GPU runtime analysis","authors":"Jiaqiang Liu,&nbsp;Jingwei Sun,&nbsp;Zhongtian Xu,&nbsp;Guangzhong Sun","doi":"10.1016/j.tbench.2021.100009","DOIUrl":"10.1016/j.tbench.2021.100009","url":null,"abstract":"<div><p>The huge storage and computation cost of convolutional neural networks (CNN) make them challenging to meet the real-time inference requirement in many applications. Existing channel pruning methods mainly focus on removing unimportant channels in a CNN model based on rule-of-thumb designs, using reduced floating-point operations (FLOPs) and parameter numbers to measure the pruning quality. The inference latency of pruned models is often overlooked. In this paper, we propose a latency-aware automatic CNN channel pruning method (LACP), which aims to search low latency and accurate pruned network structure automatically. We evaluate the inaccuracy of measuring pruning quality by FLOPs and the number of parameters, and use the model inference latency as the direct optimization metric. To bridge model pruning and inference acceleration, we analyze the inference latency of convolutional layers on GPU. Results show that the inference latency of convolutional layers exhibits a staircase pattern along with channel number due to the GPU tail effect. Based on that observation, we greatly shrink the search space of network structures. Then we apply an evolutionary procedure to search a computationally efficient pruned network structure, which reduces the inference latency and maintains the model accuracy. Experiments and comparisons with state-of-the-art methods on three image classification datasets show that our method can achieve better inference acceleration with less accuracy loss.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"1 1","pages":"Article 100009"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485921000090/pdfft?md5=e3e618453811eda67a8549ff2a96e500&pid=1-s2.0-S2772485921000090-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77841061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Benchmarking for Observability: The Case of Diagnosing Storage Failures 可观察性基准测试:以存储故障诊断为例
BenchCouncil Transactions on Benchmarks, Standards and Evaluations Pub Date : 2021-10-01 DOI: 10.1016/j.tbench.2021.100006
Duo Zhang, Mai Zheng
{"title":"Benchmarking for Observability: The Case of Diagnosing Storage Failures","authors":"Duo Zhang,&nbsp;Mai Zheng","doi":"10.1016/j.tbench.2021.100006","DOIUrl":"10.1016/j.tbench.2021.100006","url":null,"abstract":"<div><p>Diagnosing storage system failures is challenging even for professionals. One recent example is the “When Solid State Drives Are Not That Solid” incident occurred at Algolia data center, where Samsung SSDs were mistakenly blamed for failures caused by a Linux kernel bug. With the system complexity keeps increasing, diagnosing failures will likely become more difficult.</p><p>To better understand real-world failures and the potential limitations of state-of-the-art tools, we first conduct an empirical study on 277 user-reported storage failures in this paper. We characterize the issues along multiple dimensions (e.g., time to resolve, kernel components involved), which provides a quantitative measurement of the challenge in practice. Moreover, we analyze a set of the storage issues in depth and derive a benchmark suite called <span><math><mrow><mi>B</mi><mi>u</mi><mi>g</mi><mi>B</mi><mi>e</mi><mi>n</mi><mi>c</mi><msup><mrow><mi>h</mi></mrow><mrow><mi>k</mi></mrow></msup></mrow></math></span>. The benchmark suite includes the necessary workloads and software environments to reproduce 9 storage failures, covers 4 different file systems and the block I/O layer of the storage stack, and enables realistic evaluation of diverse kernel-level tools for debugging.</p><p>To demonstrate the usage, we apply <span><math><mrow><mi>B</mi><mi>u</mi><mi>g</mi><mi>B</mi><mi>e</mi><mi>n</mi><mi>c</mi><msup><mrow><mi>h</mi></mrow><mrow><mi>k</mi></mrow></msup></mrow></math></span> to study two representative tools for debugging. We focus on measuring the observations that the tools enable developers to make (i.e., observability), and derive concrete metrics to measure the observability qualitatively and quantitatively. Our measurement demonstrates the different design tradeoffs in terms of debugging information and overhead. More importantly, we observe that both tools may behave abnormally when applied to diagnose a few tricky cases. Also, we find that neither tool can provide low-level information on how the persistent storage states are changed, which is essential for understanding storage failures. To address the limitation, we develop lightweight extensions to enable such functionality in both tools. We hope that <span><math><mrow><mi>B</mi><mi>u</mi><mi>g</mi><mi>B</mi><mi>e</mi><mi>n</mi><mi>c</mi><msup><mrow><mi>h</mi></mrow><mrow><mi>k</mi></mrow></msup></mrow></math></span> and the enabled measurements will inspire follow-up research in benchmarking and tool support and help address the challenge of failure diagnosis in general.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"1 1","pages":"Article 100006"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485921000065/pdfft?md5=ccacbd5e0872d394b75b85db5386c9b4&pid=1-s2.0-S2772485921000065-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75003267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Benchmarking feature selection methods with different prediction models on large-scale healthcare event data 对大规模医疗事件数据上不同预测模型的特征选择方法进行基准测试
BenchCouncil Transactions on Benchmarks, Standards and Evaluations Pub Date : 2021-10-01 DOI: 10.1016/j.tbench.2021.100004
Fan Zhang , Chunjie Luo , Chuanxin Lan , Jianfeng Zhan
{"title":"Benchmarking feature selection methods with different prediction models on large-scale healthcare event data","authors":"Fan Zhang ,&nbsp;Chunjie Luo ,&nbsp;Chuanxin Lan ,&nbsp;Jianfeng Zhan","doi":"10.1016/j.tbench.2021.100004","DOIUrl":"10.1016/j.tbench.2021.100004","url":null,"abstract":"<div><p>With the development of the Electronic Health Record (EHR) technique, vast volumes of digital clinical data are generated. Based on the data, many methods are developed to improve the performance of clinical predictions. Among those methods, Deep Neural Networks (DNN) have been proven outstanding with respect to accuracy by employing many patient instances and events (features). However, each patient-specific event requires time and money. Collecting too many features before making a decision is insufferable, especially for time-critical tasks such as mortality prediction. So it is essential to predict with high accuracy using as minimal clinical events as possible, which makes feature selection a critical question. This paper presents detailed benchmarking results of various feature selection methods, applying different classification and regression algorithms for clinical prediction tasks, including mortality prediction, length of stay prediction, and ICD-9 code group prediction. We use the publicly available dataset, Medical Information Mart for Intensive Care III (MIMIC-III), in our experiments. Our results show that Genetic Algorithm (GA) based methods perform well with only a few features and outperform others. Besides, for the mortality prediction task, the feature subset selected by GA for one classifier can also be used to others while achieving good performance.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"1 1","pages":"Article 100004"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485921000041/pdfft?md5=99fde57d63abff83585ab50c968fd9b0&pid=1-s2.0-S2772485921000041-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81823992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Workflow Critical Path: A data-oriented critical path metric for Holistic HPC Workflows 工作流关键路径:面向数据的整体HPC工作流关键路径度量
BenchCouncil Transactions on Benchmarks, Standards and Evaluations Pub Date : 2021-10-01 DOI: 10.15760/etd.7369
Daniel D. Nguyen, K. Karavanic
{"title":"Workflow Critical Path: A data-oriented critical path metric for Holistic HPC Workflows","authors":"Daniel D. Nguyen, K. Karavanic","doi":"10.15760/etd.7369","DOIUrl":"https://doi.org/10.15760/etd.7369","url":null,"abstract":"............................................................................................................................... i List of Tables ...................................................................................................................... iv List of Figures ..................................................................................................................... v Chapter 1: Introduction ....................................................................................................... 1 1.1 Motivation ................................................................................................................. 3 1.2 Definitions ................................................................................................................. 5 1.3 Thesis Statement ....................................................................................................... 6 1.4 Contributions ............................................................................................................. 6 Chapter 2: Background ........................................................................................................ 9 2.1 Parallel Computing .................................................................................................... 9 2.2 Critical Path Analysis ................................................................................................ 9 2.3 High Performance Computing ................................................................................ 11 2.4 Holistic HPC Workflows ........................................................................................ 12 2.5 Instrumentation, Profiling, and Tracing .................................................................. 12 Chapter 3: Related Work ................................................................................................... 14 3.1 Workflow Management Systems ............................................................................ 14 3.2 Distributed Systems Tracing Tools ......................................................................... 17 3.3 HPC Performance Measurement Tools ................................................................... 20 3.4 Performance Analysis of Scientific Workflows ...................................................... 21 Chapter 4: Architecture ..................................................................................................... 22 4.1 Data State ................................................................................................................ 22 4.2 Crux UI .................................................................................................................... 24 4.3 Crux API ................................................................................................................. 24 4.4 Crux Database ......................................................................................................... 27","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"80 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77251801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Revisiting the effects of the Spectre and Meltdown patches using the top-down microarchitectural method and purchasing power parity theory 使用自上而下的微架构方法和购买力平价理论重新审视Spectre和Meltdown补丁的影响
BenchCouncil Transactions on Benchmarks, Standards and Evaluations Pub Date : 2021-10-01 DOI: 10.1016/j.tbench.2021.100011
Yectli A. Huerta , David J. Lilja
{"title":"Revisiting the effects of the Spectre and Meltdown patches using the top-down microarchitectural method and purchasing power parity theory","authors":"Yectli A. Huerta ,&nbsp;David J. Lilja","doi":"10.1016/j.tbench.2021.100011","DOIUrl":"10.1016/j.tbench.2021.100011","url":null,"abstract":"<div><p>Software patches are made available to fix security vulnerabilities, enhance performance, and usability. Previous works focused on measuring the performance effect of patches on benchmark runtimes. In this study, we used the Top-Down microarchitecture analysis method to understand how pipeline bottlenecks were affected by the application of the Spectre and Meltdown security patches. Bottleneck analysis makes it possible to better understand how different hardware resources are being utilized, highlighting portions of the pipeline where possible improvements could be achieved. We complement the Top-Down analysis technique with the use a normalization technique from the field of economics, purchasing power parity (PPP), to better understand the relative difference between patched and unpatched runs. In this study, we showed that security patches had an effect that was reflected on the corresponding Top-Down metrics. We showed that recent compilers are not as negatively affected as previously reported. Out of the 14 benchmarks that make up the SPEC OMP2012 suite, three had noticeable slowdowns when the patches were applied. We also found that Top-Down metrics had large relative differences when the security patches were applied, differences that standard techniques based in absolute, non-normalized, metrics failed to highlight.</p></div>","PeriodicalId":100155,"journal":{"name":"BenchCouncil Transactions on Benchmarks, Standards and Evaluations","volume":"1 1","pages":"Article 100011"},"PeriodicalIF":0.0,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772485921000119/pdfft?md5=6e1139ebac69c084c6fe482b82fdd42c&pid=1-s2.0-S2772485921000119-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91428938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信