2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)最新文献_第4页

A Three-Dimensional Networks-on-Chip Architecture with Dynamic Buffer Sharing 具有动态缓冲区共享的三维片上网络体系结构

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) Pub Date : 2016-04-04 DOI: 10.1109/PDP.2016.124

Seyyed Hossein Seyyedaghaei Rezaei, M. Modarressi, M. Daneshtalab, Shervin Roshanisefat

引用次数: 3

Energy-Aware Programming Model for Distributed Infrastructures 分布式基础设施的能量感知规划模型

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) Pub Date : 2016-04-04 DOI: 10.1109/PDP.2016.39

F. Lordan, J. Ejarque, R. Sirvent, Rosa M. Badia

引用次数: 8

Randomizing Packet Memory Networks for Low-Latency Processor-Memory Communication 低延迟处理器-存储器通信的随机分组存储器网络

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) Pub Date : 2016-04-04 DOI: 10.1109/PDP.2016.18

Daichi Fujiki, Hiroki Matsutani, M. Koibuchi, H. Amano

{"title":"Randomizing Packet Memory Networks for Low-Latency Processor-Memory Communication","authors":"Daichi Fujiki, Hiroki Matsutani, M. Koibuchi, H. Amano","doi":"10.1109/PDP.2016.18","DOIUrl":"https://doi.org/10.1109/PDP.2016.18","url":null,"abstract":"Three-dimensional stacked memory is considered to be one of the innovative elements for the next-generation computing system, for it provides high bandwidth and energy efficiency. Particularly, packet routing ability of Hybrid Memory Cubes (HMCs) enables new interconnects for the memories, giving flexibility to its topological design space. Since memory-processor communication is latency-sensitive, our challenge is to alleviate latency of the memory interconnection network, which is subject to high overheads from hop-count increase. Interestingly, random network topologies are known to have remarkably low diameter that is even comparable to theoretical Moore graph. In this context, we first propose to exploit the random topologies for the memory networks. Second, we also propose several optimizations to leverage the random topologies to be further adaptive to the latency-sensitive memory-processor communication: communication path length based selection, deterministic minimal routing, and page-size granularity memory mapping. Finally, we present interesting results of our evaluation: the random networks with universal memory access outperformed non-random networks of which memory access was optimally localized.","PeriodicalId":192273,"journal":{"name":"2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134085223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

A Quantitative Performance Evaluation of Fast on-Chip Memories of GPUs gpu快速片上存储器的定量性能评价

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) Pub Date : 2016-04-04 DOI: 10.1109/PDP.2016.56

E. Konstantinidis, Y. Cotronis

{"title":"A Quantitative Performance Evaluation of Fast on-Chip Memories of GPUs","authors":"E. Konstantinidis, Y. Cotronis","doi":"10.1109/PDP.2016.56","DOIUrl":"https://doi.org/10.1109/PDP.2016.56","url":null,"abstract":"Modern Graphics Processing Units (GPUs) have evolved to high performance general purpose processors, forming an alternative to CPUs. However, programming them effectively has proven to be a challenge, not only due to the mandatory requirement of extracting massive fine grained parallelism but also due to its susceptible performance on memory traffic. Apart from regular memory caches, GPUs feature other types of fast memories as well, for instance scratchpads, texture caches, etc. In order to gain more insight to the efficient usage of these memory types some quantitative performance measures could be beneficial. In this paper we describe a set of micro-benchmarks which aim to provide effective bandwidth performance measurements of the on-chip special memories of GPUs. We compare the peak measurements of different memory types and the use of different data type sizes. In addition, we validate the peak measurements on real world problems as provided by the polybench-gpu benchmark suite. We compare the profiling bandwidth of on-chip memories with the peak measurements as captured with the proposed micro-benchmarks. The source code of the micro-benchmark suite is publicly available.","PeriodicalId":192273,"journal":{"name":"2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125639609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

Lessons Learned from Spatial and Temporal Correlation of Node Failures in High Performance Computers 高性能计算机节点故障时空相关性研究

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) Pub Date : 2016-04-04 DOI: 10.1109/PDP.2016.101

Siavash Ghiasvand, F. Ciorba, R. Tschüter, W. Nagel

引用次数: 13

Cloud-Based NoSQL Data Migration 基于云的NoSQL数据迁移

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) Pub Date : 2016-04-04 DOI: 10.1109/PDP.2016.111

Aryan Bansel, H. González-Vélez, Adriana E. Chis

{"title":"Cloud-Based NoSQL Data Migration","authors":"Aryan Bansel, H. González-Vélez, Adriana E. Chis","doi":"10.1109/PDP.2016.111","DOIUrl":"https://doi.org/10.1109/PDP.2016.111","url":null,"abstract":"Cloud computing has enabled the Database-as-a-Service (DBaaS) model to manage large volumes of user-generated data using NoSQL data repositories. There are several NoSQL implementations such as document, columnar, and key-value which ensure high availability, fault tolerance and scalability to serve distinct client requirements. Nonetheless, different NoSQL data models may also introduce unnecessary heterogeneity in DBaaS, which further restricts the user to migrate the application services according to business or technology changes. In this paper, we propose a NoSQL data migration framework to foster data portability across cloud-based heterogeneous NoSQL data repositories. The proposed approach involves data standardisation and classification stages to render an efficient mapping, and translation between cloud-based different NoSQL data stores. The current implementation of the framework supports three different data models: document, columnar and graph. Moreover, the framework is meta-model driven, and therefore allows developers to extend the support for new database models. Our approach includes an online compression algorithm for data migration (document to graph) whereby a graph database requires up to 46% less space. There is also a significant reduction (37% to 55%) in the number of nodes in the compressed graph database.","PeriodicalId":192273,"journal":{"name":"2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)","volume":"261 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115283766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 21

Suitability of the Random Topology for HPC Applications 随机拓扑在高性能计算应用中的适用性

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) Pub Date : 2016-04-04 DOI: 10.1109/PDP.2016.10

Fabien Chaix, I. Fujiwara, M. Koibuchi

引用次数: 18

A Machine Learning Approach for the Integration of miRNA-Target Predictions 一种集成mirna目标预测的机器学习方法

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) Pub Date : 2016-04-04 DOI: 10.1109/PDP.2016.125

S. Beretta, M. Castelli, Yuliana Martínez, Luis Muñoz Delgado, Sara Silva, L. Trujillo, L. Milanesi, I. Merelli

引用次数: 4

The Efficient In-band Management for Interconnect Network in Tianhe-2 System 天河二号系统互联网络的高效带内管理

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) Pub Date : 2016-04-04 DOI: 10.1109/PDP.2016.58

Jijun Cao, Liquan Xiao, Zhengbin Pang, Kefei Wang, Jiaqing Xu

{"title":"The Efficient In-band Management for Interconnect Network in Tianhe-2 System","authors":"Jijun Cao, Liquan Xiao, Zhengbin Pang, Kefei Wang, Jiaqing Xu","doi":"10.1109/PDP.2016.58","DOIUrl":"https://doi.org/10.1109/PDP.2016.58","url":null,"abstract":"Interconnect network plays an important role in high performance computing systems. And its manageability directly affects the RAS (i.e., Reliability, Availability, and Serviceability) of the whole system. The Tianhe-2 system located in NSCC-gz (i.e., National Supercomputing Center of China in Guangzhou) uses proprietary interconnect network, which includes 5,856 high-radix network router chips (i.e., NRC) and 18,304 network interface chips (i.e., NIC). For such a very large-scale interconnect network, it is a great challenge to manage (such as configure, monitor, and debug) the numerous network chips and its network ports in an efficient way. By implementing the in-band management with very few hardware resources, the interconnect network in Tianhe-2 system achieves a highly efficient network management. In this paper, we introduce the design and implementation of the in-band management for interconnect network in Tianhe-2 system, especially emphasizing on several key features, including the set of achieved management functionalities, the architecture of network management, the format of management packets, the data flow and processing of management packets, etc. In this paper, we also evaluate the performance of in-band management by mainly comparing with out-band management scheme. The preliminary results demonstrate the efficiency of the in-band management for interconnect network in Tianhe-2 system.","PeriodicalId":192273,"journal":{"name":"2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130621202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Transient Temperature Prediction for Aging Thermal Sensors Using Artificial Neural Network 基于人工神经网络的老化热传感器瞬态温度预测

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) Pub Date : 2016-04-04 DOI: 10.1109/PDP.2016.89

Kameswar Rao Vaddina, J. M. Cebrian, L. Natvig

{"title":"Transient Temperature Prediction for Aging Thermal Sensors Using Artificial Neural Network","authors":"Kameswar Rao Vaddina, J. M. Cebrian, L. Natvig","doi":"10.1109/PDP.2016.89","DOIUrl":"https://doi.org/10.1109/PDP.2016.89","url":null,"abstract":"As technology scales down and power density increases, the temperature sensor characteristics will drift, leading to temperature errors which increase over time. Transistor aging is one of the leading contributors to temperature sensing inaccuracies. The prominent aging failure mechanisms like Negative Bias Temperature Instability (NBTI), Hot Carrier Injection (HCI) and electromigration have emerged as the main sources of system unreliability which manifest as an increase in the propagation delay over time. On-chip thermal sensors are not immune to this phenomenon and get affected by these aging mechanisms. Thermal sensor aging exacerbated by increased temperatures leads to temperature sensing inaccuracies requiring repeated sensor calibration. In this work, we propose a novel approach of using performance metrics to predict the transient temperature profile of an application as seen by the aging thermal sensor. Firstly, we make offline profiling of applications and then cluster them into groups using k-means clustering mechanism. Then we use a neural network model to predict the thermal profile of a new application given its performance metrics. The forecasting ability of our model is accessed using MSE and RMSE. This approach is highly scalable and can be used to predict future temperatures which can then be used for run-time dynamic thermal management of multi-core systems.","PeriodicalId":192273,"journal":{"name":"2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)","volume":"348 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132697432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3