{"title":"Operating systems for many-core systems","authors":"Hendrik Borghorst, O. Spinczyk","doi":"10.1049/pbpc022e_ch3","DOIUrl":"https://doi.org/10.1049/pbpc022e_ch3","url":null,"abstract":"The ongoing trend toward many-core computer systems and adequate new programming models has spawned numerous new activities in the domain of operating system (OS) research during recent years. This chapter will address the challenges and opportunities for OS developers in this new field and give an overview of state-of-the-art research.This section will introduce the reader to the spectrum of contemporary many-core CPU architectures, application programming models for many-core systems, give a brief overview of the resulting challenges for OS developers.","PeriodicalId":254920,"journal":{"name":"Many-Core Computing: Hardware and Software","volume":"183 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121694724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Decoupling the programming model from resource management in throughput processors","authors":"Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, S. Khan, Ashish Shrestha, Saugata Ghose, Adwait Jog, Phillip B. Gibbons, O. Mutlu","doi":"10.1049/pbpc022e_ch4","DOIUrl":"https://doi.org/10.1049/pbpc022e_ch4","url":null,"abstract":"This chapter introduces a new resource virtualization framework, Zorua, that decouples the graphics processing unit (GPU) programming model from the management of key on-chip resources in hardware to enhance programming ease, portability, and performance. The application resource specification-a static specification of several parameters such as the number of threads and the scratchpad memory usage per thread block-forms a critical component of the existing GPU programming models. This specification determines the parallelism, and, hence, performance of the application during execution because the corresponding on-chip hardware resources are allocated and managed purely based on this specification. This tight coupling between the software-provided resource specification and resource management in hardware leads to significant challenges in programming ease, portability, and performance, as we demonstrate in this chapter using real data obtained on state-of-the-art GPU systems. Our goal in this work is to reduce the dependence of performance on the software-provided static resource specification to simultaneously alleviate the above challenges. To this end, we introduce Zorua, a new resource virtualization framework, that decouples the programmer-specified resource usage of a GPU application from the actual allocation in the on-chip hardware resources. Zorua enables this decoupling by virtualizing each resource transparently to the programmer. The virtualization provided by Zorua builds on two key concepts-dynamic allocation of the on-chip resources and their oversubscription using a swap space in memory. Zorua provides a holistic GPU resource virtualization strategy designed to (i) adaptively control the extent of oversubscription and (ii) coordinate the dynamic management of multiple on-chip resources to maximize the effectiveness of virtualization.We demonstrate that by providing the illusion of more resources than physically available via controlled and coordinated virtualization, Zorua offers several important benefits: (i) Programming ease. It eases the burden on the programmer to provide code that is tuned to efficiently utilize the physically available on-chip resources. (ii) Portability. It alleviates the necessity of retuning an application's resource usage when porting the application across GPU generations. (iii) Performance. By dynamically allocating resources and carefully oversubscribing them when necessary, Zorua improves or retains the performance of applications that are already highly tuned to best utilize the resources. 
The holistic virtualization provided by Zorua has many other potential uses, e.g., fine-grained resource sharing among multiple kernels, low latency preemption of GPU programs, and support for dynamic parallelism, which we describe in this chapter.","PeriodicalId":254920,"journal":{"name":"Many-Core Computing: Hardware and Software","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127764605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
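To make the coupling problem concrete, the following sketch (not taken from the chapter; the constants, names and the swap policy are all illustrative) models in software the essence of what the abstract describes: a per-block static resource request is satisfied dynamically, with any shortfall oversubscribed to a swap area instead of limiting how many blocks can run.

/* Illustrative-only software model of dynamic allocation plus oversubscription
 * for a statically specified per-block scratchpad requirement. This is a
 * conceptual sketch, not the hardware mechanism presented in the chapter. */
#include <stdio.h>

#define PHYS_SCRATCHPAD (48 * 1024)      /* physically available scratchpad, bytes */

static unsigned scratchpad_in_use = 0;   /* on-chip bytes currently allocated      */
static unsigned scratchpad_swapped = 0;  /* bytes backed by the swap space in memory */

/* Admit a block: give it on-chip space if possible, otherwise oversubscribe. */
static void admit_block(unsigned threads, unsigned scratch_bytes)
{
    unsigned free_bytes = PHYS_SCRATCHPAD - scratchpad_in_use;
    unsigned on_chip = scratch_bytes <= free_bytes ? scratch_bytes : free_bytes;
    unsigned swapped = scratch_bytes - on_chip;

    scratchpad_in_use += on_chip;
    scratchpad_swapped += swapped;
    printf("block(%u threads): %u B on chip, %u B oversubscribed to swap\n",
           threads, on_chip, swapped);
}

int main(void)
{
    /* The static specification asks for 32 KiB per block; the manager decides
     * dynamically how much of that actually lives on chip at any time. */
    for (int b = 0; b < 3; b++)
        admit_block(256, 32 * 1024);
    return 0;
}

With a purely static scheme, the third block could not be admitted at all; the point of the decoupling is that admission is no longer dictated by the worst-case declared usage.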
{"title":"From power-efficient to power-driven computing","authors":"R. Shafik, A. Yakovlev","doi":"10.1049/pbpc022e_ch11","DOIUrl":"https://doi.org/10.1049/pbpc022e_ch11","url":null,"abstract":"The dramatic spread of computing, at the scale of trillions of ubiquitous devices, is delivering on the pervasive penetration into the real world in the form of Internet of Things (IoT). Today, the widely used power-efficient paradigms directly related to the behaviour of computing systems are those of real-time (working to deadlines imposed from the real world) and low-power (prolonging battery life or reducing heat dissipation and electricity bills). None of these addresses the strict requirements on power supply, allocation and utilisation that are imposed by the needs of new devices and applications in the computing swarm - many of which are expected to be confronted with challenges of autonomy and battery-free long life. Indeed, we need to design and build systems for survival, operating under a wide range of power constraints; we need a new power-driven paradigm called real-power computing (RPC). The article provides an overview of this emerging paradigm with definition, taxonomies and a case study, together with a summary of the existing research. Towards the end, the overview leads to research and development challenges and opportunities surfacing this paradigm. Throughout the article, we have used the power and energy terms as follows. From the supply side, the energy term will be used to refer to harvesters with built-in storage, while the power term will indicate instantaneous energy dispensation. For the computing logic side, the energy term will define the total power consumed over a given time interval.","PeriodicalId":254920,"journal":{"name":"Many-Core Computing: Hardware and Software","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117006621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"HPC with many core processors","authors":"X. Martorell, Jorge Bellón, Víctor López, Vicencc Beltran, Sergi Mateo, Xavier Teruel, E. Ayguadé, Jesús Labarta","doi":"10.1049/pbpc022e_ch1","DOIUrl":"https://doi.org/10.1049/pbpc022e_ch1","url":null,"abstract":"The current trends in building clusters and supercomputers are to use medium-to-big symmetric multi-processors (SMP) nodes connected through a high-speed network. Applications need to accommodate to these execution environments using distributed and shared memory programming, and thus become hybrid. Hybrid applications are written with two or more programming models, usually message passing interface (MPI) [1,2] for the distributed environment and OpenMP [3,4] for the shared memory support. The goal of this chapter is to show how the two programming models can be made interoperable and ease the work of the programmer. Thus, instead of asking the programmers to code optimizations targeting performance, it is possible to rely on the good interoperability between the programming models to achieve high performance. For example, instead of using non-blocking message passing and double buffering to achieve computation-communication overlap, our approach provides this feature by taskifying communications using OpenMP tasks [5,6].","PeriodicalId":254920,"journal":{"name":"Many-Core Computing: Hardware and Software","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123227063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cognitive I/O for 3D-integrated many-core system","authors":"Hao Yu, Sai Manoj Pudukotai Dinakarrao, Hantao Huang","doi":"10.1049/pbpc022e_ch19","DOIUrl":"https://doi.org/10.1049/pbpc022e_ch19","url":null,"abstract":"Increasing demands to process large amounts of data in real time leads to an increase in the many-core microprocessors, which is posing a grand challenge for an effective and management of available resources. As communication power occupies a significant portion of power consumption when processing such big data, there is an emerging need to devise a methodology to reduce the communication power without sacrificing the performance. To address this issue, we introduce a cognitive I/O designed toward 3D-integrated many-core microprocessors that performs adaptive tuning of the voltage-swing levels depending on the achieved performance and power consumption. We embed this cognitive I/O in a many-core microprocessor with DRAM memory partitioning to perform energy saving for application such as fingerprint matching and face recognition.","PeriodicalId":254920,"journal":{"name":"Many-Core Computing: Hardware and Software","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128259754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Self-testing of multicore processors","authors":"M. Skitsas, Marco Restifo, M. Michael, Nicopoulos Chrysostomos, P. Bernardi, Sanchez Ernesto","doi":"10.1049/PBPC022E_CH15","DOIUrl":"https://doi.org/10.1049/PBPC022E_CH15","url":null,"abstract":"The purpose of this chapter is to develop a review of state-of-the-art techniques and methodologies for the self-testing of multicore processors. The chapter is divided into two main sections: (a) self-testing solutions covering general-purpose multicore microprocessors such as chip multiprocessors (CMPs) and (b) self-testing solutions targeting application-specific multicore designs known as SoCs. In the first section (general-purpose), a taxonomy of current self-testing approaches is initially presented, followed by a review of the state-of-the-art for each class. The second section (application-specific) provides an overview of the test scheduling flows for multicore SoCs, as well as the testing strategies for the individual components (sub-systems) of such systems.","PeriodicalId":254920,"journal":{"name":"Many-Core Computing: Hardware and Software","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129234764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"From irregular heterogeneous software to reconfigurable hardware","authors":"John Wickerson, G. Constantinides","doi":"10.1049/pbpc022e_ch2","DOIUrl":"https://doi.org/10.1049/pbpc022e_ch2","url":null,"abstract":"A heterogeneous system is the one that incorporates more than one kind of computing device. Such a system can offer better performance per Watt than a homogeneous one if the applications it runs are programmed to take advantage of the different strengths of the different devices in the system. A typical heterogeneous setup involves a master processor (the `host' CPU) offloading some easily parallelised computations to a graphics processing unit (GPU) or to a custom accelerator implemented on a field-programmable gate array (FPGA).This arrangement can benefit performance because it exploits the massively parallel natures of GPU and FPGA architectures.","PeriodicalId":254920,"journal":{"name":"Many-Core Computing: Hardware and Software","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132000602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Developing portable embedded software for multicore systems through formal abstraction and refinement","authors":"Asieh Salehi Fathabadi, Mohammadsadegh Dalvandi, M. Butler","doi":"10.1049/PBPC022E_CH14","DOIUrl":"https://doi.org/10.1049/PBPC022E_CH14","url":null,"abstract":"Run-time management (RTM) systems are used in embedded systems to dynamically adapt hardware performance to minimise energy consumption. An RTM system implementation is coupled with the hardware platform specifications and is implemented individually for each specific platform. A significant challenge is that RTM software can require laborious manual adjustment across different hardware platforms due to the diversity of architecture characteristics. Hardware specifications vary from one platform to another and include a number of characteristic such as the number of supported voltage and frequency (VF) settings. Formal modelling offers the potential to simplify the management of platform diversity by shifting the focus away from handwritten platform-specific code to platform-independent models from which platform-specific implementations are automatically generated. The article presents an overview of the motivations for this work. It goes on to overview the RTM architecture and requirements and introduce the Event-B formal method and its tool support. The article then describes the Event-B model of two different RTMs and presents the portability support provided by formal modelling and code generation. Finalyy, it reviews the verification and experimental results.","PeriodicalId":254920,"journal":{"name":"Many-Core Computing: Hardware and Software","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117315290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modelling many-core architectures","authors":"Guihai Yan, Jiajun Li, L. Xiaowei","doi":"10.1049/pbpc022e_ch12","DOIUrl":"https://doi.org/10.1049/pbpc022e_ch12","url":null,"abstract":"Architectural modelling has two primary objectives: (1) navigating the design space exploration, i.e. guiding the architects to arrival at better design choices, and (2) facilitating dynamic management, i.e. providing the functional relationships between workloads'characteristics and architectural configurations to enable appropriate runtime hardware/software adaptations. In the past years, many-core architectures, as a typical computing fabric evolving from the monolithic single-/multicore architectures, have been shown to be scalable to uphold the staggering the Moore's Law. The many-core architectures enable two orthogonal approaches, scale-up and scale-out, to utilize the growing budget of transistors. Understanding the rationale behind these approaches is critical to make more efficient use of the powerful computing fabric.","PeriodicalId":254920,"journal":{"name":"Many-Core Computing: Hardware and Software","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126393657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}