International Conference on Hardware/Software Codesign and System Synthesis最新文献

Furion: alleviating overheads for deep learning framework on single machine (work-in-progress) Furion:减轻单机上深度学习框架的开销(正在开发中)

International Conference on Hardware/Software Codesign and System Synthesis Pub Date : 2018-09-30 DOI: 10.5555/3283568.3283582

L. Jin, Chao Wang, Lei Gong, Chongchong Xu, Yahui Hu, Luchao Tan, Xuehai Zhou

{"title":"Furion: alleviating overheads for deep learning framework on single machine (work-in-progress)","authors":"L. Jin, Chao Wang, Lei Gong, Chongchong Xu, Yahui Hu, Luchao Tan, Xuehai Zhou","doi":"10.5555/3283568.3283582","DOIUrl":"https://doi.org/10.5555/3283568.3283582","url":null,"abstract":"Deep learning has been successful at solving many kinds of tasks. Hardware accelerators with high performance and parallelism have become mainstream to implement deep neural networks. In order to increase hardware utilization, multiple applications will share the same compute resource. However, different applications may use different deep learning frameworks and occupy different amounts of resources. If there are no scheduling platforms that are compatible with different frameworks, resources competition will result in longer response time, run out of memory, and other errors. When the resources of the system cannot satisfy all the applications at the same time, application switching overhead will be excessive without reasonable resource management strategy.In this paper, we propose Furion - a middleware alleviates overheads for deep learning framework on a single machine. Furion schedules tasks, overlaps the execution of different computing resource, and batches unknown inputs to increase the hardware accelerator utilization. It dynamically manages memory usage for each application to alleviate the overhead of application switching and make a complex model enable implement in a low-end GPU. Our experiment proved that Furion achieves 2.2x-2.7x speedup on the GTX1060.","PeriodicalId":300268,"journal":{"name":"International Conference on Hardware/Software Codesign and System Synthesis","volume":"166 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122561609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A chip-level security framework for assessing sensor data integrity: work-in-progress 用于评估传感器数据完整性的芯片级安全框架:正在进行的工作

International Conference on Hardware/Software Codesign and System Synthesis Pub Date : 2018-09-30 DOI: 10.5555/3283568.3283588

Taimour Wehbe, V. Mooney, D. Keezer

引用次数: 0

Dynamic data management for automotive ECUs with hybrid RAM-NVM memory: work-in-progress 带有混合RAM-NVM内存的汽车ecu动态数据管理:正在进行的工作

International Conference on Hardware/Software Codesign and System Synthesis Pub Date : 2018-09-30 DOI: 10.5555/3283568.3283573

Jinyu Zhan, Junhuan Yang, Wei Jiang, Yixin Li

引用次数: 0

Dynamically utilizing computation accelerators for extensible processors in a software approach 在软件方法中动态利用可扩展处理器的计算加速器

International Conference on Hardware/Software Codesign and System Synthesis Pub Date : 2009-10-11 DOI: 10.1145/1629435.1629443

Yashuai Lü, Li Shen, Zhiying Wang, Nong Xiao

引用次数: 5

Native MPSoC co-simulation environment for software performance estimation 用于软件性能估计的原生MPSoC联合仿真环境

International Conference on Hardware/Software Codesign and System Synthesis Pub Date : 2009-10-11 DOI: 10.1145/1629435.1629490

P. Gerin, M. M. Hamayun, F. Pétrot

{"title":"Native MPSoC co-simulation environment for software performance estimation","authors":"P. Gerin, M. M. Hamayun, F. Pétrot","doi":"10.1145/1629435.1629490","DOIUrl":"https://doi.org/10.1145/1629435.1629490","url":null,"abstract":"Performance estimation of Multi-Processor System-On-Chip (MPSoC) at a high abstraction level is required in order to perform early architecture exploration and accurate design validations. Although abstract executable models provide interesting functional validation capabilities, they quickly become unsuitable when timing becomes an issue - Native software simulation, a good candidate from the speed point of view, suffers from this issue.\u0000 In this paper, we present a transactional level simulation environment that allows reliable performance estimation with a specific focus on software timing estimation on multi processor architectures. The embedded software is compiled natively on the host running the simulation and instrumented to reflect its execution on a specific target processor and then executed on a simulation model of the underlying hardware.\u0000 The key contribution of this work is the use of both static and dynamic analysis, that allow realistic timing measurements in native software simulation. Experimental results show the efficiency of the proposed method to accurately estimate software performance in co-simulation environments.","PeriodicalId":300268,"journal":{"name":"International Conference on Hardware/Software Codesign and System Synthesis","volume":"79 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124283347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 30

Automated technique for design of NoC with minimal communication latency 最小通信延迟NoC设计的自动化技术

International Conference on Hardware/Software Codesign and System Synthesis Pub Date : 2009-10-11 DOI: 10.1145/1629435.1629499

G. Leary, Karam S. Chatha

引用次数: 12

Exploiting data-redundancy in reliability-aware networked embedded system design 利用数据冗余在可靠性感知网络嵌入式系统设计中的应用

International Conference on Hardware/Software Codesign and System Synthesis Pub Date : 2009-10-11 DOI: 10.1145/1629435.1629468

M. Lukasiewycz, M. Glaß, J. Teich

引用次数: 8

A monitoring and adaptive routing mechanism for QoS traffic on mesh NoC architectures 一种基于网状NoC架构的QoS流量监控和自适应路由机制

International Conference on Hardware/Software Codesign and System Synthesis Pub Date : 2009-10-11 DOI: 10.1145/1629435.1629451

Leonel Tedesco, F. Clermidy, F. Moraes

{"title":"A monitoring and adaptive routing mechanism for QoS traffic on mesh NoC architectures","authors":"Leonel Tedesco, F. Clermidy, F. Moraes","doi":"10.1145/1629435.1629451","DOIUrl":"https://doi.org/10.1145/1629435.1629451","url":null,"abstract":"The development of MPSoCs targeting embedded systems with a dynamic workload of applications constitutes an important challenge. The growing number of applications running on these systems produces a considerable utilization of resources, implying a high demand of computation and communication in the different MPSoC parts. The heterogeneity of processing elements brings to the application traffic a dynamic and unpredictable nature, due to the variability on data injection rates. NoCs are the communication infrastructure to be used in such systems, due to its performance, reliability and scalability. Different strategies may be employed to deal with traffic congestion, such as adaptive routing, buffer sizing, and even task migration. The goal of this work is to investigate the use of adaptive routing algorithms, where the path between source and target PEs may be modified due to congestion events. The major part of the state of art proposals have a limited view of NoCs, since each NoC router takes decisions based on few neighbors' congestion status. Such local decision may lead packets to other congested regions, therefore being inefficient. This paper presents a new method, where congestion analysis considers information of all routers in the source-target path. This method relies on a protocol for QoS session establishment, followed by distributed monitoring and re-route to non-congested regions. The set of experiments present results concerning performance and amount of time spent by packets on routers when the proposed method is applied.","PeriodicalId":300268,"journal":{"name":"International Conference on Hardware/Software Codesign and System Synthesis","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126571310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

A compositional modelling framework for exploring MPSoC systems 一个用于探索MPSoC系统的组成建模框架

International Conference on Hardware/Software Codesign and System Synthesis Pub Date : 2009-10-11 DOI: 10.1145/1629435.1629437

Anders Sejer Tranberg-Hansen, J. Madsen

引用次数: 4

Improving application launch times with hybrid disks 使用混合磁盘改进应用程序启动时间

International Conference on Hardware/Software Codesign and System Synthesis Pub Date : 2009-10-11 DOI: 10.1145/1629435.1629486

Yongsoo Joo, Youngjin Cho, Kyungsoo Lee, N. Chang

引用次数: 13