Proceedings of the 23rd International Workshop on Software and Compilers for Embedded Systems: Latest Publications

OpenMP to CUDA graphs: a compiler-based transformation to enhance the programmability of NVIDIA devices
Chen Yu, Sara Royuela, E. Quiñones
DOI: 10.1145/3378678.3391881 | Published: 2020-05-25
Abstract: Heterogeneous computing is increasingly being used in a diversity of computing systems, ranging from HPC to the real-time embedded domain, to cope with performance requirements. Given the variety of accelerators, e.g., FPGAs and GPUs, the use of high-level parallel programming models is desirable to exploit their performance capabilities while maintaining an adequate level of productivity. In that regard, OpenMP is a well-known high-level programming model that incorporates powerful task and accelerator models capable of efficiently exploiting structured and unstructured parallelism in heterogeneous computing. This paper presents a novel compiler transformation technique that automatically transforms OpenMP code into CUDA graphs, combining the programmability benefits of a high-level programming model such as OpenMP with the performance benefits of a low-level programming model such as CUDA. Evaluations have been performed on two NVIDIA GPUs from the HPC and embedded domains, the V100 and the Jetson AGX, respectively.
Citations: 8

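The kind of code such a transformation targets can be illustrated with the CUDA graph API itself. The sketch below is illustrative only: the kernel `scale` is a hypothetical stand-in for an OpenMP target region, the graph is built by hand via stream capture rather than generated by the paper's compiler, and the `cudaGraphInstantiate` call uses the CUDA 11-style signature.

```cuda
#include <cuda_runtime.h>

// Hypothetical kernel standing in for an OpenMP target region.
__global__ void scale(float *x, float a, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) x[i] *= a;
}

int main() {
    const int n = 1 << 20;
    float *d_x;
    cudaMalloc(&d_x, n * sizeof(float));

    cudaStream_t s;
    cudaStreamCreate(&s);

    // Capture a dependent sequence of kernels into a CUDA graph once...
    cudaGraph_t graph;
    cudaGraphExec_t exec;
    cudaStreamBeginCapture(s, cudaStreamCaptureModeGlobal);
    scale<<<(n + 255) / 256, 256, 0, s>>>(d_x, 2.0f, n);  // "task" 1
    scale<<<(n + 255) / 256, 256, 0, s>>>(d_x, 0.5f, n);  // "task" 2, ordered after task 1
    cudaStreamEndCapture(s, &graph);
    // CUDA 11 signature; CUDA 12 drops the error-node/log-buffer parameters.
    cudaGraphInstantiate(&exec, graph, nullptr, nullptr, 0);

    // ...then replay it with a single launch per iteration, amortizing launch overhead.
    for (int it = 0; it < 100; ++it)
        cudaGraphLaunch(exec, s);
    cudaStreamSynchronize(s);

    cudaGraphExecDestroy(exec);
    cudaGraphDestroy(graph);
    cudaFree(d_x);
    return 0;
}
```
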
Programming tensor cores from an image processing DSL
Savvas Sioutas, S. Stuijk, T. Basten, L. Somers, H. Corporaal
DOI: 10.1145/3378678.3391880 | Published: 2020-05-25
Abstract: Tensor Cores (TCUs) are specialized units first introduced by NVIDIA in the Volta microarchitecture in order to accelerate matrix multiplications for deep learning and linear algebra workloads. While these units have proved capable of providing significant speedups for specific applications, their programmability remains difficult for the average user. In this paper, we extend the Halide DSL and compiler with the ability to utilize these units when generating code for a CUDA-based NVIDIA GPGPU. To this end, we introduce a new scheduling directive along with custom lowering passes that automatically transform the Halide AST so that code can be generated for the TCUs. We evaluate the generated code and show that it can achieve over a 5x speedup compared to Halide manual schedules without TCU support, while remaining within 20% of the NVIDIA cuBLAS implementations for mixed-precision GEMM and within 10% of manual CUDA implementations with WMMA intrinsics.
Citations: 5

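The WMMA intrinsics the abstract compares against look roughly as follows. This is a minimal sketch of one warp computing a single 16x16 half-precision tile with a float accumulator, not the code the extended Halide compiler emits; the kernel name and sizes are illustrative.

```cuda
// Build with: nvcc -arch=sm_70 wmma_tile.cu
#include <mma.h>
#include <cuda_fp16.h>
#include <cuda_runtime.h>
using namespace nvcuda;

// One warp computes a single 16x16 output tile D = A*B with half inputs and a
// float accumulator (the mixed-precision GEMM pattern handled by the TCUs).
__global__ void wmma_tile(const half *A, const half *B, float *D) {
    wmma::fragment<wmma::matrix_a, 16, 16, 16, half, wmma::row_major> a;
    wmma::fragment<wmma::matrix_b, 16, 16, 16, half, wmma::row_major> b;
    wmma::fragment<wmma::accumulator, 16, 16, 16, float> acc;

    wmma::fill_fragment(acc, 0.0f);
    wmma::load_matrix_sync(a, A, 16);       // leading dimension = 16
    wmma::load_matrix_sync(b, B, 16);
    wmma::mma_sync(acc, a, b, acc);         // tensor-core multiply-accumulate
    wmma::store_matrix_sync(D, acc, 16, wmma::mem_row_major);
}

int main() {
    half *A, *B; float *D;
    cudaMalloc(&A, 16 * 16 * sizeof(half));
    cudaMalloc(&B, 16 * 16 * sizeof(half));
    cudaMalloc(&D, 16 * 16 * sizeof(float));
    wmma_tile<<<1, 32>>>(A, B, D);          // exactly one warp
    cudaDeviceSynchronize();
    cudaFree(A); cudaFree(B); cudaFree(D);
    return 0;
}
```
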
Configuring loosely time-triggered wireless control software
Philipp H. Kindt, Sumana Ghosh, S. Chakraborty
DOI: 10.1145/3378678.3391888 | Published: 2020-05-25
Abstract: In many wireless control networks, sensor data and controller data are exchanged periodically, which requires periodic packet transmissions between the physical plant and the controller. As an alternative, event-triggered control paradigms imply that data is only exchanged when there are significant changes in the state of the plant, e.g., because of disturbances. This is the nature of many IoT scenarios and requires that a receiving device listen to the channel for incoming packets at all times. However, especially in mobile networks in which all devices are battery-powered, continuous scanning would drain the battery quickly; hence, reception needs to be duty-cycled. When optimizing such duty-cycled operation, significant energy savings are possible using intelligent software-enabled communication scheduling. In this paper, we propose a wireless transmission scheme that supports loosely time-triggered control. By optimizing the scheduling of transmissions and reception windows in the communication protocol, our proposed scheme allows for energy-efficient communication without requiring strict clock synchronization between the devices. We show that such a scheme is practical and can greatly reduce the energy consumption in event-triggered control applications.
Citations: 1

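To make the duty-cycling idea concrete, the sketch below shows one generic way a receiver can schedule listen windows without strict clock synchronization: sleep between expected transmissions and widen each window by the worst-case clock drift. This is an illustration of the general principle, not the scheme proposed in the paper; all names and parameter values are hypothetical.

```cpp
#include <cstdint>
#include <cstdio>

// Illustrative duty-cycled reception schedule: the receiver wakes up around
// each expected transmission and widens its listen window by the drift that
// can accumulate over one period, so no strict clock sync is required.
struct RxWindow { uint64_t start_us, end_us; };

RxWindow next_rx_window(uint64_t last_rx_us, uint64_t period_us,
                        double drift_ppm, uint64_t guard_us) {
    // Worst-case relative drift accumulated over one period, plus a fixed guard.
    uint64_t margin_us =
        static_cast<uint64_t>(period_us * drift_ppm * 1e-6) + guard_us;
    uint64_t expected_us = last_rx_us + period_us;
    return { expected_us - margin_us, expected_us + margin_us };
}

int main() {
    // Example: 100 ms period, +/-50 ppm crystals on both sides, 200 us guard.
    RxWindow w = next_rx_window(1000000, 100000, 2 * 50.0, 200);
    std::printf("listen from %llu us to %llu us\n",
                (unsigned long long)w.start_us, (unsigned long long)w.end_us);
    return 0;
}
```
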
On the implementation and execution of adaptive streaming applications modeled as MADF
Sobhan Niknam, Peng Wang, T. Stefanov
DOI: 10.1145/3378678.3391876 | Published: 2020-05-25
Abstract: It has been shown that mode-aware dataflow (MADF) is an advantageous analysis model for adaptive streaming applications. However, no attention has been paid to how to implement and execute an application, modeled and analyzed with the MADF model, on a Multi-Processor System-on-Chip such that the properties of the analysis model are preserved. Therefore, in this paper, we consider this matter and propose a generic parallel implementation and execution approach for adaptive streaming applications modeled with MADF. Our approach can be easily realized on top of existing operating systems while supporting the utilization of a wider range of schedules. In particular, we demonstrate our approach on LITMUS^RT, one of the existing real-time extensions of the Linux kernel. Finally, to show the practical applicability of our approach and its conformity to the analysis model, we present a case study using a real-life adaptive streaming application.
Citations: 0

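In MADF, an actor's consumption and production rates are fixed per mode and mode switches are signaled over a control channel. The sketch below is a minimal, hypothetical actor loop illustrating that idea only; it is not the paper's implementation or its LITMUS^RT realization, and the modes, rates, and names are invented for illustration.

```cpp
#include <queue>
#include <vector>
#include <cstdio>

// Hypothetical mode-aware actor: a control channel selects the scenario,
// which fixes the consumption rate for the next firing.
enum class Mode { Low, High };

struct Actor {
    std::queue<Mode>   control;  // mode tokens from the mode controller
    std::queue<float>  input;    // data channel
    std::vector<float> output;

    void fire() {
        if (control.empty()) return;
        Mode m = control.front();
        // Scenario-dependent rate: consume 1 token in Low mode, 4 in High mode.
        int rate = (m == Mode::Low) ? 1 : 4;
        if (static_cast<int>(input.size()) < rate) return;  // wait for a full firing
        control.pop();
        float acc = 0.0f;
        for (int i = 0; i < rate; ++i) { acc += input.front(); input.pop(); }
        output.push_back(acc / rate);                        // produce one token
    }
};

int main() {
    Actor a;
    a.control.push(Mode::High);
    for (float v : {1.0f, 2.0f, 3.0f, 4.0f}) a.input.push(v);
    a.fire();
    std::printf("produced %zu token(s), first = %.2f\n", a.output.size(), a.output[0]);
    return 0;
}
```
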
Cross-layer approaches for improving the dependability of deep learning systems
Muhammad Abdullah Hanif, L. Hoang, M. Shafique
DOI: 10.1145/3378678.3391884 | Published: 2020-05-25
Abstract: Deep Neural Networks (DNNs), the state-of-the-art computational models for many Artificial Intelligence (AI) applications, are inherently compute- and resource-intensive and hence cannot exploit traditional redundancy-based fault mitigation techniques for enhancing the dependability of DNN-based systems. Therefore, there is a dire need for alternative methods that can improve their reliability without a high expenditure of resources by exploiting the intrinsic characteristics of these networks. In this paper, we present cross-layer approaches that, based on the intrinsic characteristics of DNNs, employ software- and hardware-level modifications for improving the resilience of DNN-based systems to hardware-level faults, e.g., soft errors and permanent faults.
Citations: 1

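One representative software-level technique from this research area, shown here purely as an illustration and not necessarily the specific modification used in the paper, is restricting activations to a range profiled on fault-free runs so that a bit flip producing an extreme value cannot propagate through later layers.

```cpp
#include <algorithm>
#include <vector>
#include <cstdio>

// Illustrative range-restriction pass: clamp each activation to bounds profiled
// offline, so a soft error that flips a high-order bit cannot inject an extreme
// value into subsequent layers.
void clamp_activations(std::vector<float>& act, float lo, float hi) {
    for (float& v : act)
        v = std::min(std::max(v, lo), hi);
}

int main() {
    std::vector<float> act = {0.3f, 1.7f, 9999.0f /* bit-flip corrupted */};
    clamp_activations(act, 0.0f, 6.0f);   // hypothetical profiled bounds
    for (float v : act) std::printf("%.1f ", v);
    std::printf("\n");
    return 0;
}
```
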
Scheduling of moldable fork-join tasks with inter- and intra-task communications
Hiroki Nishikawa, Kaname Shimada, Ittetsu Taniguchi, H. Tomiyama
DOI: 10.1145/3378678.3391875 | Published: 2020-05-25
Abstract: This paper proposes scheduling techniques for moldable fork-join tasks on multicore architectures. The proposed techniques decide the number of cores and the execution start time for each task during scheduling and mapping, taking into account inter- and intra-task communications. The techniques, based on an integer programming formulation, aim at minimizing the overall schedule length. Experimental results are compared with state-of-the-art techniques.
Citations: 1

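A generic, illustrative core of such a makespan-minimizing formulation (not the paper's exact model, which additionally handles intra-task communication and the mapping of tasks to specific cores) is sketched below, with s_i the start time of task i, p_i its number of assigned cores, e_i(p_i) its execution time on p_i cores, c_ij the inter-task communication delay on precedence edge (i,j), M the number of cores, and C_max the schedule length. In a real ILP the time-varying resource constraint is linearized, e.g., with time-indexed or pairwise-overlap binary variables.

```latex
\begin{align*}
\min\;\; & C_{\max} \\
\text{s.t.}\;\; & s_i + e_i(p_i) \le C_{\max} && \forall i \in V \\
& s_i + e_i(p_i) + c_{ij} \le s_j && \forall (i,j) \in E \\
& \textstyle\sum_{i\ \text{running at}\ t} p_i \le M && \forall t \\
& p_i \in \{1,\dots,M\},\;\; s_i \ge 0 && \forall i \in V
\end{align*}
```
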
A secure hardware-software solution based on RISC-V, logic locking and microkernel
Dominik Sisejkovic, Farhad Merchant, Lennart M. Reimann, R. Leupers, M. Giacometti, Sascha Kegreiss
DOI: 10.1145/3378678.3391886 | Published: 2020-05-25
Abstract: In this paper we present the first generation of a secure platform developed by following a security-by-design approach. The security of the platform is built on top of two pillars: a secured hardware design flow and a secure microkernel. The hardware design is protected against the insertion of hardware Trojans during the production phase through netlist obfuscation provided by logic locking. The software stack is based on a trustworthy and verified microkernel. Moreover, the system is expected to work in an environment which does not allow physical access to the device; therefore, in-the-field attacks are only possible via software. We present a solution whose security has been achieved by relying on simple and open hardware and software solutions, namely a RISC-V processor core, open-source peripherals, and an seL4-based operating system.
Citations: 12

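Logic locking inserts key-controlled gates into the netlist so that the circuit only computes the intended function when the secret key is applied. The snippet below is a toy software illustration of a single XNOR key gate, shown only to convey the principle; real logic locking operates on gate-level netlists during the hardware design flow, not on C++ code, and this is not the locking scheme used in the paper.

```cpp
#include <cstdio>

// Toy XNOR key gate inserted after an AND gate: the locked circuit computes
// the original function only with the correct key bit (here 1); with a wrong
// key the output is inverted, corrupting the circuit's behavior.
bool locked_and(bool a, bool b, bool key_bit) {
    bool original = a && b;          // original gate
    return !(original ^ key_bit);    // key gate; correct iff key_bit == 1
}

int main() {
    for (int k = 0; k <= 1; ++k)
        std::printf("key=%d: 1 AND 1 -> %d\n", k, (int)locked_and(true, true, k));
    return 0;
}
```
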
Reviewing inference performance of state-of-the-art deep learning frameworks
Berk Ulker, S. Stuijk, H. Corporaal, R. Wijnhoven
DOI: 10.1145/3378678.3391882 | Published: 2020-05-25
Abstract: Deep learning models have replaced conventional methods for machine learning tasks. Efficient inference on edge devices with limited resources is key for broader deployment. In this work, we focus on the tool selection challenge for inference deployment. We present an extensive evaluation of the inference performance of deep learning software tools using state-of-the-art CNN architectures on multiple hardware platforms. We benchmark these hardware-software pairs for a broad range of network architectures, inference batch sizes, and floating-point precisions, focusing on latency and throughput. Our results reveal interesting combinations for optimal tool selection, resulting in different optima when considering minimum latency and maximum throughput.
Citations: 13

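The latency/throughput methodology can be illustrated with a generic timing harness like the one below; it is a sketch only, the `benchmark` helper and the stubbed inference call are hypothetical and do not reflect the paper's measurement setup.

```cpp
#include <chrono>
#include <cstdio>
#include <functional>

// Generic benchmarking harness: warm-up runs first, then timed runs.
// `infer` stands in for one framework inference call at a given batch size.
void benchmark(const std::function<void()>& infer, int batch,
               int warmup = 10, int runs = 100) {
    for (int i = 0; i < warmup; ++i) infer();
    auto t0 = std::chrono::steady_clock::now();
    for (int i = 0; i < runs; ++i) infer();
    auto t1 = std::chrono::steady_clock::now();
    double ms = std::chrono::duration<double, std::milli>(t1 - t0).count() / runs;
    std::printf("batch %d: latency %.3f ms, throughput %.1f samples/s\n",
                batch, ms, batch * 1000.0 / ms);
}

int main() {
    // Dummy workload standing in for a real framework invocation.
    auto fake_infer = [] { volatile double x = 0; for (int i = 0; i < 100000; ++i) x += i; };
    benchmark(fake_infer, /*batch=*/8);
    return 0;
}
```
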
Real-time audio processing for hearing aids using a model-based bayesian inference framework
M. Roa-Villescas, B. Vries, S. Stuijk, H. Corporaal
DOI: 10.1145/3378678.3397528 | Published: 2020-05-25
Abstract: Development of hearing aid (HA) signal processing algorithms entails an iterative process between two design steps, namely algorithm development and the embedded implementation. Algorithm designers favor high-level programming languages for several reasons, including higher productivity, code readability and, perhaps most importantly, the availability of state-of-the-art signal processing frameworks that open new research directions. Embedded software, on the other hand, is preferably implemented using a low-level programming language to allow finer control of the hardware, an essential trait in real-time processing applications. In this paper we present a technique that allows deploying DSP algorithms written in Julia, a modern high-level programming language, on a real-time HA processing platform known as openMHA. We demonstrate this technique by using a model-based Bayesian inference framework to perform real-time audio processing.
Citations: 3

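At its core, this kind of deployment means calling Julia-defined DSP code from a C++ processing callback. One generic way to do that, shown here only as an illustration and not as the paper's mechanism or the openMHA plugin API, is Julia's C embedding interface; the `gain` function and the scalar-per-call processing are placeholders, and real-time concerns (precompilation, allocation-free calls) are deliberately ignored.

```cpp
// Build with the Julia embedding flags, e.g.:
//   g++ embed.cpp -I$JULIA_DIR/include/julia -L$JULIA_DIR/lib -ljulia
#include <julia.h>
#include <cstdio>

int main() {
    jl_init();                                     // start the Julia runtime once
    jl_eval_string("gain(x) = 0.5 * x");           // stand-in for a real DSP algorithm
    jl_function_t *gain = jl_get_function(jl_main_module, "gain");

    double sample = 0.8;                           // one sample, for brevity
    jl_value_t *out = jl_call1(gain, jl_box_float64(sample));
    std::printf("processed sample: %f\n", jl_unbox_float64(out));

    jl_atexit_hook(0);
    return 0;
}
```
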
Exploration of GPU sharing policies under GEMM workloads
Ioannis Oroutzoglou, Dimosthenis Masouros, Konstantina Koliogeorgi, S. Xydis, D. Soudris
DOI: 10.1145/3378678.3391887 | Published: 2020-05-25
Abstract: Lately, cloud computing has seen explosive growth due to the flexibility and scalability it offers. The ever-increasing computational demands, especially from the machine learning domain, have forced cloud operators to enhance their infrastructure with acceleration devices, such as General-Purpose (GP)GPUs or FPGAs. Even though multi-tenancy has been widely examined for conventional CPUs, this is not the case for accelerators. Current solutions support "one accelerator per user" schemes, which can lead to both under-utilization and starvation of available resources. In this work, we analyze the potential of GPU sharing inside data-center environments. We investigate how several architectural features affect the performance of GPUs under different multi-tenant stressing scenarios. We compare CUDA MPS with the native, default CUDA scheduler and also with Vinetalk, a research framework providing GPU sharing capabilities. Experimental results show that NVIDIA's MPS achieves the best performance in multi-application scenarios, specifically up to 4.5x and 11.2x better than the native CUDA scheduler and Vinetalk, respectively.
Citations: 1

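The co-located GEMM workloads studied here can be approximated with a sketch like the one below: two cuBLAS SGEMMs issued on separate streams from one process. This is an illustration of the sharing pattern, not the paper's benchmark harness; matrix contents are left uninitialized and error checking is omitted for brevity.

```cuda
// Build with: nvcc gemm_share.cu -lcublas
#include <cublas_v2.h>
#include <cuda_runtime.h>

int main() {
    const int n = 1024;
    const float alpha = 1.0f, beta = 0.0f;
    float *A, *B, *C1, *C2;
    cudaMalloc(&A,  n * n * sizeof(float));
    cudaMalloc(&B,  n * n * sizeof(float));
    cudaMalloc(&C1, n * n * sizeof(float));
    cudaMalloc(&C2, n * n * sizeof(float));

    cudaStream_t s1, s2;
    cudaStreamCreate(&s1);
    cudaStreamCreate(&s2);

    cublasHandle_t h1, h2;
    cublasCreate(&h1); cublasSetStream(h1, s1);
    cublasCreate(&h2); cublasSetStream(h2, s2);

    // Two "tenants" each computing C = A * B, issued concurrently.
    cublasSgemm(h1, CUBLAS_OP_N, CUBLAS_OP_N, n, n, n, &alpha, A, n, B, n, &beta, C1, n);
    cublasSgemm(h2, CUBLAS_OP_N, CUBLAS_OP_N, n, n, n, &alpha, A, n, B, n, &beta, C2, n);
    cudaDeviceSynchronize();

    cublasDestroy(h1); cublasDestroy(h2);
    cudaStreamDestroy(s1); cudaStreamDestroy(s2);
    cudaFree(A); cudaFree(B); cudaFree(C1); cudaFree(C2);
    return 0;
}
```

Under MPS, the same two GEMMs would typically come from separate client processes, with the MPS control daemon (started via nvidia-cuda-mps-control -d) multiplexing them onto a single GPU context.
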