2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)最新文献_第6页

Design of Reward Functions for RL-based High-Speed Autonomous Driving 基于rl的高速自动驾驶奖励函数设计

2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC) Pub Date : 2022-12-01 DOI: 10.1109/MCSoC57363.2022.00015

Tanaka Kohsuke, Yuta Shintomi, Y. Okuyama, Taro Suzuki

{"title":"Design of Reward Functions for RL-based High-Speed Autonomous Driving","authors":"Tanaka Kohsuke, Yuta Shintomi, Y. Okuyama, Taro Suzuki","doi":"10.1109/MCSoC57363.2022.00015","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00015","url":null,"abstract":"We aim to design a reward function for autonomous driving by reinforcement learning for achieving high-speed driving while maintaining training stability for reaching the racetrack's goal. High-speed driving is aggressive, such as running on the road's edge as fast as possible at corners. Thus, creating reinforcement learning agents that drive at high speeds and can reach a goal is difficult in racing competition situations because of running off the road or collisions with other objects. In general, human drivers see the road ahead and make control decisions. Therefore, we design a reward function to consider the road ahead depending on the driving speed. Through experiments in a simulator, we compared our proposed reward function with others proposed in previous works in terms of driving speed and the training stability about reaching the goal. As a result of the experiment, our proposed reward function achieves an improvement of lap time by 0.71 seconds (3 %) with only a 4.4 % loss in stability in reaching a goal compared to the most stable reward function proposed in previous work.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127383373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Efficient Hardware Architecture for Posit Addition/Subtraction 正数加减法的高效硬件架构

2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC) Pub Date : 2022-12-01 DOI: 10.1109/MCSoC57363.2022.00068

Susheel Ujwal Siddamshetty, Srinivas Boppu, D. Ghosh

{"title":"Efficient Hardware Architecture for Posit Addition/Subtraction","authors":"Susheel Ujwal Siddamshetty, Srinivas Boppu, D. Ghosh","doi":"10.1109/MCSoC57363.2022.00068","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00068","url":null,"abstract":"This paper proposes an efficient architecture for the design of adder/subtractor for the recently developed universal posit number system. Posits are designed as a direct drop-in replacement for IEEE-754 standard floating-point numbers. They provide compelling advantages over floats, such as larger dynamic range, higher accuracy than the same bit width floats, bit-wise identical results across systems, no overflow or underflow, tapered accuracy, and simpler exception handling. The word size $(N)$ and exponent size $(ES)$ define a posit format. It includes a variable exponent, consisting of variable length regime-bits and exponent-bits with a maximum size of up to $ES$ bits. This also leads to a change in the size and position of the mantissa bits. These run-time variations in the length of the regime, exponent, and mantissa fields pose a challenge while designing arithmetic hardware units. Though a few adder/subtractors are proposed in the literature, they are not 100% accurate. However, the proposed design is efficient in performance metrics such as area, delay, and leakage power. Furthermore, our design is 100% accurate, on an average 15 % area, and 23 % leakage power efficient while having a similar critical path delay when compared to the recent designs proposed in the literature when synthesized using Cadence's 45 nm standard cell library.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130323948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Lightweight End-to-end Network for Wearing Mask Recognition on Low-resolution Images 基于低分辨率图像的面罩识别轻量端到端网络

2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC) Pub Date : 2022-12-01 DOI: 10.1109/MCSoC57363.2022.00016

Menglei Li, Hongbo Chen, Zixue Cheng

{"title":"A Lightweight End-to-end Network for Wearing Mask Recognition on Low-resolution Images","authors":"Menglei Li, Hongbo Chen, Zixue Cheng","doi":"10.1109/MCSoC57363.2022.00016","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00016","url":null,"abstract":"In realistic scenarios, resolution is still one of the major problems in wearing mask recognition. Due to the large distances between surveillance cameras and human faces, facial images captured by low-power devices usually have low resolution and lead to poor recognition results. To address the above issue, we propose a lightweight end-to-end network to reconstruct Super-resolution (SR) images and achieve wearing mask recognition. Besides, to apply to challenging real applications, we combine hardware devices and software technology to simulate the recognition process of wearing masks in real scenarios. To demonstrate the effectiveness of the method, we comprehensively evaluate our proposed method by comparing it with state-of-the-art methods. The recognition accuracy using super-resolution is 98.44%, outperforming RepVGG-A2 (97.00%) and ResNet34 (93.75%). Moreover, experimental results show that the number of parameters and FLOPs in our recognition model is 9.34 million and 1.85 billion, respectively, both of which outperform traditional CNN methods (20 million+ parameters and 3 billion+ FLOPs). The performance of our recognition system is competitive with state-of-the-art methods in terms of low memory usage and computational complexity, showing that the system can be cost-effectively and widely applied in real-world environments and thus has potential applications in respiratory disease prevention.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133140041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Systolic Array Based Convolutional Neural Network Inference on FPGA 基于收缩阵列的卷积神经网络推理FPGA

2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC) Pub Date : 2022-12-01 DOI: 10.1109/MCSoC57363.2022.00029

Shi Hui Chua, T. Teo, Mulat Ayinet Tiruye, I-Chyn Wey

{"title":"Systolic Array Based Convolutional Neural Network Inference on FPGA","authors":"Shi Hui Chua, T. Teo, Mulat Ayinet Tiruye, I-Chyn Wey","doi":"10.1109/MCSoC57363.2022.00029","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00029","url":null,"abstract":"Convolutional Neural Networks (CNNs) possess a particular edge over its predecessor, the Multi-Layer Perceptron (MLP). This is due to its weight sharing features that allows the CNN to use less parameters for the same number of outputs as compared to the MLP. Systolic arrays capitalize on the weight sharing property of CNNs to do data reuse while performing convolutional operations, in order to reduce the power consumption from the memory accesses. A kernel fitting systolic processing element array was designed with only positive multiplication to increase the throughput and power efficiency of the CNN accelerator, while using weight stationary dataflow to achieve data reuse in the systolic array. A cost-optimized lightweight solution is implemented through low-cost FPGA hardware so as to allow for greater accessibility. The CNN accelerator consumes 0.363 W power at 100 MHz operating frequency. A peak throughput of 10.98 GOps/s was achieved with peak performance density of 0.200 GOps/s/DSP and peak power efficiency of 30.26 GOps/s/W. Even with the added support for additional functions, proposed design achieved up to 1.59x better power efficiency compared to other systolic implementations and up to 6.17x better power efficiency compared to non-systolic implementations.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131252527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

FPGA-Based Prototype of a Quantum Annealing Simulator for Sparse Ising Model 基于fpga的稀疏Ising模型量子退火模拟器原型

2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC) Pub Date : 2022-12-01 DOI: 10.1109/MCSoC57363.2022.00039

H. M. Waidyasooriya, Yuta Ohma, M. Hariyama

{"title":"FPGA-Based Prototype of a Quantum Annealing Simulator for Sparse Ising Model","authors":"H. M. Waidyasooriya, Yuta Ohma, M. Hariyama","doi":"10.1109/MCSoC57363.2022.00039","DOIUrl":"https://doi.org/10.1109/MCSoC57363.2022.00039","url":null,"abstract":"Quantum annealing (QA) is a probabilistic approx-imation method to find the global optimum of a combinatorial optimization problem. QA is done on quantum annealers such as D-wave using quantum properties. Since the number of qubits in quantum annealers is limited, it is difficult to use those to solve large-scale real-world problems. Therefore, quantum annealing simulation on digital computers is necessary. In this paper, we discuss an FPGA based quantum annealing simulator for sparse Ising model. Unlike a fully-connected Ising model, the number of connections among spins in sparse model is limited. Highly sparse Ising models require significantly low amount of computations while allowing more parallel operations. One the other hand, sparsity and the connections among spins are not the same for different Ising models, and it is difficult to propose one specific accelerator architecture for all. We propose a method to automatically generate an application specific accelerator archi-tecture for a given sparse Ising model. The proposed accelerator fully exploits the parallelism to increase the processing speed. We design an FPGA prototype of the proposed accelerator and confirmed the correct behavior. In future, we expect to extend the proposed method to execute large quantum annealing simulations using multiple FPGAs.","PeriodicalId":150801,"journal":{"name":"2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116122286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fake Image Detection Using An Ensemble of CNN Models Specialized For Individual Face Parts 使用针对单个面部部分的CNN模型集合进行假图像检测

2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC) Pub Date : 2022-12-01 DOI: 10.1109/MCSoC57363.2022.00021

Akihisa Kawabe, Ryuto Haga, Yoichi Tomioka, Y. Okuyama, Jungpil Shin

引用次数: 2

Hardware Implementation of an Automatic Color Equalization Algorithm for Real-time Image Enhancement 用于实时图像增强的自动色彩均衡算法的硬件实现

2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC) Pub Date : 2022-12-01 DOI: 10.1109/MCSoC57363.2022.00036

Xiang-Yu Chen, Yu-Hsiang Wang, Yao-Song Zhang, Yen-Jui Chen, Shiann-Rong Kuang

引用次数: 0

Evaluation of Different Microarchitectures for Energy-Efficient RISC-V Cores 节能RISC-V内核的不同微架构评估

2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC) Pub Date : 2022-12-01 DOI: 10.1109/MCSoC57363.2022.00022

J. Kadomoto, H. Irie, S. Sakai

引用次数: 0

Radar and Camera Fusion for Object Forecasting in Driving Scenarios 基于雷达与相机融合的驾驶场景目标预测

2022 IEEE 15th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC) Pub Date : 2022-12-01 DOI: 10.1109/MCSoC57363.2022.00026

Albert Budi Christian, Yu-Hsuan Wu, Chih-Yu Lin, Lan-Da Van, Y. Tseng

引用次数: 0