{"title":"Message from the Program Committee Chairs","authors":"","doi":"10.1109/coolchips52128.2021.9410315","DOIUrl":"https://doi.org/10.1109/coolchips52128.2021.9410315","url":null,"abstract":"On behalf of the SCAM 2003 conference and program committees, we would like to welcome you to this year’s workshop. This is the third Source Code Analysis and Manipulation workshop. While it required a great deal of effort by a large number of people to put together this year’s workshop, this work only serves to underscore the greater effort put forth by Mark Harman in making the first and hence later SCAM workshops a reality. Thank you, Mark. All the committee members have worked hard to ensure that SCAM is a useful and enjoyable occasion. However, there are two members who have worked tirelessly to ensure that this occasion is also affordable! Leon Moonen who has managed to obtain external funding from The Netherlands Organisation for Scientific Research (http://www.nwo.nl) and The Royal Netherlands Academy of Arts and Sciences (http://www.knaw.nl), and Dave Binkley for doing the financing and much, much more. Thanks, Lads! The aim of the SCAM workshop is to bring together researchers and practitioners working on theory, techniques and applications which concern analysis and/or manipulation of the source code of computer systems. It is the source code that contains the only precise description of the behavior of the system. Many conferences and workshops address the applications of source code analysis and manipulation. The aim of SCAM is to focus on the algorithms and tools themselves; what they can achieve; and how they can be improved, refined, and combined. This year we received 43 regular paper submissions for the workshop and were able to select from these 21 excellent papers which cover the broad range of activity in Source Code Analysis and Manipulation. All papers were fully reviewed by three referees for relevance, soundness, and originality. 
Each paper was assigned a rating ranging from A (excellent) to D (poor). Those receiving at least two accepts (A or B rating) appear herein and were included as part of the program. For the accepted papers, 42% received an A rating, 50% a B rating, only 5% a C rating, and a residual 3% received a D rating. Overall this indicates a strong technical program. We would also like to thank our keynote speaker, Chris Verhoef, for his contribution. We would like to take this opportunity to thank the SCAM Program Committee for their hard work and expertise in reviewing the papers. In addition to thanking the authors, reviewers, and Steering Committee for their work in bringing about the third SCAM workshop, we would also like to thank Hans van Vliet, and all those responsible for putting together ICSM 2003, Leon Moonen for his work on local arrangements, Mark Harman and Jianjun Zhao for publicizing the workshop, and Silvio Stefanucci for helping to manage the review process. Thanks are also due to Stacy A. Wagner and Maggie Johnson from the IEEE, and Stephanie Kawada and Thomas Baldwin from the IEEE publications. And last, but not least to Claire Knight for designing the SCAM logo. We hope that you find t","PeriodicalId":103337,"journal":{"name":"2021 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124883130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"LSFQ: A Low Precision Full Integer Quantization for High-Performance FPGA-Based CNN Acceleration","authors":"Zhenshan Bao, Kang Zhan, Wenbo Zhang, Junnan Guo","doi":"10.1109/COOLCHIPS52128.2021.9410327","DOIUrl":"https://doi.org/10.1109/COOLCHIPS52128.2021.9410327","url":null,"abstract":"Neural network quantization has become an important research area. Deep networks that run with low-precision operations at inference time offer power and space advantages over high-precision alternatives and can still maintain high accuracy. However, few quantization methods can demonstrate this advantage on a hardware platform, because the design of quantization algorithms rarely takes the actual hardware implementation into account. In this paper, we propose an efficient quantization method for hardware implementation, learnable-parameter soft-clipping fully integer quantization (LSFQ), which includes both weight quantization and activation quantization with learnable clipping parameters. The quantization parameters are optimized automatically by backpropagation to minimize the loss; the BatchNorm layers are then fused into the convolutional layers, and the bias and quantization step size are further quantized. In this way, LSFQ achieves integer-only arithmetic. We evaluate the quantization algorithm on a variety of models, including VGG7 and MobileNet v2, on CIFAR-10 and CIFAR-100. The results show that at 3-bit or 4-bit quantization, the accuracy loss of our method is less than 1% compared with the full-precision network. 
In addition, we design an accelerator for the quantization algorithm and deploy it to the FPGA platform to verify the hardware-awareness of our method.","PeriodicalId":103337,"journal":{"name":"2021 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127062792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
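A minimal Python sketch of the uniform, learnable-step integer quantization that LSFQ builds on; the step size, bit width, and example weights are illustrative assumptions, and the paper's soft-clipping function and backpropagated step learning are omitted:

```python
def quantize(weights, step, n_bits=4):
    """Uniform symmetric quantization with a given step size.

    Clips each weight to the representable integer range, rounds to the
    nearest level, and returns both the integer codes and the
    dequantized values. Illustrative only: the paper's LSFQ uses a
    *soft* clipping function and learns `step` by backpropagation,
    which this sketch omits.
    """
    qmax = 2 ** (n_bits - 1) - 1            # e.g. 7 for signed 4-bit
    codes = [max(-qmax, min(qmax, round(w / step))) for w in weights]
    return codes, [c * step for c in codes]

codes, dequant = quantize([0.31, -0.82, 0.05, 1.40], step=0.2)
# codes -> [2, -4, 0, 7]  (1.40 clips to the 4-bit maximum level)
```

The accuracy/hardware trade-off comes from how tightly `step` and the clipping range track the weight distribution, which is why the paper learns them during training.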
{"title":"Hybrid Network of Packet Switching and STDM in a Multi-FPGA System","authors":"Tomoki Shimizu, Kohe Ito, Kensuke Iizuka, Kazuei Hironaka, H. Amano","doi":"10.1109/COOLCHIPS52128.2021.9410322","DOIUrl":"https://doi.org/10.1109/COOLCHIPS52128.2021.9410322","url":null,"abstract":"A multi-FPGA system, the Flow-in-Cloud (FiC) system, is currently being developed as a server for Multi-access Edge Computing (MEC), one of the core technologies of 5G. The FiC system is composed of mid-range FPGAs directly connected by high-speed serial links and works virtually as a single FPGA with huge resources. Since MEC applications are sometimes timing-critical, a Static Time Division Multiplexing (STDM) network has been built on the FiC system. However, the STDM network suffers from extended latency and low utilization of network resources, especially when the network traffic is light. Here, we propose a hybrid router that allows packet switching to use empty slots of the STDM network. Evaluation results from a real system show that packet switching is 2.42 times faster than STDM for an FFT on 8 boards. We also propose and evaluate a dynamic allocation method that changes the switching mode according to the network load.","PeriodicalId":103337,"journal":{"name":"2021 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129169658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
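The hybrid slot allocation described above, where packets borrow empty STDM slots, can be sketched as follows; this is an illustrative model, not the FiC router's actual arbitration logic:

```python
def schedule(slots, stdm_owner, packet_queue):
    """Fill one TDM frame: reserved slots carry their STDM circuit's
    flit; otherwise-empty slots are handed to waiting packets (the
    hybrid idea). `stdm_owner[i]` maps a slot index to its circuit, and
    unreserved slots are simply absent from the map. Sketch only.
    """
    out = []
    queue = list(packet_queue)
    for i in range(slots):
        owner = stdm_owner.get(i)
        if owner is not None:
            out.append(("stdm", owner))    # circuit traffic keeps its slot
        elif queue:
            out.append(("pkt", queue.pop(0)))  # packet reuses an empty slot
        else:
            out.append(("idle", None))
    return out

frame = schedule(4, {0: "A", 2: "B"}, ["p1", "p2"])
```

Under light circuit load most slots are unreserved, so packets see far less waiting than pure STDM, matching the motivation above.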
{"title":"Nonvolatile SRAM Using Fishbone-in-Cage Capacitor in a 180 nm Standard CMOS Process for Zero-Standby and Instant-Powerup Embedded Memory on IoT","authors":"Takaki Urabe, H. Ochi, Kazutoshi Kobayashi","doi":"10.1109/COOLCHIPS52128.2021.9410314","DOIUrl":"https://doi.org/10.1109/COOLCHIPS52128.2021.9410314","url":null,"abstract":"In this paper, we propose a nonvolatile SRAM (NVSRAM) using the Fishbone-in-Cage Capacitor (FiCC) fabricated in a 0.18μm CMOS process technology. The FiCC is implemented with ordinary metal wires, similarly to a metal-insulator-metal (MIM) capacitor, and can be fabricated in a standard CMOS process technology. Three transistors and an FiCC are added to a conventional 6-transistor SRAM cell for nonvolatile operation, with a 42% area overhead. Assuming 5 minutes of active time per hour, the proposed NVSRAM reduces power consumption by 61.8% compared with a standard SRAM. The fabricated NVSRAM operates correctly as an SRAM at 100 MHz and performs nonvolatile store and restore operations using the FiCC.","PeriodicalId":103337,"journal":{"name":"2021 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131930770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
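A back-of-the-envelope duty-cycle model illustrates why eliminating standby power pays off at the paper's assumed 5-minutes-per-hour activity; the power values here are placeholders, not the paper's measurements:

```python
def avg_power(p_active, p_standby, active_minutes_per_hour):
    """Duty-cycled average power of a memory that is active part-time.

    Toy model of the zero-standby argument: a conventional SRAM leaks
    while idle, whereas an NVSRAM can store its state in capacitors and
    power off completely. All power numbers are illustrative.
    """
    f = active_minutes_per_hour / 60.0
    return p_active * f + p_standby * (1.0 - f)

# 5 minutes active per hour, normalized active power of 1.0
sram   = avg_power(p_active=1.0, p_standby=0.30, active_minutes_per_hour=5)
nvsram = avg_power(p_active=1.0, p_standby=0.00, active_minutes_per_hour=5)
```

At such low duty cycles the standby term dominates, which is the regime the paper targets for IoT workloads.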
{"title":"Power/Performance/Area Evaluations for Next-Generation HPC Processors using the A64FX Chip","authors":"Eishi Arima, Yuetsu Kodama, Tetsuya Odajima, Miwako Tsuji, M. Sato","doi":"10.1109/COOLCHIPS52128.2021.9410320","DOIUrl":"https://doi.org/10.1109/COOLCHIPS52128.2021.9410320","url":null,"abstract":"Future HPC systems, including post-exascale supercomputers, will face severe problems such as the slowing of Moore's law and limits on power supply. To achieve the desired system performance improvement while counteracting these issues, hardware design optimization is a key factor. In this paper, we investigate future directions for SIMD-based processor architectures using the A64FX chip and customized versions of the power/performance/area simulators Gem5 and McPAT. More specifically, based on the A64FX chip, we first customize various energy parameters in the simulators and then evaluate the power and area reductions obtained by scaling the technology node down to 3 nm. Moreover, we investigate the achievable FLOPS improvement at 3 nm when scaling the number of cores, the SIMD width, and the FP pipeline width under power/area constraints. The evaluation results indicate that further SIMD/pipeline width scaling will not help improve FLOPS due to memory system bottlenecks, especially in the L1 data caches and FP register files. 
Based on these observations, we discuss future directions for SIMD-based HPC processors.","PeriodicalId":103337,"journal":{"name":"2021 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131673275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
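The scaling knobs the paper sweeps (core count, SIMD width, FP pipeline width) combine multiplicatively into peak FLOPS, as in this sketch; the configuration numbers are the publicly stated A64FX parameters, used here as an assumption:

```python
def peak_flops(cores, simd_lanes, fp_pipes, ghz, fma=True):
    """Peak FP64 FLOP/s = cores x SIMD lanes x FP pipelines x clock,
    doubled when fused multiply-add counts as two FLOPs. A
    back-of-the-envelope model of the design knobs, not the paper's
    simulator.
    """
    return cores * simd_lanes * fp_pipes * ghz * 1e9 * (2 if fma else 1)

# A64FX: 48 compute cores, 512-bit SVE = 8 FP64 lanes, 2 FMA pipes, 2.2 GHz
a64fx = peak_flops(48, 8, 2, 2.2)   # ~3.38e12, i.e. about 3.4 TFLOP/s FP64
```

Each factor in the product is a candidate for scaling, but, as the abstract notes, widening SIMD or the pipelines only helps while the memory system can feed them.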
{"title":"A Metadata Prefetching Mechanism for Hybrid Memory Architectures","authors":"S. Tsukada, Hikaru Takayashiki, Masayuki Sato, K. Komatsu, Hiroaki Kobayashi","doi":"10.1109/COOLCHIPS52128.2021.9410321","DOIUrl":"https://doi.org/10.1109/COOLCHIPS52128.2021.9410321","url":null,"abstract":"A hybrid memory, a main memory consisting of two distinct memory devices, is expected to achieve a good balance between high performance and large capacity. However, unlike a traditional memory, a hybrid memory needs metadata for data management and incurs additional access latency to reference it. To hide this latency, this paper proposes a metadata prefetching mechanism that uses address differences to control prefetching. The evaluation results show that the proposed mechanism increases the metadata hit rate in two-thirds of the examined benchmarks and improves IPC by up to 34%, and by 6% on average.","PeriodicalId":103337,"journal":{"name":"2021 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS)","volume":"189 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116333587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
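The idea of steering prefetches by address differences can be illustrated with a toy delta prefetcher; this sketches the general technique, not the paper's hardware mechanism:

```python
from collections import deque

class DeltaPrefetcher:
    """Toy delta (address-difference) prefetcher.

    On each metadata access it records the difference from the previous
    address; when the recent deltas agree, it predicts the next address
    so the metadata can be fetched ahead of the demand access.
    Illustrative only; history depth is an assumption.
    """
    def __init__(self, history=2):
        self.prev = None
        self.deltas = deque(maxlen=history)

    def access(self, addr):
        prediction = None
        if self.prev is not None:
            self.deltas.append(addr - self.prev)
            # A stable stride => confident prefetch of the next address
            if len(self.deltas) == self.deltas.maxlen and len(set(self.deltas)) == 1:
                prediction = addr + self.deltas[-1]
        self.prev = addr
        return prediction

p = DeltaPrefetcher()
p.access(0x100)
p.access(0x140)
predicted = p.access(0x180)   # stride 0x40 confirmed -> 0x1C0
```

Hiding the metadata reference behind such predictions is what converts the extra hybrid-memory lookup latency into the IPC gains quoted above.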
{"title":"An Energy-Efficient Deep Neural Network Training Processor with Bit-Slice-Level Reconfigurability and Sparsity Exploitation","authors":"Donghyeon Han, Dongseok Im, Gwangtae Park, Youngwoo Kim, Seokchan Song, Juhyoung Lee, H. Yoo","doi":"10.1109/COOLCHIPS52128.2021.9410324","DOIUrl":"https://doi.org/10.1109/COOLCHIPS52128.2021.9410324","url":null,"abstract":"This paper presents an energy-efficient deep neural network (DNN) training processor with four key features: 1) Layer-wise Adaptive bit-Precision Scaling (LAPS), 2) an In-Out Slice Skipping (IOSS) core, 3) a double-buffered Reconfigurable Accumulation Network (RAN), and 4) a momentum-ADAM unified OPTimizer Core (OPTC). Thanks to bit-slice-level scalability and zero-slice skipping, it shows 5.9× higher energy efficiency than state-of-the-art on-chip-learning processors (OCLPs).","PeriodicalId":103337,"journal":{"name":"2021 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134034251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
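Bit-slice-level reconfigurability rests on decomposing operands into narrow slices, where all-zero slices can be skipped entirely; a sketch under an assumed 4-bit slice width:

```python
def bit_slices(value, slice_bits=4, n_slices=2):
    """Split an unsigned integer into little-endian bit slices.

    Sketch of the bit-slice representation that slice-level MAC arrays
    operate on: an 8-bit operand becomes two 4-bit slices, and an
    all-zero slice contributes nothing to the product, so it can be
    skipped (the zero-slice-skipping idea). Slice width and count are
    assumptions here, not the processor's exact datapath.
    """
    mask = (1 << slice_bits) - 1
    return [(value >> (i * slice_bits)) & mask for i in range(n_slices)]

slices = bit_slices(0xB7)            # -> [0x7, 0xB], low slice first
busy = [s for s in bit_slices(0x0F) if s]   # the zero high slice is skipped
```

Scaling precision per layer then amounts to choosing how many slices each layer's operands keep, which is the LAPS knob.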
{"title":"In Search of the Performance- and Energy-Efficient CNN Accelerators","authors":"S. Sedukhin, Yoichi Tomioka, Kohei Yamamoto","doi":"10.1109/COOLCHIPS52128.2021.9410350","DOIUrl":"https://doi.org/10.1109/COOLCHIPS52128.2021.9410350","url":null,"abstract":"In this paper, starting from the algorithm, a performance- and energy-efficient 3D structure, or shape, of a Tensor Processing Engine (TPE) for CNN acceleration is systematically searched for and evaluated. An optimal accelerator shape maximizes the number of concurrent MAC operations per clock cycle while minimizing the number of redundant operations. The proposed 3D vector-parallel TPE architecture with an optimal shape can be used very efficiently for considerable CNN acceleration. Due to inter-block image data independence, multiple such TPEs can be used for additional CNN acceleration. Moreover, it is shown that the proposed TPE can also be used uniformly to accelerate different CNN models such as VGG, ResNet, YOLO, and SSD.","PeriodicalId":103337,"journal":{"name":"2021 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114204092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
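The shape search starts from the MAC workload a convolution layer presents, since an optimal engine shape tiles that product space to keep every MAC unit busy each cycle; a minimal helper with example dimensions (not taken from the paper):

```python
def conv_macs(out_h, out_w, out_c, k, in_c):
    """Multiply-accumulate count of one convolution layer: each of the
    out_h * out_w * out_c output elements needs a k*k*in_c dot product.
    The engine-shape search trades off how this product space is tiled
    across the 3D array; dimensions below are illustrative.
    """
    return out_h * out_w * out_c * k * k * in_c

# Example: a 3x3 convolution, 64 -> 64 channels, on a 56x56 feature map
macs = conv_macs(56, 56, 64, 3, 64)
```

Dividing such a count by the MACs the array completes per cycle gives the cycle lower bound a well-shaped TPE should approach.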
{"title":"High Performance Multicore SHA-256 Accelerator using Fully Parallel Computation and Local Memory","authors":"Van Dai Phan, H. Pham, T. Tran, Y. Nakashima","doi":"10.1109/COOLCHIPS52128.2021.9410349","DOIUrl":"https://doi.org/10.1109/COOLCHIPS52128.2021.9410349","url":null,"abstract":"Integrity checking is indispensable in the current technological age. One of the most popular algorithms for integrity checking is SHA-256. To achieve high performance, many applications implement SHA-256 in hardware. However, the processing rate of SHA-256 is often low due to its large number of computations. Moreover, data must pass through many loop iterations to generate a hash, which requires transferring data multiple times between the accelerator and off-chip memory if local memory is not used. In this paper, an ALU combining fully parallel computation and pipeline stages is proposed to increase the SHA-256 processing rate. Moreover, local memory is attached near the ALU to reduce off-chip memory accesses during the computation iterations. For a high hash rate, we design an SoC-based multicore SHA-256 accelerator. As a result, our proposed accelerator improves throughput by more than 40% and achieves 2× higher hardware efficiency compared with the state-of-the-art design.","PeriodicalId":103337,"journal":{"name":"2021 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130145440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
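SHA-256 compresses each 512-bit block through 64 dependent rounds, so a single message offers little parallelism; hashing many independent messages at once, one per core, is the parallelism a multicore accelerator exploits. A sketch using Python's standard hashlib (not the paper's hardware design):

```python
import hashlib

# Four independent messages can be hashed concurrently because their
# round chains never interact; within one message, each round depends
# on the previous one, which is why the paper pipelines the round ALU
# and keeps intermediate state in local memory instead of off-chip.
messages = [b"message-%d" % i for i in range(4)]
digests = [hashlib.sha256(m).hexdigest() for m in messages]
```

Keeping the message schedule and working variables on-chip removes the repeated accelerator-to-DRAM transfers the abstract identifies as the bottleneck.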
{"title":"Training Low-Latency Spiking Neural Network through Knowledge Distillation","authors":"Sugahara Takuya, Renyuan Zhang, Y. Nakashima","doi":"10.1109/COOLCHIPS52128.2021.9410323","DOIUrl":"https://doi.org/10.1109/COOLCHIPS52128.2021.9410323","url":null,"abstract":"Spiking neural networks (SNNs), which enable greater computational efficiency on neuromorphic hardware, have attracted attention. Existing ANN-SNN conversion methods can effectively convert the weights of a pre-trained ANN model to an SNN. However, state-of-the-art ANN-SNN conversion methods suffer from accuracy loss and high inference latency due to ineffective conversion. To solve this problem, we train a low-latency SNN through knowledge distillation with the Kullback-Leibler divergence (KL divergence). We achieve superior accuracy on CIFAR-100: 74.42% for the VGG16 architecture with 5 timesteps. To the best of our knowledge, our work performs the fastest inference without accuracy loss compared with other state-of-the-art SNN models.","PeriodicalId":103337,"journal":{"name":"2021 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130469227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
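The distillation objective, a KL divergence between temperature-softened teacher and student outputs, can be sketched as follows; the temperature value and example logits are illustrative assumptions, and the SNN-specific spike dynamics are omitted:

```python
import math

def kl_distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) between temperature-softened distributions.

    A minimal sketch of the knowledge-distillation objective used to
    transfer an ANN teacher's soft labels to a student network; the
    temperature and logits here are assumptions, not the paper's
    training setup.
    """
    def softened(logits):
        m = max(logits)                       # stabilize the exponentials
        exps = [math.exp((z - m) / temperature) for z in logits]
        total = sum(exps)
        return [e / total for e in exps]

    p = softened(teacher_logits)
    q = softened(student_logits)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

loss = kl_distillation_loss([8.0, 2.0, 1.0], [6.0, 3.0, 2.0])
```

Minimizing this loss pulls the student's output distribution toward the teacher's soft targets, which is what lets the SNN reach high accuracy in only a few timesteps.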