{"title":"A CNN Hardware Accelerator Designed for YOLO Algorithm Based on RISC-V SoC","authors":"Xinyu Qin, Xudong Liu, Jun Han","doi":"10.1109/ASICON52560.2021.9620500","DOIUrl":null,"url":null,"abstract":"YOLO (You Only Look Once) has been widely used in the field of object detection because of its extremely fast real-time calculation speed and good migration ability. In recent years, the design of artificial intelligence systems with high real-time and low energy consumption has become a research hotspot. In this paper, we propose a CNN hardware accelerator specifically designed for YOLOv3-Tiny to increase the calculation parallelism while reducing the frequency of memory access. The design is configured and controlled by T-Head C910, a state-of-art open source multi-core processor based on RISC-V architecture. Experimental results show that the design can provide effective throughput improvement for small embedded systems with limited resources.","PeriodicalId":233584,"journal":{"name":"2021 IEEE 14th International Conference on ASIC (ASICON)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 14th International Conference on ASIC (ASICON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASICON52560.2021.9620500","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
YOLO (You Only Look Once) has been widely used in the field of object detection because of its extremely fast real-time calculation speed and good migration ability. In recent years, the design of artificial intelligence systems with high real-time and low energy consumption has become a research hotspot. In this paper, we propose a CNN hardware accelerator specifically designed for YOLOv3-Tiny to increase the calculation parallelism while reducing the frequency of memory access. The design is configured and controlled by T-Head C910, a state-of-art open source multi-core processor based on RISC-V architecture. Experimental results show that the design can provide effective throughput improvement for small embedded systems with limited resources.