{"title":"基于RISC-V SoC的YOLO算法CNN硬件加速器设计","authors":"Xinyu Qin, Xudong Liu, Jun Han","doi":"10.1109/ASICON52560.2021.9620500","DOIUrl":null,"url":null,"abstract":"YOLO (You Only Look Once) has been widely used in the field of object detection because of its extremely fast real-time calculation speed and good migration ability. In recent years, the design of artificial intelligence systems with high real-time and low energy consumption has become a research hotspot. In this paper, we propose a CNN hardware accelerator specifically designed for YOLOv3-Tiny to increase the calculation parallelism while reducing the frequency of memory access. The design is configured and controlled by T-Head C910, a state-of-art open source multi-core processor based on RISC-V architecture. Experimental results show that the design can provide effective throughput improvement for small embedded systems with limited resources.","PeriodicalId":233584,"journal":{"name":"2021 IEEE 14th International Conference on ASIC (ASICON)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A CNN Hardware Accelerator Designed for YOLO Algorithm Based on RISC-V SoC\",\"authors\":\"Xinyu Qin, Xudong Liu, Jun Han\",\"doi\":\"10.1109/ASICON52560.2021.9620500\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"YOLO (You Only Look Once) has been widely used in the field of object detection because of its extremely fast real-time calculation speed and good migration ability. In recent years, the design of artificial intelligence systems with high real-time and low energy consumption has become a research hotspot. In this paper, we propose a CNN hardware accelerator specifically designed for YOLOv3-Tiny to increase the calculation parallelism while reducing the frequency of memory access. The design is configured and controlled by T-Head C910, a state-of-art open source multi-core processor based on RISC-V architecture. Experimental results show that the design can provide effective throughput improvement for small embedded systems with limited resources.\",\"PeriodicalId\":233584,\"journal\":{\"name\":\"2021 IEEE 14th International Conference on ASIC (ASICON)\",\"volume\":\"73 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 14th International Conference on ASIC (ASICON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ASICON52560.2021.9620500\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 14th International Conference on ASIC (ASICON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASICON52560.2021.9620500","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
YOLO (You Only Look Once)算法以其极快的实时计算速度和良好的迁移能力在目标检测领域得到了广泛的应用。近年来,设计高实时性、低能耗的人工智能系统已成为研究热点。在本文中,我们提出了一个专门为YOLOv3-Tiny设计的CNN硬件加速器,以提高计算并行性,同时降低内存访问频率。该设计由T-Head C910配置和控制,T-Head C910是一款基于RISC-V架构的先进开源多核处理器。实验结果表明,该设计可以有效地提高资源有限的小型嵌入式系统的吞吐量。
A CNN Hardware Accelerator Designed for YOLO Algorithm Based on RISC-V SoC
YOLO (You Only Look Once) has been widely used in the field of object detection because of its extremely fast real-time calculation speed and good migration ability. In recent years, the design of artificial intelligence systems with high real-time and low energy consumption has become a research hotspot. In this paper, we propose a CNN hardware accelerator specifically designed for YOLOv3-Tiny to increase the calculation parallelism while reducing the frequency of memory access. The design is configured and controlled by T-Head C910, a state-of-art open source multi-core processor based on RISC-V architecture. Experimental results show that the design can provide effective throughput improvement for small embedded systems with limited resources.