{"title":"基于收缩阵列结构的轻量级卷积神经网络加速器的研究与设计","authors":"Yunping Zhao, Xiaowen Chen, Rui Xu, Jinhui Wei, Jianzhuang Lu, Bo Yuan","doi":"10.1145/3449301.3449342","DOIUrl":null,"url":null,"abstract":"With the wide application of convolutional neural networks(CNNs) in the field of artificial intelligence, more attention has been paid to the architecture design of CNNs accelerator. But even so, there is little research on hardware acceleration of light-weight CNNs, and there is a lack of systematic and in-depth exploration of lightweight CNNs accelerator design space. In this paper, we propose a design scheme for the lightweight CNNs accelerator based on the systolic array structure. Taking the MobileNet series, the typical representative of lightweight CNNs, as the test benchmark, we carry out detailed experiments and research analysis on the accelerator performance under different data-flow modes and different core computing array scales. Based on the systematic and comprehensive experiments, we provide powerful experimental data supporting and scientific guidance for the design space of the systolic array based lightweight CNNs accelerator and the trade-off of various indicators including operational efficiency, acceleration ratio, cycle time and so on, which makes up for the blank of current research in this field, and makes great convenience for subsequent designers to develop lightweight CNNs accelerators. Through our research, MobileNet V1 is speeded up nearly 1.2 times under certain conditions.","PeriodicalId":429684,"journal":{"name":"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Research and Design of Lightweight Convolutional Neural Networks Accelerator Based on Systolic Array Structure\",\"authors\":\"Yunping Zhao, Xiaowen Chen, Rui Xu, Jinhui Wei, Jianzhuang Lu, Bo Yuan\",\"doi\":\"10.1145/3449301.3449342\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the wide application of convolutional neural networks(CNNs) in the field of artificial intelligence, more attention has been paid to the architecture design of CNNs accelerator. But even so, there is little research on hardware acceleration of light-weight CNNs, and there is a lack of systematic and in-depth exploration of lightweight CNNs accelerator design space. In this paper, we propose a design scheme for the lightweight CNNs accelerator based on the systolic array structure. Taking the MobileNet series, the typical representative of lightweight CNNs, as the test benchmark, we carry out detailed experiments and research analysis on the accelerator performance under different data-flow modes and different core computing array scales. Based on the systematic and comprehensive experiments, we provide powerful experimental data supporting and scientific guidance for the design space of the systolic array based lightweight CNNs accelerator and the trade-off of various indicators including operational efficiency, acceleration ratio, cycle time and so on, which makes up for the blank of current research in this field, and makes great convenience for subsequent designers to develop lightweight CNNs accelerators. Through our research, MobileNet V1 is speeded up nearly 1.2 times under certain conditions.\",\"PeriodicalId\":429684,\"journal\":{\"name\":\"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3449301.3449342\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3449301.3449342","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Research and Design of Lightweight Convolutional Neural Networks Accelerator Based on Systolic Array Structure
With the wide application of convolutional neural networks(CNNs) in the field of artificial intelligence, more attention has been paid to the architecture design of CNNs accelerator. But even so, there is little research on hardware acceleration of light-weight CNNs, and there is a lack of systematic and in-depth exploration of lightweight CNNs accelerator design space. In this paper, we propose a design scheme for the lightweight CNNs accelerator based on the systolic array structure. Taking the MobileNet series, the typical representative of lightweight CNNs, as the test benchmark, we carry out detailed experiments and research analysis on the accelerator performance under different data-flow modes and different core computing array scales. Based on the systematic and comprehensive experiments, we provide powerful experimental data supporting and scientific guidance for the design space of the systolic array based lightweight CNNs accelerator and the trade-off of various indicators including operational efficiency, acceleration ratio, cycle time and so on, which makes up for the blank of current research in this field, and makes great convenience for subsequent designers to develop lightweight CNNs accelerators. Through our research, MobileNet V1 is speeded up nearly 1.2 times under certain conditions.