{"title":"基于沙漏网络的CNN/FPGA协同探索方法","authors":"Zhong Qiu, Jianli Li, Dongbao Liang, Tao Su","doi":"10.1109/ICET51757.2021.9450895","DOIUrl":null,"url":null,"abstract":"Field Programming Gate Array (FPGA) has become a popular choice for neural network inference for its great potential of parallel computing and abundant on-chip buffer resource. Both software and hardware researchers are trying to explore the design space to obtain the best network model and the best accelerator design. Different from the existing sequential exploration method, we propose a CNN/FPGA co-exploration method to search for the best network model and hardware design simultaneously. We applied our method to Hourglass Network, a widely used model for pose estimation, and the top 5 experimental results show that our method can achieve an average of 22.44% faster in computation, 16.48% faster in memory access, and 17.81% faster in total, with a minor accuracy loss of 0.75%. The whole search and training process only cost 75 GPU hours.","PeriodicalId":316980,"journal":{"name":"2021 IEEE 4th International Conference on Electronics Technology (ICET)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A CNN/FPGA Co-Exploration Method Based on Hourglass Network\",\"authors\":\"Zhong Qiu, Jianli Li, Dongbao Liang, Tao Su\",\"doi\":\"10.1109/ICET51757.2021.9450895\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Field Programming Gate Array (FPGA) has become a popular choice for neural network inference for its great potential of parallel computing and abundant on-chip buffer resource. Both software and hardware researchers are trying to explore the design space to obtain the best network model and the best accelerator design. Different from the existing sequential exploration method, we propose a CNN/FPGA co-exploration method to search for the best network model and hardware design simultaneously. We applied our method to Hourglass Network, a widely used model for pose estimation, and the top 5 experimental results show that our method can achieve an average of 22.44% faster in computation, 16.48% faster in memory access, and 17.81% faster in total, with a minor accuracy loss of 0.75%. The whole search and training process only cost 75 GPU hours.\",\"PeriodicalId\":316980,\"journal\":{\"name\":\"2021 IEEE 4th International Conference on Electronics Technology (ICET)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 4th International Conference on Electronics Technology (ICET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICET51757.2021.9450895\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th International Conference on Electronics Technology (ICET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICET51757.2021.9450895","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A CNN/FPGA Co-Exploration Method Based on Hourglass Network
Field Programming Gate Array (FPGA) has become a popular choice for neural network inference for its great potential of parallel computing and abundant on-chip buffer resource. Both software and hardware researchers are trying to explore the design space to obtain the best network model and the best accelerator design. Different from the existing sequential exploration method, we propose a CNN/FPGA co-exploration method to search for the best network model and hardware design simultaneously. We applied our method to Hourglass Network, a widely used model for pose estimation, and the top 5 experimental results show that our method can achieve an average of 22.44% faster in computation, 16.48% faster in memory access, and 17.81% faster in total, with a minor accuracy loss of 0.75%. The whole search and training process only cost 75 GPU hours.