{"title":"基于BNN的孤立词语音识别及其硬件实现","authors":"Xin Liu, Kefei Liu, Xiaoxin Cui, Yuan Wang","doi":"10.1109/CSTIC52283.2021.9461579","DOIUrl":null,"url":null,"abstract":"In this paper, a binary convolution neural network (BNN) is proposed to realize isolated word speech recognition task, which greatly reduces the model training parameters and training time. For isolated word data sets, the rectangular convolution kernel is designed to replace the traditional square convolution kernel, and batch normalization layer is integrated into the convolution layer to realize the lossless acceleration of the inference process. The binary convolution neural network is deployed on FPGA to realize the edge calculation.","PeriodicalId":186529,"journal":{"name":"2021 China Semiconductor Technology International Conference (CSTIC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Isolated Word Speech Recognition based on BNN and Its Hardware Implementation\",\"authors\":\"Xin Liu, Kefei Liu, Xiaoxin Cui, Yuan Wang\",\"doi\":\"10.1109/CSTIC52283.2021.9461579\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a binary convolution neural network (BNN) is proposed to realize isolated word speech recognition task, which greatly reduces the model training parameters and training time. For isolated word data sets, the rectangular convolution kernel is designed to replace the traditional square convolution kernel, and batch normalization layer is integrated into the convolution layer to realize the lossless acceleration of the inference process. The binary convolution neural network is deployed on FPGA to realize the edge calculation.\",\"PeriodicalId\":186529,\"journal\":{\"name\":\"2021 China Semiconductor Technology International Conference (CSTIC)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-03-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 China Semiconductor Technology International Conference (CSTIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSTIC52283.2021.9461579\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 China Semiconductor Technology International Conference (CSTIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSTIC52283.2021.9461579","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Isolated Word Speech Recognition based on BNN and Its Hardware Implementation
In this paper, a binary convolution neural network (BNN) is proposed to realize isolated word speech recognition task, which greatly reduces the model training parameters and training time. For isolated word data sets, the rectangular convolution kernel is designed to replace the traditional square convolution kernel, and batch normalization layer is integrated into the convolution layer to realize the lossless acceleration of the inference process. The binary convolution neural network is deployed on FPGA to realize the edge calculation.