{"title":"基于深度神经网络的声纹识别技术研究","authors":"Zinan Li, Y. Li, Wei Xiong, Min Chen, Y. Li","doi":"10.1145/3448748.3448812","DOIUrl":null,"url":null,"abstract":"In recent years, the technology of identity authentication based on biometrics is constantly improving and maturing. Due to the unique advantages of long-distance and multi equipment data acquisition, voiceprint recognition has been gradually commercialized in the past fifty years. However, the traditional voiceprint recognition method will reduce the model performance under the condition of large-scale data. In view of the above problem, this paper mainly studies text independent speaker verification system, using Mel Frequency Cepstrum Coefficient (MFCC) as speech feature parameter based on deep neural network (DNN). Compared with the traditional neural network, this method has the fast-learning ability of the network weight and high recognition rate. In this experiment, different speed speech is added to the training, which improves the recognition accuracy, and promotes the robustness of the model.","PeriodicalId":115821,"journal":{"name":"Proceedings of the 2021 International Conference on Bioinformatics and Intelligent Computing","volume":"93 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Research on Voiceprint Recognition Technology Based on Deep Neural Network\",\"authors\":\"Zinan Li, Y. Li, Wei Xiong, Min Chen, Y. Li\",\"doi\":\"10.1145/3448748.3448812\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, the technology of identity authentication based on biometrics is constantly improving and maturing. Due to the unique advantages of long-distance and multi equipment data acquisition, voiceprint recognition has been gradually commercialized in the past fifty years. However, the traditional voiceprint recognition method will reduce the model performance under the condition of large-scale data. In view of the above problem, this paper mainly studies text independent speaker verification system, using Mel Frequency Cepstrum Coefficient (MFCC) as speech feature parameter based on deep neural network (DNN). Compared with the traditional neural network, this method has the fast-learning ability of the network weight and high recognition rate. In this experiment, different speed speech is added to the training, which improves the recognition accuracy, and promotes the robustness of the model.\",\"PeriodicalId\":115821,\"journal\":{\"name\":\"Proceedings of the 2021 International Conference on Bioinformatics and Intelligent Computing\",\"volume\":\"93 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2021 International Conference on Bioinformatics and Intelligent Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3448748.3448812\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 International Conference on Bioinformatics and Intelligent Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3448748.3448812","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research on Voiceprint Recognition Technology Based on Deep Neural Network
In recent years, the technology of identity authentication based on biometrics is constantly improving and maturing. Due to the unique advantages of long-distance and multi equipment data acquisition, voiceprint recognition has been gradually commercialized in the past fifty years. However, the traditional voiceprint recognition method will reduce the model performance under the condition of large-scale data. In view of the above problem, this paper mainly studies text independent speaker verification system, using Mel Frequency Cepstrum Coefficient (MFCC) as speech feature parameter based on deep neural network (DNN). Compared with the traditional neural network, this method has the fast-learning ability of the network weight and high recognition rate. In this experiment, different speed speech is added to the training, which improves the recognition accuracy, and promotes the robustness of the model.