{"title":"无线电通信中基于自适应ResNet的说话人识别","authors":"Liu Jiahong, Bao Jie, Chen Yingshuang, Lv Chun","doi":"10.1109/ICESIT53460.2021.9696720","DOIUrl":null,"url":null,"abstract":"In this paper, a speaker recognition strategy in military radio communication is applied. In military operations, the most commonly used method of information transmission is radio communication. Speaker recognition technology can confirm the sender's identity, and effectively prevent the enemy from pretending to be our military commander to issue false orders. However, the datasets of the military commander from the radio are confidential, and there are no large open-source datasets. Consequently, speaker recognition accuracy is not ideal if we only train a small sample of speaker datasets. Therefore, we propose a transfer learning method for training. We pre-train a Deep Residual neural network (ResNet) with large sample datasets and re-train a novel adaptive model with a simple sample dataset in radio communication. Experiments are carried out using the aishell-2 dataset and the self-collected radio military command datasets. Experimental results demonstrate that the adaptive network with transfer learning method improves the performance by 23.55% relatively compared to the baseline system in radio communication.","PeriodicalId":164745,"journal":{"name":"2021 IEEE International Conference on Emergency Science and Information Technology (ICESIT)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An Adaptive ResNet Based Speaker Recognition in Radio Communication\",\"authors\":\"Liu Jiahong, Bao Jie, Chen Yingshuang, Lv Chun\",\"doi\":\"10.1109/ICESIT53460.2021.9696720\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a speaker recognition strategy in military radio communication is applied. In military operations, the most commonly used method of information transmission is radio communication. Speaker recognition technology can confirm the sender's identity, and effectively prevent the enemy from pretending to be our military commander to issue false orders. However, the datasets of the military commander from the radio are confidential, and there are no large open-source datasets. Consequently, speaker recognition accuracy is not ideal if we only train a small sample of speaker datasets. Therefore, we propose a transfer learning method for training. We pre-train a Deep Residual neural network (ResNet) with large sample datasets and re-train a novel adaptive model with a simple sample dataset in radio communication. Experiments are carried out using the aishell-2 dataset and the self-collected radio military command datasets. Experimental results demonstrate that the adaptive network with transfer learning method improves the performance by 23.55% relatively compared to the baseline system in radio communication.\",\"PeriodicalId\":164745,\"journal\":{\"name\":\"2021 IEEE International Conference on Emergency Science and Information Technology (ICESIT)\",\"volume\":\"59 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Emergency Science and Information Technology (ICESIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICESIT53460.2021.9696720\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Emergency Science and Information Technology (ICESIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICESIT53460.2021.9696720","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Adaptive ResNet Based Speaker Recognition in Radio Communication
In this paper, a speaker recognition strategy in military radio communication is applied. In military operations, the most commonly used method of information transmission is radio communication. Speaker recognition technology can confirm the sender's identity, and effectively prevent the enemy from pretending to be our military commander to issue false orders. However, the datasets of the military commander from the radio are confidential, and there are no large open-source datasets. Consequently, speaker recognition accuracy is not ideal if we only train a small sample of speaker datasets. Therefore, we propose a transfer learning method for training. We pre-train a Deep Residual neural network (ResNet) with large sample datasets and re-train a novel adaptive model with a simple sample dataset in radio communication. Experiments are carried out using the aishell-2 dataset and the self-collected radio military command datasets. Experimental results demonstrate that the adaptive network with transfer learning method improves the performance by 23.55% relatively compared to the baseline system in radio communication.