Di Pan, Ying Wei, Shili Liang, Tingfa Xu, Shuangwei Wang
{"title":"利用傅立叶变换对宽带频谱图进行两次变换的特定双词汉语词汇语音识别","authors":"Di Pan, Ying Wei, Shili Liang, Tingfa Xu, Shuangwei Wang","doi":"10.1109/EIIS.2017.8298546","DOIUrl":null,"url":null,"abstract":"This paper illustrates a method to recognize the speech of specific two-word Chinese vocabulary by analyzing speech signals using a broad-band spectrogram after Fourier transform is applied to it twice. First, we analyze the broad-band spectrogram in the frequency domain and its corresponding voice characteristics in detail after applying Fourier transform twice. Then, binary width zoning column projection is carried out in the broad-band spectrogram frequency domain. The projection value is treated as the characteristic value of speech recognition feature and the support vector machine (SVM) is considered as the classifier for recognizing the speech of specific two-word Chinese vocabulary. A total of 1000 voice samples were used in the simulation. The results using this method show a remarkable recognition rate of 93.4%. The proposed method provides a new way for vocabulary recognition.","PeriodicalId":434246,"journal":{"name":"2017 First International Conference on Electronics Instrumentation & Information Systems (EIIS)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Speech recognition of specific two-word Chinese vocabulary by applying Fourier transform twice to the broad-band spectrogram\",\"authors\":\"Di Pan, Ying Wei, Shili Liang, Tingfa Xu, Shuangwei Wang\",\"doi\":\"10.1109/EIIS.2017.8298546\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper illustrates a method to recognize the speech of specific two-word Chinese vocabulary by analyzing speech signals using a broad-band spectrogram after Fourier transform is applied to it twice. First, we analyze the broad-band spectrogram in the frequency domain and its corresponding voice characteristics in detail after applying Fourier transform twice. Then, binary width zoning column projection is carried out in the broad-band spectrogram frequency domain. The projection value is treated as the characteristic value of speech recognition feature and the support vector machine (SVM) is considered as the classifier for recognizing the speech of specific two-word Chinese vocabulary. A total of 1000 voice samples were used in the simulation. The results using this method show a remarkable recognition rate of 93.4%. The proposed method provides a new way for vocabulary recognition.\",\"PeriodicalId\":434246,\"journal\":{\"name\":\"2017 First International Conference on Electronics Instrumentation & Information Systems (EIIS)\",\"volume\":\"62 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 First International Conference on Electronics Instrumentation & Information Systems (EIIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/EIIS.2017.8298546\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 First International Conference on Electronics Instrumentation & Information Systems (EIIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EIIS.2017.8298546","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speech recognition of specific two-word Chinese vocabulary by applying Fourier transform twice to the broad-band spectrogram
This paper illustrates a method to recognize the speech of specific two-word Chinese vocabulary by analyzing speech signals using a broad-band spectrogram after Fourier transform is applied to it twice. First, we analyze the broad-band spectrogram in the frequency domain and its corresponding voice characteristics in detail after applying Fourier transform twice. Then, binary width zoning column projection is carried out in the broad-band spectrogram frequency domain. The projection value is treated as the characteristic value of speech recognition feature and the support vector machine (SVM) is considered as the classifier for recognizing the speech of specific two-word Chinese vocabulary. A total of 1000 voice samples were used in the simulation. The results using this method show a remarkable recognition rate of 93.4%. The proposed method provides a new way for vocabulary recognition.