{"title":"基于小波分析的鲁棒语音特征感知尺度图","authors":"Yao Kaisheng, C. Zhigang","doi":"10.1109/ICOSP.1998.770298","DOIUrl":null,"url":null,"abstract":"In real world applications, additive noise will contaminate input speech features for speech recognition and representation when speech recognition systems are working in real environments. There have been many attempts made to find a robust speech feature. In this paper, we propose a robust speech feature, the perceptive scalogram, for speech representation and recognition. The new feature is based on some propositions which state that a human's perception of speech is a perception of specific components of sounds, and the components have a specific changing rate of their short-time spectrum. The proposed perceptive scalogram also takes consideration of the fact that speech is non-stationary, and uses wavelets as its signal analysis tool. Simulation results show the robustness of the perceptive scalogram against additive Gaussian noise.","PeriodicalId":145700,"journal":{"name":"ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)","volume":"6 6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A robust speech feature-perceptive scalogram based on wavelet analysis\",\"authors\":\"Yao Kaisheng, C. Zhigang\",\"doi\":\"10.1109/ICOSP.1998.770298\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In real world applications, additive noise will contaminate input speech features for speech recognition and representation when speech recognition systems are working in real environments. There have been many attempts made to find a robust speech feature. In this paper, we propose a robust speech feature, the perceptive scalogram, for speech representation and recognition. The new feature is based on some propositions which state that a human's perception of speech is a perception of specific components of sounds, and the components have a specific changing rate of their short-time spectrum. The proposed perceptive scalogram also takes consideration of the fact that speech is non-stationary, and uses wavelets as its signal analysis tool. Simulation results show the robustness of the perceptive scalogram against additive Gaussian noise.\",\"PeriodicalId\":145700,\"journal\":{\"name\":\"ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)\",\"volume\":\"6 6\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-10-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOSP.1998.770298\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSP.1998.770298","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A robust speech feature-perceptive scalogram based on wavelet analysis
In real world applications, additive noise will contaminate input speech features for speech recognition and representation when speech recognition systems are working in real environments. There have been many attempts made to find a robust speech feature. In this paper, we propose a robust speech feature, the perceptive scalogram, for speech representation and recognition. The new feature is based on some propositions which state that a human's perception of speech is a perception of specific components of sounds, and the components have a specific changing rate of their short-time spectrum. The proposed perceptive scalogram also takes consideration of the fact that speech is non-stationary, and uses wavelets as its signal analysis tool. Simulation results show the robustness of the perceptive scalogram against additive Gaussian noise.