Independent-speaker isolated word speech recognition based on mean-shift framing using hybrid HMM/SVM classifier
K. Rahbar, A. Broumandnia
2010 18th Iranian Conference on Electrical Engineering, 2010-05-11
DOI: 10.1109/IRANIANCEE.2010.5507082
This paper studies speaker-independent isolated-word speech recognition based on mean-shift framing with a hybrid HMM/SVM classifier. The proposed framework comprises two main units: a preprocessing unit and a classification unit. The preprocessing unit segments the speech signal into suitable frames using the mean-shift gradient clustering algorithm and extracts relevant time-frequency features in a way that maximizes the relative entropy of the time-frequency energy distribution among segments. The classification unit then assigns each word to its class: a self-adaptive HMM computes the likelihood of the word under each existing class, and a support vector machine (SVM) classifies the word using the vector of all class likelihoods as its input. To validate its accuracy and stability, the method was evaluated on the TULIPS1 dataset in the presence of different kinds of additive noise provided by SPIB. Compared with the results of the previous paper, the method shows a 3.2% improvement.
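The abstract gives no implementation details, so the following is only a minimal sketch of the first stage of the classification unit: scoring one utterance under every word model and collecting the scores into the vector that the SVM would receive. It assumes plain discrete-observation HMMs evaluated with the scaled forward algorithm, which may differ from the paper's self-adaptive HMM; the names `log_likelihood`, `likelihood_vector`, and `word_models`, and all parameter values, are hypothetical. The SVM stage itself is left out.

```python
import math


def log_likelihood(obs, start, trans, emit):
    """Return log P(obs | model) for a discrete HMM (scaled forward algorithm).

    obs   : list of observation symbol indices
    start : start[s]    = P(first state is s)
    trans : trans[r][s] = P(next state is s | current state is r)
    emit  : emit[s][k]  = P(symbol k | state s)
    """
    n_states = len(start)
    # Forward variables for the first observation.
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    log_prob = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate through the transitions, then weight by the emission.
            alpha = [
                sum(alpha[r] * trans[r][s] for r in range(n_states))
                * emit[s][obs[t]]
                for s in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        # Rescale to avoid underflow; the log of the scales sums to log P.
        log_prob += math.log(scale)
        alpha = [a / scale for a in alpha]
    return log_prob


def likelihood_vector(obs, word_models):
    """Score obs under every word model; the result is the SVM input vector."""
    return [log_likelihood(obs, *model) for model in word_models]


# Hypothetical two-word vocabulary: one (start, trans, emit) tuple per word.
word_models = [
    ([0.6, 0.4], [[0.7, 0.3], [0.4, 0.6]], [[0.9, 0.1], [0.2, 0.8]]),
    ([0.5, 0.5], [[0.5, 0.5], [0.5, 0.5]], [[0.5, 0.5], [0.5, 0.5]]),
]

if __name__ == "__main__":
    print(likelihood_vector([0, 1], word_models))
```

In a full system, one such vector per training utterance would be paired with its word label and passed to an SVM trainer. Using the whole vector rather than only its largest entry lets the SVM exploit how the competing models score relative to each other.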