{"title":"基于DCT的非线性预测编码在语音识别系统中的特征提取","authors":"M. Azar, F. Razzazi","doi":"10.1109/CIMSA.2008.4595825","DOIUrl":null,"url":null,"abstract":"Speech representation strategies play a key role in automatic speech recognition systems. In this study, a nonlinear procedure has been proposed to overcome the complexities of speech sequence representations. The proposed method may be considered as an extension of nonlinear predictive coding representation procedure in cosine transform domain. The best results belong to classification of nonlinear behaved stop phonemes (i.e. /b/, /d/, /g/) in TIMIT database which show good performance while reducing the computational complexity in comparison to standard NPC.","PeriodicalId":302812,"journal":{"name":"2008 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"A DCT based nonlinear predictive coding for feature extraction in speech recognition systems\",\"authors\":\"M. Azar, F. Razzazi\",\"doi\":\"10.1109/CIMSA.2008.4595825\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech representation strategies play a key role in automatic speech recognition systems. In this study, a nonlinear procedure has been proposed to overcome the complexities of speech sequence representations. The proposed method may be considered as an extension of nonlinear predictive coding representation procedure in cosine transform domain. The best results belong to classification of nonlinear behaved stop phonemes (i.e. /b/, /d/, /g/) in TIMIT database which show good performance while reducing the computational complexity in comparison to standard NPC.\",\"PeriodicalId\":302812,\"journal\":{\"name\":\"2008 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIMSA.2008.4595825\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIMSA.2008.4595825","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A DCT based nonlinear predictive coding for feature extraction in speech recognition systems
Speech representation strategies play a key role in automatic speech recognition systems. In this study, a nonlinear procedure has been proposed to overcome the complexities of speech sequence representations. The proposed method may be considered as an extension of nonlinear predictive coding representation procedure in cosine transform domain. The best results belong to classification of nonlinear behaved stop phonemes (i.e. /b/, /d/, /g/) in TIMIT database which show good performance while reducing the computational complexity in comparison to standard NPC.