{"title":"葡萄牙语连续语音识别的特征集","authors":"S. Dos Santos, A. Alcaim","doi":"10.1109/ITS.1998.713103","DOIUrl":null,"url":null,"abstract":"We evaluate the performance of different feature sets in continuous speech recognition systems for the Portuguese language. Results were obtained for the task of recognizing sequences of digits spoken in a fluent manner. We have investigated five parametric descriptions of speech, selected among the most-used ones in present continuous speech recognition systems. We show that the feature set providing the best results for the Portuguese language comprises 18 parameters, 15 derived from the PLP-cepstrum and 3 from the energy. In the speaker-independent mode, a word accuracy of 99.5% was obtained. The performance of a Mel-cepstrum-based set with 39 parameters was 99.3% word-accurate.","PeriodicalId":205350,"journal":{"name":"ITS'98 Proceedings. SBT/IEEE International Telecommunications Symposium (Cat. No.98EX202)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feature sets in continuous speech recognition for the Portuguese language\",\"authors\":\"S. Dos Santos, A. Alcaim\",\"doi\":\"10.1109/ITS.1998.713103\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We evaluate the performance of different feature sets in continuous speech recognition systems for the Portuguese language. Results were obtained for the task of recognizing sequences of digits spoken in a fluent manner. We have investigated five parametric descriptions of speech, selected among the most-used ones in present continuous speech recognition systems. We show that the feature set providing the best results for the Portuguese language comprises 18 parameters, 15 derived from the PLP-cepstrum and 3 from the energy. In the speaker-independent mode, a word accuracy of 99.5% was obtained. The performance of a Mel-cepstrum-based set with 39 parameters was 99.3% word-accurate.\",\"PeriodicalId\":205350,\"journal\":{\"name\":\"ITS'98 Proceedings. SBT/IEEE International Telecommunications Symposium (Cat. No.98EX202)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ITS'98 Proceedings. SBT/IEEE International Telecommunications Symposium (Cat. No.98EX202)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITS.1998.713103\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ITS'98 Proceedings. SBT/IEEE International Telecommunications Symposium (Cat. No.98EX202)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITS.1998.713103","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Feature sets in continuous speech recognition for the Portuguese language
We evaluate the performance of different feature sets in continuous speech recognition systems for the Portuguese language. Results were obtained for the task of recognizing sequences of digits spoken in a fluent manner. We have investigated five parametric descriptions of speech, selected among the most-used ones in present continuous speech recognition systems. We show that the feature set providing the best results for the Portuguese language comprises 18 parameters, 15 derived from the PLP-cepstrum and 3 from the energy. In the speaker-independent mode, a word accuracy of 99.5% was obtained. The performance of a Mel-cepstrum-based set with 39 parameters was 99.3% word-accurate.