{"title":"Hybrid recognizers combining hidden Markov models and multilayer perceptron","authors":"J.A. Martins, F. Violaro","doi":"10.1109/ITS.1998.713107","DOIUrl":null,"url":null,"abstract":"This paper describes some approaches for hybrid recognizers combining hidden Markov models (HMM) and multilayer perceptrons (MLP). One of these approaches employs MLP as a post-processor for HMM while the other uses HMM to segment the speech signal for MLP. The performance of hybrid recognizers is compared with discrete HMM and multilayer perceptrons. All of the implemented recognizers were speaker-independent and a 50-word vocabulary spoken in Brazilian Portuguese was employed in their evaluation. The speech signal was parametrized using mel-frequency cepstrum coefficients, mel-frequency cepstrum coefficients with cepstral mean removal, energy and delta coefficients.","PeriodicalId":205350,"journal":{"name":"ITS'98 Proceedings. SBT/IEEE International Telecommunications Symposium (Cat. No.98EX202)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ITS'98 Proceedings. SBT/IEEE International Telecommunications Symposium (Cat. No.98EX202)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITS.1998.713107","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper describes some approaches for hybrid recognizers combining hidden Markov models (HMM) and multilayer perceptrons (MLP). One of these approaches employs MLP as a post-processor for HMM while the other uses HMM to segment the speech signal for MLP. The performance of hybrid recognizers is compared with discrete HMM and multilayer perceptrons. All of the implemented recognizers were speaker-independent and a 50-word vocabulary spoken in Brazilian Portuguese was employed in their evaluation. The speech signal was parametrized using mel-frequency cepstrum coefficients, mel-frequency cepstrum coefficients with cepstral mean removal, energy and delta coefficients.