{"title":"音素因子隐马尔可夫模型的家庭环境适应性","authors":"A. B. Cavalcante, K. Shinoda, S. Furui","doi":"10.5281/ZENODO.40691","DOIUrl":null,"url":null,"abstract":"We focus on the problem of speech recognition in the presence of nonstationary sudden noise, which is very likely to happen in home environments. To handle this problem, a model compensation method based on a factorial hidden Markov model (FHMM) has been recently introduced. In this architecture, speech and noise processes are modeled in parallel by a phoneme FHMM that is built by combining a clean-speech phoneme hidden Markov model (HMM) and a sudden noise HMM. Here, to increase the robustness of this method further, we apply supervised and unsupervised home-environment adaptation of phoneme FHMMs. A database recorded by a personal robot PaPeRo in home environments was used for the evaluation of the proposed method under noisy conditions. The phoneme home-dependent FHMM achieved better recognition accuracy than the clean-speech home-independent HMM, reducing the overall relative error by 16.2% and 12.3% on average for supervised and unsupervised adaptation, respectively.","PeriodicalId":176384,"journal":{"name":"2007 15th European Signal Processing Conference","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Home-environment adaptation of phoneme factorial hidden Markov models\",\"authors\":\"A. B. Cavalcante, K. Shinoda, S. Furui\",\"doi\":\"10.5281/ZENODO.40691\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We focus on the problem of speech recognition in the presence of nonstationary sudden noise, which is very likely to happen in home environments. To handle this problem, a model compensation method based on a factorial hidden Markov model (FHMM) has been recently introduced. In this architecture, speech and noise processes are modeled in parallel by a phoneme FHMM that is built by combining a clean-speech phoneme hidden Markov model (HMM) and a sudden noise HMM. Here, to increase the robustness of this method further, we apply supervised and unsupervised home-environment adaptation of phoneme FHMMs. A database recorded by a personal robot PaPeRo in home environments was used for the evaluation of the proposed method under noisy conditions. The phoneme home-dependent FHMM achieved better recognition accuracy than the clean-speech home-independent HMM, reducing the overall relative error by 16.2% and 12.3% on average for supervised and unsupervised adaptation, respectively.\",\"PeriodicalId\":176384,\"journal\":{\"name\":\"2007 15th European Signal Processing Conference\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 15th European Signal Processing Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5281/ZENODO.40691\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 15th European Signal Processing Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.40691","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Home-environment adaptation of phoneme factorial hidden Markov models
We focus on the problem of speech recognition in the presence of nonstationary sudden noise, which is very likely to happen in home environments. To handle this problem, a model compensation method based on a factorial hidden Markov model (FHMM) has been recently introduced. In this architecture, speech and noise processes are modeled in parallel by a phoneme FHMM that is built by combining a clean-speech phoneme hidden Markov model (HMM) and a sudden noise HMM. Here, to increase the robustness of this method further, we apply supervised and unsupervised home-environment adaptation of phoneme FHMMs. A database recorded by a personal robot PaPeRo in home environments was used for the evaluation of the proposed method under noisy conditions. The phoneme home-dependent FHMM achieved better recognition accuracy than the clean-speech home-independent HMM, reducing the overall relative error by 16.2% and 12.3% on average for supervised and unsupervised adaptation, respectively.