{"title":"基于特征重构的未知结构自动语音识别系统语音增强","authors":"Wooil Kim","doi":"10.21742/IJISSE.2017.1.1.02","DOIUrl":null,"url":null,"abstract":"This study proposes a speech enhancement method which can be applied as a front-end to an automatic speech recognition system with an unknown structure. In this paper, a speech enhancement method is proposed, which is based on a feature reconstruction method employing variational model composition. In the proposed scheme, a gain parameter is estimated from the reconstructed speech feature and it is used for speech enhancement. The experimental results show that the proposed speech method significantly outperforms the existing front-end algorithms for unknown speech recognition over various background noise conditions. The results demonstrate that the proposed method can be effectively employed for an unknown ASR system to improve speech recognition performance, where no knowledge of the ASR system is available including the feature type and acoustic model.","PeriodicalId":433219,"journal":{"name":"The International Journal on the Image","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Speech Enhancement Based on Feature Reconstruction for Automatic Speech Recognition System with Unknown Structure\",\"authors\":\"Wooil Kim\",\"doi\":\"10.21742/IJISSE.2017.1.1.02\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study proposes a speech enhancement method which can be applied as a front-end to an automatic speech recognition system with an unknown structure. In this paper, a speech enhancement method is proposed, which is based on a feature reconstruction method employing variational model composition. In the proposed scheme, a gain parameter is estimated from the reconstructed speech feature and it is used for speech enhancement. The experimental results show that the proposed speech method significantly outperforms the existing front-end algorithms for unknown speech recognition over various background noise conditions. The results demonstrate that the proposed method can be effectively employed for an unknown ASR system to improve speech recognition performance, where no knowledge of the ASR system is available including the feature type and acoustic model.\",\"PeriodicalId\":433219,\"journal\":{\"name\":\"The International Journal on the Image\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-02-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The International Journal on the Image\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21742/IJISSE.2017.1.1.02\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The International Journal on the Image","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21742/IJISSE.2017.1.1.02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speech Enhancement Based on Feature Reconstruction for Automatic Speech Recognition System with Unknown Structure
This study proposes a speech enhancement method which can be applied as a front-end to an automatic speech recognition system with an unknown structure. In this paper, a speech enhancement method is proposed, which is based on a feature reconstruction method employing variational model composition. In the proposed scheme, a gain parameter is estimated from the reconstructed speech feature and it is used for speech enhancement. The experimental results show that the proposed speech method significantly outperforms the existing front-end algorithms for unknown speech recognition over various background noise conditions. The results demonstrate that the proposed method can be effectively employed for an unknown ASR system to improve speech recognition performance, where no knowledge of the ASR system is available including the feature type and acoustic model.