{"title":"基于并行模型补偿和语音活动检测方法的噪声鲁棒语音识别","authors":"Serhat Hizlisoy, Z. Tufekci","doi":"10.1109/ICEDSA.2016.7818517","DOIUrl":null,"url":null,"abstract":"The main purpose of this study is to increase the performance of a speech recognition system under noisy environments. In this study Voice Activity Detection (VAD) methods is used for estimating the noise model, and Parallel Model Compensation (PMC) is used for estimating the noisy speech model using the clean speech model and noise model which is estimated using a VAD method. Performances of the baseline and four well known VAD methods have been compared for noisy speech recognition. In addition to this, a new VAD method is proposed to estimate parameters of the noise model. The proposed VAD method's speech recognition performance is better than the most of the well-known VAD methods despite less computational requirement of the proposed VAD method compared to these well-known VAD methods.","PeriodicalId":247318,"journal":{"name":"2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA)","volume":"129 10","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Noise robust speech recognition using parallel model compensation and voice activity detection methods\",\"authors\":\"Serhat Hizlisoy, Z. Tufekci\",\"doi\":\"10.1109/ICEDSA.2016.7818517\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The main purpose of this study is to increase the performance of a speech recognition system under noisy environments. In this study Voice Activity Detection (VAD) methods is used for estimating the noise model, and Parallel Model Compensation (PMC) is used for estimating the noisy speech model using the clean speech model and noise model which is estimated using a VAD method. Performances of the baseline and four well known VAD methods have been compared for noisy speech recognition. In addition to this, a new VAD method is proposed to estimate parameters of the noise model. The proposed VAD method's speech recognition performance is better than the most of the well-known VAD methods despite less computational requirement of the proposed VAD method compared to these well-known VAD methods.\",\"PeriodicalId\":247318,\"journal\":{\"name\":\"2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA)\",\"volume\":\"129 10\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEDSA.2016.7818517\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEDSA.2016.7818517","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
本研究的主要目的是提高语音识别系统在噪声环境下的性能。本研究采用语音活动检测(Voice Activity Detection, VAD)方法对噪声模型进行估计,并采用并行模型补偿(Parallel model Compensation, PMC)方法对含噪语音模型进行估计,使用VAD方法对纯净语音模型和噪声模型进行估计。比较了基线和四种已知的VAD方法在噪声语音识别中的性能。此外,提出了一种新的VAD方法来估计噪声模型的参数。与大多数已知的VAD方法相比,所提出的VAD方法的计算量较少,但其语音识别性能优于大多数已知的VAD方法。
Noise robust speech recognition using parallel model compensation and voice activity detection methods
The main purpose of this study is to increase the performance of a speech recognition system under noisy environments. In this study Voice Activity Detection (VAD) methods is used for estimating the noise model, and Parallel Model Compensation (PMC) is used for estimating the noisy speech model using the clean speech model and noise model which is estimated using a VAD method. Performances of the baseline and four well known VAD methods have been compared for noisy speech recognition. In addition to this, a new VAD method is proposed to estimate parameters of the noise model. The proposed VAD method's speech recognition performance is better than the most of the well-known VAD methods despite less computational requirement of the proposed VAD method compared to these well-known VAD methods.