{"title":"基于听觉的噪声和混响语音增强单声特征","authors":"Yi-jiao Jiang, Runsheng Liu, Ya Bai","doi":"10.1109/CIIS.2017.23","DOIUrl":null,"url":null,"abstract":"The deep neural networks (DNN) based speech enhancements is a hot topic in machine learning and speech enhancement application. Even with deep neural network, it is still hard to improve the speech quality on noisy and reverberant conditions. For machine learning based system, auditory feature extraction becomes the key point in speech enhancement and recognition. In this paper, we proposed a speech enhancement framework based on an auditory-based monaural feature, which model the function of human hearing auditory system. The auditory based feature is extracted from the data passing the gammatone filter banks, which has more detail on low frequency than normal filters. Systemic tests show the better performance of the proposed auditory based monaural feature than the mel-frequency cepstral coefficients (MFCC) in noise and reverberant environment.","PeriodicalId":254342,"journal":{"name":"2017 International Conference on Computing Intelligence and Information System (CIIS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"An Auditory-Based Monaural Feature for Noisy and Reverberant Speech Enhancement\",\"authors\":\"Yi-jiao Jiang, Runsheng Liu, Ya Bai\",\"doi\":\"10.1109/CIIS.2017.23\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The deep neural networks (DNN) based speech enhancements is a hot topic in machine learning and speech enhancement application. Even with deep neural network, it is still hard to improve the speech quality on noisy and reverberant conditions. For machine learning based system, auditory feature extraction becomes the key point in speech enhancement and recognition. In this paper, we proposed a speech enhancement framework based on an auditory-based monaural feature, which model the function of human hearing auditory system. The auditory based feature is extracted from the data passing the gammatone filter banks, which has more detail on low frequency than normal filters. Systemic tests show the better performance of the proposed auditory based monaural feature than the mel-frequency cepstral coefficients (MFCC) in noise and reverberant environment.\",\"PeriodicalId\":254342,\"journal\":{\"name\":\"2017 International Conference on Computing Intelligence and Information System (CIIS)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Computing Intelligence and Information System (CIIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIIS.2017.23\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Computing Intelligence and Information System (CIIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIIS.2017.23","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Auditory-Based Monaural Feature for Noisy and Reverberant Speech Enhancement
The deep neural networks (DNN) based speech enhancements is a hot topic in machine learning and speech enhancement application. Even with deep neural network, it is still hard to improve the speech quality on noisy and reverberant conditions. For machine learning based system, auditory feature extraction becomes the key point in speech enhancement and recognition. In this paper, we proposed a speech enhancement framework based on an auditory-based monaural feature, which model the function of human hearing auditory system. The auditory based feature is extracted from the data passing the gammatone filter banks, which has more detail on low frequency than normal filters. Systemic tests show the better performance of the proposed auditory based monaural feature than the mel-frequency cepstral coefficients (MFCC) in noise and reverberant environment.