Single-channel speech separation using zero-phase models
Authors: Y. Lee, Chul Kwak, I. Lee, O. Kwon
Venue: IEEE International Symposium on Consumer Electronics (ISCE 2010)
Published: 2010-06-07
DOI: 10.1109/ISCE.2010.5523701 (https://doi.org/10.1109/ISCE.2010.5523701)
Citations: 1
Abstract
This paper addresses single-channel speech separation: extracting and enhancing a desired speech signal from a mixture of speech signals. We propose a new speech separation algorithm that uses both magnitude and phase information and can be applied to multimedia mobile communication and navigation systems. Phase information has conventionally been neglected in speech signal processing. In the proposed method, however, we formulate a new probabilistic phase-based speech estimator based on zero-phase models to improve separation performance. In speech separation experiments, the proposed method improves the speaker-to-interference ratio (SIR) by 2.2 dB compared with a system that uses magnitude models only. When only the phase-based speech estimator is used for separation, the SIR improves by 0.8 dB. These results indicate that the proposed phase-based speech estimation method achieves a significant SIR improvement over the previous magnitude-based method.
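To make the reported dB figures concrete, the sketch below shows one common way an energy ratio such as the SIR is expressed in decibels. It assumes the SIR is the ratio of target-speaker energy to residual interference energy; the abstract does not give the paper's exact evaluation procedure, and the function name `sir_db` and the toy signals are hypothetical.

```python
import math


def sir_db(target, interference):
    """Ratio of target energy to interference energy, in dB.

    Assumed definition for illustration; the paper's exact
    evaluation procedure is not stated in the abstract.
    """
    target_energy = sum(x * x for x in target)
    interference_energy = sum(x * x for x in interference)
    return 10.0 * math.log10(target_energy / interference_energy)


# Toy signals (hypothetical): the residual interference amplitude is
# halved after separation, while the target is unchanged.
target = [1.0, -1.0, 1.0, -1.0]
interference_before = [0.5, 0.5, -0.5, -0.5]
interference_after = [0.25, 0.25, -0.25, -0.25]

# SIR improvement is the difference of the two dB values.
improvement = sir_db(target, interference_after) - sir_db(target, interference_before)
print(f"SIR improvement: {improvement:.2f} dB")
```

Under this definition, halving the interference amplitude corresponds to an improvement of about 6 dB, so the reported gains of 2.2 dB and 0.8 dB correspond to smaller reductions in residual interference energy.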