{"title":"使用零相位模型的单通道语音分离","authors":"Y. Lee, Chul Kwak, I. Lee, O. Kwon","doi":"10.1109/ISCE.2010.5523701","DOIUrl":null,"url":null,"abstract":"This paper addresses the problem of single-channel speech separation to extract and enhance the desired speech signals from mixed speech signals. We propose a new speech separation algorithm by utilizing both magnitude and phase information, which can be applied to multimedia mobile communication and navigation systems. Conventionally, phase information has been neglected in speech signal processing. However, in the proposed method, we originally formulate a probabilistic phase-based speech estimator based on zero-phase models to improve the speech separation performance. In the speech separation experiments, the proposed method is shown to improve speaker-to-interference ratio (SIR) by 2.2 dB compared to the system using magnitude models only. When only phase-based speech estimator is used for speech separation, the SIR was improved by 0.8 dB. This result justify that the proposed phase-based speech estimation method achieves significant SIR improvement compared with the previous magnitude-based method.","PeriodicalId":403652,"journal":{"name":"IEEE International Symposium on Consumer Electronics (ISCE 2010)","volume":"198 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Single-channel speech separation using zero-phase models\",\"authors\":\"Y. Lee, Chul Kwak, I. Lee, O. Kwon\",\"doi\":\"10.1109/ISCE.2010.5523701\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper addresses the problem of single-channel speech separation to extract and enhance the desired speech signals from mixed speech signals. We propose a new speech separation algorithm by utilizing both magnitude and phase information, which can be applied to multimedia mobile communication and navigation systems. Conventionally, phase information has been neglected in speech signal processing. However, in the proposed method, we originally formulate a probabilistic phase-based speech estimator based on zero-phase models to improve the speech separation performance. In the speech separation experiments, the proposed method is shown to improve speaker-to-interference ratio (SIR) by 2.2 dB compared to the system using magnitude models only. When only phase-based speech estimator is used for speech separation, the SIR was improved by 0.8 dB. This result justify that the proposed phase-based speech estimation method achieves significant SIR improvement compared with the previous magnitude-based method.\",\"PeriodicalId\":403652,\"journal\":{\"name\":\"IEEE International Symposium on Consumer Electronics (ISCE 2010)\",\"volume\":\"198 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-06-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE International Symposium on Consumer Electronics (ISCE 2010)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCE.2010.5523701\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE International Symposium on Consumer Electronics (ISCE 2010)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCE.2010.5523701","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Single-channel speech separation using zero-phase models
This paper addresses the problem of single-channel speech separation to extract and enhance the desired speech signals from mixed speech signals. We propose a new speech separation algorithm by utilizing both magnitude and phase information, which can be applied to multimedia mobile communication and navigation systems. Conventionally, phase information has been neglected in speech signal processing. However, in the proposed method, we originally formulate a probabilistic phase-based speech estimator based on zero-phase models to improve the speech separation performance. In the speech separation experiments, the proposed method is shown to improve speaker-to-interference ratio (SIR) by 2.2 dB compared to the system using magnitude models only. When only phase-based speech estimator is used for speech separation, the SIR was improved by 0.8 dB. This result justify that the proposed phase-based speech estimation method achieves significant SIR improvement compared with the previous magnitude-based method.