{"title":"基于空间初始化辅助函数的频域语音分离独立向量分析","authors":"Songbo Chen, Yuxin Zhao, Yanfeng Liang","doi":"10.1109/CSO.2014.41","DOIUrl":null,"url":null,"abstract":"Independent vector analysis (IVA) is one of the state-of-the-art methods for frequency domain speech separation, which can retain the inter-frequency dependency structure to theoretically avoid the classical permutation ambiguity inherent to blind source separation (BSS). Auxiliary function based IVA (AuxIVA) is proposed as a fast form IVA method by adopting the auxiliary function technique to avoid step size tuning. In this paper, the spatial information is introduced as a prior knowledge for AuxIVA to set an initialization, which can not only increase the convergence speed in terms of iteration number but also improve the separation performance. The experimental results with real speech signals and real room recordings confirm the advantage of the proposed method.","PeriodicalId":174800,"journal":{"name":"2014 Seventh International Joint Conference on Computational Sciences and Optimization","volume":"91 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Auxiliary Function Based Independent Vector Analysis with Spatial Initialization for Frequency Domain Speech Separation\",\"authors\":\"Songbo Chen, Yuxin Zhao, Yanfeng Liang\",\"doi\":\"10.1109/CSO.2014.41\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Independent vector analysis (IVA) is one of the state-of-the-art methods for frequency domain speech separation, which can retain the inter-frequency dependency structure to theoretically avoid the classical permutation ambiguity inherent to blind source separation (BSS). Auxiliary function based IVA (AuxIVA) is proposed as a fast form IVA method by adopting the auxiliary function technique to avoid step size tuning. In this paper, the spatial information is introduced as a prior knowledge for AuxIVA to set an initialization, which can not only increase the convergence speed in terms of iteration number but also improve the separation performance. The experimental results with real speech signals and real room recordings confirm the advantage of the proposed method.\",\"PeriodicalId\":174800,\"journal\":{\"name\":\"2014 Seventh International Joint Conference on Computational Sciences and Optimization\",\"volume\":\"91 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 Seventh International Joint Conference on Computational Sciences and Optimization\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSO.2014.41\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Seventh International Joint Conference on Computational Sciences and Optimization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSO.2014.41","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Auxiliary Function Based Independent Vector Analysis with Spatial Initialization for Frequency Domain Speech Separation
Independent vector analysis (IVA) is one of the state-of-the-art methods for frequency domain speech separation, which can retain the inter-frequency dependency structure to theoretically avoid the classical permutation ambiguity inherent to blind source separation (BSS). Auxiliary function based IVA (AuxIVA) is proposed as a fast form IVA method by adopting the auxiliary function technique to avoid step size tuning. In this paper, the spatial information is introduced as a prior knowledge for AuxIVA to set an initialization, which can not only increase the convergence speed in terms of iteration number but also improve the separation performance. The experimental results with real speech signals and real room recordings confirm the advantage of the proposed method.