Belhedi Wiem, Ben Messaoud Mohamed Anouar, Bouzid Aicha
{"title":"Soft-CASA system for single channel speech separation","authors":"Belhedi Wiem, Ben Messaoud Mohamed Anouar, Bouzid Aicha","doi":"10.1109/CEIT.2016.7929095","DOIUrl":null,"url":null,"abstract":"In this paper we study the masking effect on Computational Auditory Scene Analysis (CASA) based systems for single channel speech separation (SCSS). In this study, we focus on the benchmark masks of the literature that are namely: the ideal binary mask (IBM), the binary mask (BM) and soft mask. Each system is evaluated objectively and subjectively in order to highlight the effect of each mask on the intelligibility and the quality of the separated speech. Based on this study we develop a new system, that we call Soft-CASA for SCSS that outperforms the original one. The proposed system achieves 28.84% improvement in the Short-time Objective Intelligibility (STOI) parameter, 7.1% improvement in SNRloss and 92% improvement in overall perceptual score (OPS), compared to the original system.","PeriodicalId":355001,"journal":{"name":"2016 4th International Conference on Control Engineering & Information Technology (CEIT)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 4th International Conference on Control Engineering & Information Technology (CEIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CEIT.2016.7929095","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper we study the masking effect on Computational Auditory Scene Analysis (CASA) based systems for single channel speech separation (SCSS). In this study, we focus on the benchmark masks of the literature that are namely: the ideal binary mask (IBM), the binary mask (BM) and soft mask. Each system is evaluated objectively and subjectively in order to highlight the effect of each mask on the intelligibility and the quality of the separated speech. Based on this study we develop a new system, that we call Soft-CASA for SCSS that outperforms the original one. The proposed system achieves 28.84% improvement in the Short-time Objective Intelligibility (STOI) parameter, 7.1% improvement in SNRloss and 92% improvement in overall perceptual score (OPS), compared to the original system.