Soft-CASA system for single channel speech separation

Belhedi Wiem, Ben Messaoud Mohamed Anouar, Bouzid Aicha
Published in: 2016 4th International Conference on Control Engineering & Information Technology (CEIT)
Publication date: 2016-12-01
DOI: 10.1109/CEIT.2016.7929095 (https://doi.org/10.1109/CEIT.2016.7929095)
Citation count: 1

Abstract

This paper studies the effect of masking on Computational Auditory Scene Analysis (CASA) based systems for single-channel speech separation (SCSS). We focus on three benchmark masks from the literature: the ideal binary mask (IBM), the binary mask (BM), and the soft mask. Each system is evaluated both objectively and subjectively to highlight the effect of each mask on the intelligibility and quality of the separated speech. Based on this study, we develop a new system for SCSS, which we call Soft-CASA, that outperforms the original one. Compared to the original system, the proposed system achieves a 28.84% improvement in Short-time Objective Intelligibility (STOI), a 7.1% improvement in SNRloss, and a 92% improvement in overall perceptual score (OPS).
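The abstract names the masks without defining them. As a rough illustration only, the sketch below uses the common textbook definitions: an ideal binary mask that keeps a time-frequency unit when the target dominates the interferer, and a soft (ratio) mask that weights each unit by the target's share of the energy. The function names, the 0 dB threshold `lc_db`, and the toy energy values are assumptions made for this example; they are not taken from the paper, whose exact mask formulations may differ.

```python
import numpy as np

EPS = 1e-12  # assumed small constant to avoid division by zero and log(0)


def ideal_binary_mask(target_energy, interferer_energy, lc_db=0.0):
    """Return 1.0 where the local target-to-interferer ratio exceeds lc_db, else 0.0.

    Textbook IBM definition; it needs the premixed signals (oracle knowledge).
    """
    local_snr_db = 10.0 * np.log10((target_energy + EPS) / (interferer_energy + EPS))
    return (local_snr_db > lc_db).astype(float)


def soft_mask(target_energy, interferer_energy):
    """Return the target's share of the energy in each unit, a value in [0, 1]."""
    return target_energy / (target_energy + interferer_energy + EPS)


# Hypothetical time-frequency energies (rows: frequency channels, columns: frames).
target = np.array([[4.0, 1.0, 0.0],
                   [9.0, 1.0, 2.0]])
interferer = np.array([[1.0, 1.0, 3.0],
                       [1.0, 4.0, 2.0]])

ibm = ideal_binary_mask(target, interferer)
soft = soft_mask(target, interferer)

# Either mask is applied by element-wise multiplication with the mixture
# representation (here approximated as the sum of the two energies).
mixture = target + interferer
separated_with_ibm = ibm * mixture
separated_with_soft = soft * mixture
```

The binary mask makes an all-or-nothing decision per unit, whereas the soft mask retains a proportional part of every unit. The abstract's emphasis on intelligibility and quality suggests this difference is what the study compares.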