Speech Intelligibility of Microphone Arrays in Reverberant Environments with Interference

2018 IEEE 20th International Workshop on Multimedia Signal Processing (MMSP). Pub Date: 2018-08-01. DOI: 10.1109/MMSP.2018.8547053

Elham Ideli, R. Vaughan, I. Bajić


Citations: 2

Abstract

It is known that speech intelligibility degrades with additive noise and reverberation, and that quantitative parameters such as fidelity and signal-to-noise ratio can be improved by using microphone arrays with various beamforming algorithms. However, it is not clear how the array configuration impacts the intelligibility of speech. Numerical experiments using widely used models provide the most convenient comparison, and the approach allows rapid assessment of parameters such as the array configuration, the number and spacing of the elements, and modeled features such as room reflection coefficients. For a typical reverberant room with a single wanted source and two unwanted sources (interferers), we compare the performance of two ceiling-mounted configurations: a uniform linear array (ULA) and a uniform circular array (UCA). The microphones are taken as omnidirectional and equispaced along the array loci, and we use a standard gain-constrained power minimization beamformer. In this study, a limiting performance is presented by emphasizing the early reflections over the late ones in the prior steering vector. Under this steering vector condition, for the same number of elements, the UCA easily outperforms the ULA on known quality and intelligibility metrics. For both arrays in this room scenario, all the metrics increase with an increasing number of microphones, although for one intelligibility metric, diminishing returns set in at about 12 microphones.
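The setup described in the abstract can be illustrated with a minimal sketch, given here only as an assumption-laden example and not as the authors' implementation. It places equispaced omnidirectional elements on a line (ULA) or a circle (UCA) and computes gain-constrained power minimization weights, w = R^{-1} d / (d^H R^{-1} d), at a single frequency. The function names, the 2-D geometry, the free-field steering vector (the paper instead emphasizes early reflections in its steering vector), the speed of sound, and the diagonal loading term are all assumptions introduced for illustration.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s; assumed value, not taken from the paper


def ula_positions(n_mics, spacing):
    """Equispaced elements on a line along x, centred on the origin."""
    x = (np.arange(n_mics) - (n_mics - 1) / 2.0) * spacing
    return np.stack([x, np.zeros(n_mics)], axis=1)


def uca_positions(n_mics, radius):
    """Equispaced elements on a circle centred on the origin."""
    angles = 2.0 * np.pi * np.arange(n_mics) / n_mics
    return np.stack([radius * np.cos(angles), radius * np.sin(angles)], axis=1)


def free_field_steering(positions, source, freq_hz):
    """Direct-path steering vector (assumption: no reflections modeled).

    Phases are referenced to the first microphone.
    """
    dists = np.linalg.norm(positions - source, axis=1)
    delays = (dists - dists[0]) / SPEED_OF_SOUND
    return np.exp(-2j * np.pi * freq_hz * delays)


def mpdr_weights(cov, steering, loading=1e-6):
    """Minimize output power w^H R w subject to unit gain w^H d = 1.

    A small diagonal loading (hypothetical default) keeps the solve stable.
    """
    n = cov.shape[0]
    loaded = cov + loading * np.trace(cov).real / n * np.eye(n)
    r_inv_d = np.linalg.solve(loaded, steering)
    return r_inv_d / (steering.conj() @ r_inv_d)


# Toy scene: one wanted source, one interferer, white sensor noise.
mics = uca_positions(8, 0.10)
d = free_field_steering(mics, np.array([1.0, 2.0]), 1000.0)
v = free_field_steering(mics, np.array([-2.0, 0.5]), 1000.0)
cov = np.outer(d, d.conj()) + 4.0 * np.outer(v, v.conj()) + 0.01 * np.eye(8)
w = mpdr_weights(cov, d)
print("gain toward wanted source:", abs(np.vdot(w, d)))
print("gain toward interferer:", abs(np.vdot(w, v)))
```

Swapping `uca_positions` for `ula_positions` with the same number of elements gives the other configuration; comparing intelligibility, as the paper does, would additionally require a room model and the relevant metrics, which this sketch does not attempt.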

