A unified beamforming and source separation model for static and dynamic human-robot interaction.

IF 1.2 Q3 ACOUSTICS

JASA express letters | Pub Date: 2024-03-01 | DOI: 10.1121/10.0025238

Jorge Wuth, Rodrigo Mahu, Israel Cohen, Richard M Stern, Néstor Becerra Yoma

Citations: 0

Abstract

This paper presents a unified model for combining beamforming and blind source separation (BSS). The model's assumptions are validated by accurately recovering target speech information in noise using oracle information. On real static human-robot interaction (HRI) data, the proposed combination of BSS with the minimum-variance distortionless response (MVDR) beamformer provides a greater signal-to-noise ratio (SNR) than previous parallel and cascade systems that combine BSS and beamforming. In the difficult-to-model dynamic HRI environment, where the parallel combination is infeasible, the system provides an SNR gain 2.8 dB greater than that obtained with the cascade combination.
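For readers unfamiliar with the terms in the abstract, the following is a minimal sketch of the textbook minimum-variance distortionless response weights, w = R⁻¹d / (dᴴR⁻¹d), and of an SNR gain expressed in dB. It is not the paper's unified model, which the abstract does not specify. The function names, the four-microphone array, the steering vector, and the noise covariance are all hypothetical values chosen for illustration.

```python
import numpy as np


def mvdr_weights(noise_cov, steering):
    """Textbook MVDR weights for one frequency bin: R^{-1} d / (d^H R^{-1} d)."""
    r_inv_d = np.linalg.solve(noise_cov, steering)
    return r_inv_d / (steering.conj() @ r_inv_d)


def snr_db(signal_power, noise_power):
    """Signal-to-noise ratio in decibels."""
    return 10.0 * np.log10(signal_power / noise_power)


rng = np.random.default_rng(0)
num_mics = 4  # assumption: small illustrative array

# Assumption: unit-modulus steering vector toward the target speaker.
steering = np.exp(-1j * np.pi * 0.3 * np.arange(num_mics))

# Assumption: random Hermitian positive-definite noise covariance.
a = rng.standard_normal((num_mics, num_mics)) + 1j * rng.standard_normal(
    (num_mics, num_mics)
)
noise_cov = a @ a.conj().T + num_mics * np.eye(num_mics)

w = mvdr_weights(noise_cov, steering)

# Distortionless constraint: the target direction passes with unit gain.
distortionless_response = w.conj() @ steering

# SNR at reference microphone 0 for a unit-power target, before beamforming.
snr_in = snr_db(1.0, noise_cov[0, 0].real)

# SNR at the beamformer output.
noise_out = (w.conj() @ noise_cov @ w).real
snr_out = snr_db(abs(distortionless_response) ** 2, noise_out)

snr_gain_db = snr_out - snr_in
print(f"SNR gain over reference microphone: {snr_gain_db:.2f} dB")
```

Because the single-microphone pass-through also satisfies the distortionless constraint, the MVDR output noise can never exceed the reference-microphone noise, so the gain computed here is non-negative. The 2.8 dB figure in the abstract is a difference between two such gains, one for the proposed system and one for the cascade combination.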




Source journal

JASA express letters

CiteScore

1.70

Self-citation rate

0.00%

Articles published