Evaluating synthesized speech intelligibility in noise.

IF 1.4 Q3 ACOUSTICS

JASA express letters Pub Date : 2025-04-01 DOI:10.1121/10.0036397

Ye Yang, Dathan Nguyen, Katherine Chen, Fan-Gang Zeng

引用次数: 0

Abstract

Humans can modify their speech to improve intelligibility in noisy environments. With the advancement of speech synthesis technology, machines may also synthesize voices that remain highly intelligible in noise condition. This study evaluates both the subjective and objective intelligibility of synthesized speech in speech-shaped noise from three major speech synthesis platforms. It was found that synthesized voices have a similar intelligibility range to human voices, and some synthesized voices were more intelligible than human voices. It was also found that two modern automatic speech recognition systems recognized 10% more words than human listeners.

查看原文本刊更多论文

评价噪声环境下的合成语音清晰度。

在嘈杂的环境中，人类可以修改自己的语言以提高可理解性。随着语音合成技术的进步，机器也可以合成在噪声条件下仍然具有高度可理解性的语音。本研究在三个主要的语音合成平台上对语音形状噪声下合成语音的主观和客观可理解度进行了评价。研究发现，人工合成的声音与人类的声音具有相似的可理解范围，有些人工合成的声音比人类的声音更容易理解。研究还发现，两种现代自动语音识别系统比人类听众多识别10%的单词。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

JASA express letters

CiteScore

1.70

自引率

0.00%

发文量