A computational model for MOS prediction

1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351). Pub Date: 1999-06-20. DOI: 10.1109/SCFT.1999.781511

Doh-Suk Kim, O. Ghitza, P. Kroon

Citations: 2

Abstract

A computational model to predict MOS (mean opinion score) of processed speech is proposed. The system measures the distortion of processed speech (compared to the source speech) using a peripheral model of the mammalian auditory system and a psychophysically-inspired measure, and maps the distortion value onto the MOS scale. This paper describes our attempt to derive a "universal", database-independent, distortion-to-MOS mapping function. Preliminary experimental evaluation shows that the performance of the proposed system is comparable with ITU-T recommendation P.861 for clean speech sources, and outperforms the P.861 recommendation for speech sources corrupted by either car or babble noise at 30 dB SNR.
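The abstract does not state the form of the distortion-to-MOS mapping, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a simple linear least-squares mapping, a 1 to 5 MOS scale, and made-up distortion/MOS pairs; the function names `fit_linear_mapping`, `predict_mos`, and `pearson` are hypothetical. It shows how a fitted mapping might be evaluated by correlating predicted scores with subjective ones.

```python
from math import sqrt
from statistics import mean


def fit_linear_mapping(distortions, mos_scores):
    """Least-squares fit of mos ~ a + b * distortion (assumed form, for illustration)."""
    d_bar, m_bar = mean(distortions), mean(mos_scores)
    sxx = sum((d - d_bar) ** 2 for d in distortions)
    sxy = sum((d - d_bar) * (m - m_bar) for d, m in zip(distortions, mos_scores))
    b = sxy / sxx
    a = m_bar - b * d_bar
    return a, b


def predict_mos(distortion, a, b):
    """Map a distortion value onto the MOS scale, clipping to the assumed 1..5 range."""
    return min(5.0, max(1.0, a + b * distortion))


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Synthetic data, invented for this sketch: larger distortion, lower subjective score.
distortions = [0.1, 0.5, 1.0, 1.5, 2.0]
subjective_mos = [4.4, 3.9, 3.2, 2.6, 1.9]

a, b = fit_linear_mapping(distortions, subjective_mos)
predicted = [predict_mos(d, a, b) for d in distortions]
correlation = pearson(predicted, subjective_mos)
```

A "database-independent" mapping, in the sense the abstract describes, would mean fitting `a` and `b` once and reusing them on other test sets rather than refitting per database; the linear form here is a stand-in for whatever function the paper actually uses.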

