Multidimensional scaling of systems in the Voice Conversion Challenge 2016

Speech Synthesis Workshop. Pub Date: 2016-09-13. DOI: 10.21437/SSW.2016-7

M. Wester, Zhizheng Wu, J. Yamagishi

Citations: 8

Abstract

This study investigates how listeners judge the similarity of voice-converted voices using a talker discrimination task. The data come from the Voice Conversion Challenge 2016, in which 17 participants from around the world built voice-converted voices from a shared data set of source and target speakers. This paper describes the similarity evaluation for four of the source-target pairs (two intra-gender and two cross-gender) in more detail. Multidimensional scaling was performed to illustrate where listeners perceived each system to lie in an acoustic space relative to the source and target speakers and to the other systems.
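As a rough illustration of the multidimensional scaling step, the sketch below embeds a small dissimilarity matrix in two dimensions. The abstract does not say which MDS variant or software was used, so classical (Torgerson) MDS is assumed here. The labels and dissimilarity values are made up for illustration (imagine the proportion of "different speaker" responses for each pair) and are not results from the challenge.

```python
import numpy as np


def classical_mds(dissim, n_dims=2):
    """Classical (Torgerson) MDS, assumed here; the paper may use another variant.

    Returns coordinates whose Euclidean distances approximate `dissim`.
    """
    d = np.asarray(dissim, dtype=float)
    n = d.shape[0]
    # Double-centre the squared dissimilarities to obtain a Gram matrix.
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (d ** 2) @ centering
    # Eigen-decompose and keep the n_dims largest eigenvalues.
    eigvals, eigvecs = np.linalg.eigh(gram)
    order = np.argsort(eigvals)[::-1][:n_dims]
    # Negative eigenvalues (non-Euclidean input) are clipped to zero.
    top_vals = np.clip(eigvals[order], 0.0, None)
    return eigvecs[:, order] * np.sqrt(top_vals)


# Hypothetical labels and dissimilarities, for illustration only.
labels = ["source", "target", "system_A", "system_B"]
dissim = np.array([
    [0.0, 0.9, 0.7, 0.3],
    [0.9, 0.0, 0.4, 0.8],
    [0.7, 0.4, 0.0, 0.6],
    [0.3, 0.8, 0.6, 0.0],
])

coords = classical_mds(dissim)
for label, (x, y) in zip(labels, coords):
    print(f"{label:>8}: ({x:+.3f}, {y:+.3f})")
```

In a plot of these coordinates, a system that lands near "target" would be one that listeners tended to judge as the same speaker as the target; the axes themselves carry no fixed meaning and the configuration is only defined up to rotation and reflection.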
