评估不同种族的人类和商业合成声音的可信度:一项比较研究

IF 5.8 Q1 PSYCHOLOGY, EXPERIMENTAL
Constantina Maltezou-Papastylianou , Reinhold Scherer , Silke Paulmann
{"title":"评估不同种族的人类和商业合成声音的可信度:一项比较研究","authors":"Constantina Maltezou-Papastylianou ,&nbsp;Reinhold Scherer ,&nbsp;Silke Paulmann","doi":"10.1016/j.chbr.2025.100762","DOIUrl":null,"url":null,"abstract":"<div><div>This study examined trustworthiness perceptions in the tone of voice of human and real-world synthesised voices, focusing on the impact of acoustic features, speaker and listener ethnicities, listener biases toward voice-based intelligent agents and speaker nature (human vs synthesised). Speech rate, mean pitch, harmonics-to-noise ratio, jitter, shimmer, cepstral peak prominence, and long-term average spectrum, significantly influenced trustworthiness ratings across both human and synthesised voices. Synthesised voices were rated as sounding more trustworthy than human voices with no explicit intent behind their tone of voice (i.e., neutral). However, synthesised voices were rated as sounding less trustworthy than human voices when human speakers intentionally attempted to sound trustworthy. Moreover, listener biases were measured using the Negative Attitudes toward Robots Scale (NARS), where a general scepticism toward robots lowered trustworthiness ratings overall. White speakers were consistently rated as more trustworthy than Black or south Asian speakers across all listener ethnic groups. The findings highlight the need to optimise acoustic properties of synthesised voices for trustworthiness while addressing biases related to speaker ethnicity and listener attitudes toward robots.</div></div>","PeriodicalId":72681,"journal":{"name":"Computers in human behavior reports","volume":"19 ","pages":"Article 100762"},"PeriodicalIF":5.8000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating trustworthiness across ethnically diverse human and commercial synthesised voices: A comparative study\",\"authors\":\"Constantina Maltezou-Papastylianou ,&nbsp;Reinhold Scherer ,&nbsp;Silke Paulmann\",\"doi\":\"10.1016/j.chbr.2025.100762\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study examined trustworthiness perceptions in the tone of voice of human and real-world synthesised voices, focusing on the impact of acoustic features, speaker and listener ethnicities, listener biases toward voice-based intelligent agents and speaker nature (human vs synthesised). Speech rate, mean pitch, harmonics-to-noise ratio, jitter, shimmer, cepstral peak prominence, and long-term average spectrum, significantly influenced trustworthiness ratings across both human and synthesised voices. Synthesised voices were rated as sounding more trustworthy than human voices with no explicit intent behind their tone of voice (i.e., neutral). However, synthesised voices were rated as sounding less trustworthy than human voices when human speakers intentionally attempted to sound trustworthy. Moreover, listener biases were measured using the Negative Attitudes toward Robots Scale (NARS), where a general scepticism toward robots lowered trustworthiness ratings overall. White speakers were consistently rated as more trustworthy than Black or south Asian speakers across all listener ethnic groups. The findings highlight the need to optimise acoustic properties of synthesised voices for trustworthiness while addressing biases related to speaker ethnicity and listener attitudes toward robots.</div></div>\",\"PeriodicalId\":72681,\"journal\":{\"name\":\"Computers in human behavior reports\",\"volume\":\"19 \",\"pages\":\"Article 100762\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2025-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers in human behavior reports\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2451958825001770\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHOLOGY, EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in human behavior reports","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2451958825001770","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

这项研究考察了人类和现实世界合成声音的语气可信度,重点关注声学特征、说话者和听者种族、听者对基于语音的智能代理的偏见和说话者性质(人类vs合成)的影响。语音速率、平均音高、谐波噪声比、抖动、闪烁、倒谱峰值突出和长期平均频谱,显著影响了人类和合成声音的可信度评级。人工合成的声音被认为比人类的声音听起来更可信,因为它们的语气背后没有明确的意图(即中性)。然而,当人类说话者有意让自己的声音听起来值得信赖时,人工合成的声音被认为不如人类的声音可信。此外,听众的偏见是用对机器人的消极态度量表(NARS)来衡量的,对机器人的普遍怀疑降低了整体的可信度评级。在所有的听众群体中,白人演讲者都被认为比黑人或南亚演讲者更值得信赖。研究结果强调,在解决与说话者种族和听者对机器人的态度有关的偏见的同时,需要优化合成声音的声学特性,以提高可信度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Evaluating trustworthiness across ethnically diverse human and commercial synthesised voices: A comparative study
This study examined trustworthiness perceptions in the tone of voice of human and real-world synthesised voices, focusing on the impact of acoustic features, speaker and listener ethnicities, listener biases toward voice-based intelligent agents and speaker nature (human vs synthesised). Speech rate, mean pitch, harmonics-to-noise ratio, jitter, shimmer, cepstral peak prominence, and long-term average spectrum, significantly influenced trustworthiness ratings across both human and synthesised voices. Synthesised voices were rated as sounding more trustworthy than human voices with no explicit intent behind their tone of voice (i.e., neutral). However, synthesised voices were rated as sounding less trustworthy than human voices when human speakers intentionally attempted to sound trustworthy. Moreover, listener biases were measured using the Negative Attitudes toward Robots Scale (NARS), where a general scepticism toward robots lowered trustworthiness ratings overall. White speakers were consistently rated as more trustworthy than Black or south Asian speakers across all listener ethnic groups. The findings highlight the need to optimise acoustic properties of synthesised voices for trustworthiness while addressing biases related to speaker ethnicity and listener attitudes toward robots.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.80
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信