A Synthetic Voice for an Assistive Conversational Agent: A Survey to Discover Italian Preferences regarding Synthetic Voice’s Gender and Quality Level

IF 4.3 Q1 PSYCHOLOGY, MULTIDISCIPLINARY

Human Behavior and Emerging Technologies Pub Date : 2023-12-28 DOI:10.1155/2023/8858268

M. Cuciniello, T. Amorese, C. Greco, Zoraida Callejas Carrión, Carl Vogel, G. Cordasco, Anna Esposito

{"title":"A Synthetic Voice for an Assistive Conversational Agent: A Survey to Discover Italian Preferences regarding Synthetic Voice’s Gender and Quality Level","authors":"M. Cuciniello, T. Amorese, C. Greco, Zoraida Callejas Carrión, Carl Vogel, G. Cordasco, Anna Esposito","doi":"10.1155/2023/8858268","DOIUrl":null,"url":null,"abstract":"Based on a previous investigation, a quantitative study aimed to identify user’ preferences towards four synthetic voices of two different quality levels (classified through the sophistication of the synthesizer: low vs. high) is proposed. The voices administered to participants were developed considering two main aspects: the voice quality (high/low) and their gender (male/female). 182 unpaid participants were recruited for the study, divided in four groups according to their age, and therefore classified as adolescents, young adults, middle-aged, and seniors. To collect data regarding each voice, randomly audited by participants, the shortened version of the Virtual Agent Voice Acceptance Questionnaire (VAVAQ) was exploited. Outcomes of the previous study revealed that the voices of high quality, regardless of their gender, received a higher acclaim by all participants examined rather than the corresponding two voices assessed as lower quality. Conversely, findings of the current study suggest that the four new groups of participants involved agreed in showing their strong preference towards the high-quality voice gendered as female compared to all the other considered voices. Regarding the two voices gendered as male, the high-quality one was considered as more original and capable to arouse positive emotional states than the low-quality one. Moreover, the high-quality male voice was judged as more natural than the female low-quality one. Results provide some insights for future directions in the user experience and design field.","PeriodicalId":36408,"journal":{"name":"Human Behavior and Emerging Technologies","volume":"327 6","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Behavior and Emerging Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1155/2023/8858268","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Based on a previous investigation, a quantitative study aimed to identify user’ preferences towards four synthetic voices of two different quality levels (classified through the sophistication of the synthesizer: low vs. high) is proposed. The voices administered to participants were developed considering two main aspects: the voice quality (high/low) and their gender (male/female). 182 unpaid participants were recruited for the study, divided in four groups according to their age, and therefore classified as adolescents, young adults, middle-aged, and seniors. To collect data regarding each voice, randomly audited by participants, the shortened version of the Virtual Agent Voice Acceptance Questionnaire (VAVAQ) was exploited. Outcomes of the previous study revealed that the voices of high quality, regardless of their gender, received a higher acclaim by all participants examined rather than the corresponding two voices assessed as lower quality. Conversely, findings of the current study suggest that the four new groups of participants involved agreed in showing their strong preference towards the high-quality voice gendered as female compared to all the other considered voices. Regarding the two voices gendered as male, the high-quality one was considered as more original and capable to arouse positive emotional states than the low-quality one. Moreover, the high-quality male voice was judged as more natural than the female low-quality one. Results provide some insights for future directions in the user experience and design field.

查看原文本刊更多论文

辅助对话代理的合成语音：调查发现意大利人对合成语音的性别和质量水平的偏好

在先前调查的基础上，我们提出了一项定量研究，旨在确定用户对两种不同质量水平的四种合成声音的偏好（根据合成器的复杂程度分类：低与高）。为参与者设计的声音主要考虑两个方面：声音质量（高/低）和性别（男/女）。研究共招募了 182 名无偿参与者，按年龄分为四组，即青少年组、青年组、中年组和老年组。为了收集有关每种语音的数据，研究人员采用了虚拟代理语音接受度问卷（VAVAQ）的简短版本，由参与者随机审核。前一项研究结果表明，无论性别如何，高质量的语音在所有受试者中都获得了较高的赞誉，而不是相应的两个被评估为低质量的语音。相反，本次研究的结果表明，与其他所有声音相比，四组新的参与者都更倾向于性别为女性的高质量声音。至于两种男性声音，高质量的声音被认为比低质量的声音更新颖，更能唤起积极的情绪状态。此外，高质量的男声被认为比低质量的女声更自然。研究结果为用户体验和设计领域的未来发展方向提供了一些启示。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Human Behavior and Emerging Technologies Social Sciences-Social Sciences (all)

CiteScore

17.20

自引率

8.70%

发文量

期刊介绍： Human Behavior and Emerging Technologies is an interdisciplinary journal dedicated to publishing high-impact research that enhances understanding of the complex interactions between diverse human behavior and emerging digital technologies.