为语音助手和人类受话人标注前音突出度

IF 2.2 2区心理学 Q2 PSYCHOLOGY

Journal of Experimental Psychology-Learning Memory and Cognition Pub Date : 2025-06-01 Epub Date: 2024-11-18 DOI:10.1037/xlm0001396

Eleonora Beier, Michelle Cohn, Timothy Trammel, Fernanda Ferreira, Georgia Zellou

{"title":"为语音助手和人类受话人标注前音突出度","authors":"Eleonora Beier, Michelle Cohn, Timothy Trammel, Fernanda Ferreira, Georgia Zellou","doi":"10.1037/xlm0001396","DOIUrl":null,"url":null,"abstract":"Prosodic prominence (realized with phonetic features such as increased intensity, duration, and pitch, among others) is thought to guide listeners' attention by focusing new information. This study investigates production and perception of prosodic prominence toward two types of addressees: a human and a voice assistant interlocutor. We examine how the language system adapts to this increasingly common technology, by testing whether prosodic prominence is subject to audience design when addressing an interlocutor that is consistently rated as having less communicative ability. Stimuli consisted of question-answer pairs, where California English speakers read identical sentences (e.g., \"Jude saw the sun\") in response to interlocutors' questions probing different foci (e.g., \"Who saw the sun?\"). Experiment 1 reveals consistent acoustic adjustments to mark focus on either the subject or the object of a sentence. In Experiment 2, we find that listeners reliably infer the intended information structure based on these acoustic adjustments. Across both experiments, we see no consistent difference in focus marking by type of interlocutor (human vs. voice assistant). Nonetheless, listeners associate particular features (slower speech rate) with speech directed at voice assistants. Taken together, our findings suggest that while speakers apply communicative strategies from human-human interaction when addressing voice assistants, listeners expect a device-specific register. (PsycInfo Database Record (c) 2025 APA, all rights reserved).","PeriodicalId":50194,"journal":{"name":"Journal of Experimental Psychology-Learning Memory and Cognition","volume":" ","pages":"986-1003"},"PeriodicalIF":2.2000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Marking prosodic prominence for voice assistant and human addressees.\",\"authors\":\"Eleonora Beier, Michelle Cohn, Timothy Trammel, Fernanda Ferreira, Georgia Zellou\",\"doi\":\"10.1037/xlm0001396\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Prosodic prominence (realized with phonetic features such as increased intensity, duration, and pitch, among others) is thought to guide listeners' attention by focusing new information. This study investigates production and perception of prosodic prominence toward two types of addressees: a human and a voice assistant interlocutor. We examine how the language system adapts to this increasingly common technology, by testing whether prosodic prominence is subject to audience design when addressing an interlocutor that is consistently rated as having less communicative ability. Stimuli consisted of question-answer pairs, where California English speakers read identical sentences (e.g., \\\"Jude saw the sun\\\") in response to interlocutors' questions probing different foci (e.g., \\\"Who saw the sun?\\\"). Experiment 1 reveals consistent acoustic adjustments to mark focus on either the subject or the object of a sentence. In Experiment 2, we find that listeners reliably infer the intended information structure based on these acoustic adjustments. Across both experiments, we see no consistent difference in focus marking by type of interlocutor (human vs. voice assistant). Nonetheless, listeners associate particular features (slower speech rate) with speech directed at voice assistants. Taken together, our findings suggest that while speakers apply communicative strategies from human-human interaction when addressing voice assistants, listeners expect a device-specific register. (PsycInfo Database Record (c) 2025 APA, all rights reserved).\",\"PeriodicalId\":50194,\"journal\":{\"name\":\"Journal of Experimental Psychology-Learning Memory and Cognition\",\"volume\":\" \",\"pages\":\"986-1003\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Experimental Psychology-Learning Memory and Cognition\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://doi.org/10.1037/xlm0001396\",\"RegionNum\":2,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/11/18 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"PSYCHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Experimental Psychology-Learning Memory and Cognition","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1037/xlm0001396","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/18 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"PSYCHOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

人们认为，前音突出（通过增加强度、持续时间和音高等语音特征来实现）可以通过集中新信息来引导听者的注意力。本研究调查了两种类型的受话者对前音突出的产生和感知：人类和语音助理对话者。我们检验了语言系统如何适应这种日益普遍的技术，并测试了在向一直被评为交际能力较弱的对话者说话时，前音突出是否受受众设计的影响。刺激物包括问答对，加州英语使用者在回答对话者提出的不同焦点问题（如 "谁看到了太阳"）时，要朗读相同的句子（如 "Jude saw the sun"）。实验 1 显示了一致的声学调整，以标记句子中的主语或宾语。在实验 2 中，我们发现听者能根据这些声音调整可靠地推断出预期的信息结构。在这两项实验中，我们发现不同类型的对话者（人类与语音助手）在重点标记方面没有一致的差异。然而，听者会将特定的特征（语速较慢）与针对语音助手的语音联系起来。综上所述，我们的研究结果表明，虽然说话者在对语音助手讲话时采用了人与人之间的交流策略，但听者却期望使用特定设备的语域。(PsycInfo Database Record (c) 2024 APA, 版权所有）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Marking prosodic prominence for voice assistant and human addressees.

Prosodic prominence (realized with phonetic features such as increased intensity, duration, and pitch, among others) is thought to guide listeners' attention by focusing new information. This study investigates production and perception of prosodic prominence toward two types of addressees: a human and a voice assistant interlocutor. We examine how the language system adapts to this increasingly common technology, by testing whether prosodic prominence is subject to audience design when addressing an interlocutor that is consistently rated as having less communicative ability. Stimuli consisted of question-answer pairs, where California English speakers read identical sentences (e.g., "Jude saw the sun") in response to interlocutors' questions probing different foci (e.g., "Who saw the sun?"). Experiment 1 reveals consistent acoustic adjustments to mark focus on either the subject or the object of a sentence. In Experiment 2, we find that listeners reliably infer the intended information structure based on these acoustic adjustments. Across both experiments, we see no consistent difference in focus marking by type of interlocutor (human vs. voice assistant). Nonetheless, listeners associate particular features (slower speech rate) with speech directed at voice assistants. Taken together, our findings suggest that while speakers apply communicative strategies from human-human interaction when addressing voice assistants, listeners expect a device-specific register. (PsycInfo Database Record (c) 2025 APA, all rights reserved).

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Experimental Psychology-Learning Memory and Cognition 医学-心理学

CiteScore

4.30

自引率

3.80%

发文量

163

审稿时长

4-8 weeks

期刊介绍： The Journal of Experimental Psychology: Learning, Memory, and Cognition publishes studies on perception, control of action, perceptual aspects of language processing, and related cognitive processes.