通过众包听众评分来测量说话人的年龄：一项语音研究的试点研究。

IF 2.2 2区医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY

Journal of Speech Language and Hearing Research Pub Date : 2025-02-04 Epub Date: 2025-01-17 DOI:10.1044/2024_JSLHR-24-00125

Raquel M A Tripp, Eric J Hunter, Aaron M Johnson

{"title":"通过众包听众评分来测量说话人的年龄：一项语音研究的试点研究。","authors":"Raquel M A Tripp, Eric J Hunter, Aaron M Johnson","doi":"10.1044/2024_JSLHR-24-00125","DOIUrl":null,"url":null,"abstract":"Purpose: Most auditory-perceptual voice research utilizes the judgments of trained listeners rather than everyday listeners with no previous training in speech pathology. Online crowdsourcing of behavioral data from untrained participants is rapidly increasing in popularity but has yet to be a common procedure for auditory-perceptual studies of the voice. The objective of this pilot study was to assess the functionality of this model for judgments of voice by using an online experiment platform to replicate a lab-based, voice-specific age estimation study.Method: Fifty crowdsourced untrained listeners estimated the age of a single talker based on audio samples taken from 20 speeches over a 48-year span. The primary outcome was overall age estimation accuracy.Results: The crowdsourced age estimations closely matched those of a previous highly controlled in-person laboratory study using the same auditory samples. Listeners generally overestimated the talker's age when the talker was younger and underestimated his age when he was older. The age at which the estimated age equaled the talker's chronological age was 54 years.Conclusions: Online crowdsourcing may be a feasible modality for auditory-perceptual voice ratings with the potential to add low-cost, high-number options to validate and enhance clinical and laboratory-based studies by (a) including a wider diversity of participants and (b) providing the means for rapidly recruiting more participants. Further research investigating crowdsourced ratings of the complex parameters of voice quality using more listeners is needed to continue supporting this methodology as a tool for perceptual voice research.","PeriodicalId":51254,"journal":{"name":"Journal of Speech Language and Hearing Research","volume":" ","pages":"531-546"},"PeriodicalIF":2.2000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Measuring Talker Age Estimates Through Crowdsourced Listeners' Ratings: A Pilot Study for Voice Research.\",\"authors\":\"Raquel M A Tripp, Eric J Hunter, Aaron M Johnson\",\"doi\":\"10.1044/2024_JSLHR-24-00125\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Purpose: Most auditory-perceptual voice research utilizes the judgments of trained listeners rather than everyday listeners with no previous training in speech pathology. Online crowdsourcing of behavioral data from untrained participants is rapidly increasing in popularity but has yet to be a common procedure for auditory-perceptual studies of the voice. The objective of this pilot study was to assess the functionality of this model for judgments of voice by using an online experiment platform to replicate a lab-based, voice-specific age estimation study.Method: Fifty crowdsourced untrained listeners estimated the age of a single talker based on audio samples taken from 20 speeches over a 48-year span. The primary outcome was overall age estimation accuracy.Results: The crowdsourced age estimations closely matched those of a previous highly controlled in-person laboratory study using the same auditory samples. Listeners generally overestimated the talker's age when the talker was younger and underestimated his age when he was older. The age at which the estimated age equaled the talker's chronological age was 54 years.Conclusions: Online crowdsourcing may be a feasible modality for auditory-perceptual voice ratings with the potential to add low-cost, high-number options to validate and enhance clinical and laboratory-based studies by (a) including a wider diversity of participants and (b) providing the means for rapidly recruiting more participants. Further research investigating crowdsourced ratings of the complex parameters of voice quality using more listeners is needed to continue supporting this methodology as a tool for perceptual voice research.\",\"PeriodicalId\":51254,\"journal\":{\"name\":\"Journal of Speech Language and Hearing Research\",\"volume\":\" \",\"pages\":\"531-546\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-02-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Speech Language and Hearing Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1044/2024_JSLHR-24-00125\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/17 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Speech Language and Hearing Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1044/2024_JSLHR-24-00125","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/17 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

目的：大多数听觉感知语音研究利用受过训练的听者的判断，而不是没有受过语言病理学训练的日常听者的判断。从未经训练的参与者那里获得行为数据的在线众包正在迅速普及，但尚未成为声音听觉感知研究的常用程序。本初步研究的目的是通过使用在线实验平台来复制基于实验室的语音特定年龄估计研究，评估该模型在语音判断方面的功能。方法：50个未经训练的听众根据48年间20次演讲的音频样本来估计单个演讲者的年龄。主要结果是总体年龄估计的准确性。结果：众包年龄估计与先前使用相同听觉样本进行的高度控制的现场实验室研究密切匹配。当说话者年轻时，听众通常会高估他的年龄，而当他年老时，听众通常会低估他的年龄。估计的年龄等于说话者实际年龄的年龄是54岁。结论：在线众包可能是听觉感知语音评级的一种可行模式，有可能增加低成本、高数量的选择，通过(a)包括更广泛的参与者，(b)提供快速招募更多参与者的手段，来验证和加强临床和实验室研究。需要进一步研究使用更多听众对复杂的语音质量参数进行众包评级，以继续支持该方法作为感知语音研究的工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Measuring Talker Age Estimates Through Crowdsourced Listeners' Ratings: A Pilot Study for Voice Research.

Purpose: Most auditory-perceptual voice research utilizes the judgments of trained listeners rather than everyday listeners with no previous training in speech pathology. Online crowdsourcing of behavioral data from untrained participants is rapidly increasing in popularity but has yet to be a common procedure for auditory-perceptual studies of the voice. The objective of this pilot study was to assess the functionality of this model for judgments of voice by using an online experiment platform to replicate a lab-based, voice-specific age estimation study.

Method: Fifty crowdsourced untrained listeners estimated the age of a single talker based on audio samples taken from 20 speeches over a 48-year span. The primary outcome was overall age estimation accuracy.

Results: The crowdsourced age estimations closely matched those of a previous highly controlled in-person laboratory study using the same auditory samples. Listeners generally overestimated the talker's age when the talker was younger and underestimated his age when he was older. The age at which the estimated age equaled the talker's chronological age was 54 years.

Conclusions: Online crowdsourcing may be a feasible modality for auditory-perceptual voice ratings with the potential to add low-cost, high-number options to validate and enhance clinical and laboratory-based studies by (a) including a wider diversity of participants and (b) providing the means for rapidly recruiting more participants. Further research investigating crowdsourced ratings of the complex parameters of voice quality using more listeners is needed to continue supporting this methodology as a tool for perceptual voice research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Speech Language and Hearing Research AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY-REHABILITATION

CiteScore

4.10

自引率

19.20%

发文量

538

审稿时长

4-8 weeks

期刊介绍： Mission: JSLHR publishes peer-reviewed research and other scholarly articles on the normal and disordered processes in speech, language, hearing, and related areas such as cognition, oral-motor function, and swallowing. The journal is an international outlet for both basic research on communication processes and clinical research pertaining to screening, diagnosis, and management of communication disorders as well as the etiologies and characteristics of these disorders. JSLHR seeks to advance evidence-based practice by disseminating the results of new studies as well as providing a forum for critical reviews and meta-analyses of previously published work. Scope: The broad field of communication sciences and disorders, including speech production and perception; anatomy and physiology of speech and voice; genetics, biomechanics, and other basic sciences pertaining to human communication; mastication and swallowing; speech disorders; voice disorders; development of speech, language, or hearing in children; normal language processes; language disorders; disorders of hearing and balance; psychoacoustics; and anatomy and physiology of hearing.