Measuring Talker Age Estimates Through Crowdsourced Listeners' Ratings: A Pilot Study for Voice Research
Authors: Raquel M A Tripp, Eric J Hunter, Aaron M Johnson
Journal: Journal of Speech Language and Hearing Research, pp. 531-546
Published: 2025-02-04 (Epub 2025-01-17)
DOI: https://doi.org/10.1044/2024_JSLHR-24-00125
Purpose: Most auditory-perceptual voice research utilizes the judgments of trained listeners rather than everyday listeners with no previous training in speech pathology. Online crowdsourcing of behavioral data from untrained participants is rapidly increasing in popularity but has yet to be a common procedure for auditory-perceptual studies of the voice. The objective of this pilot study was to assess the functionality of this model for judgments of voice by using an online experiment platform to replicate a lab-based, voice-specific age estimation study.
Method: Fifty crowdsourced untrained listeners estimated the age of a single talker based on audio samples taken from 20 speeches over a 48-year span. The primary outcome was overall age estimation accuracy.
Results: The crowdsourced age estimations closely matched those of a previous highly controlled in-person laboratory study using the same auditory samples. Listeners generally overestimated the talker's age when the talker was younger and underestimated his age when he was older. The age at which the estimated age equaled the talker's chronological age was 54 years.
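The crossover age reported in the Results can be illustrated with a minimal sketch. The abstract does not say how the 54-year crossover was computed, so this assumes a simple linear fit of estimated age on chronological age. A slope below 1 reproduces the pattern of overestimating the younger talker and underestimating the older one, and the fitted line meets the identity line at intercept / (1 - slope). The data points and the helper names `fit_line` and `crossover_age` are hypothetical, chosen only so that the arithmetic is easy to follow. They are not the study's data or method.

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


def crossover_age(intercept, slope):
    """Age at which the fitted estimate equals the chronological age.

    Solves age = intercept + slope * age for age.
    """
    if slope == 1:
        raise ValueError("fitted line is parallel to the identity line")
    return intercept / (1 - slope)


# Hypothetical illustration data (NOT from the study): chronological ages
# across a 48-year span and made-up mean listener estimates.
actual = [30, 42, 54, 66, 78]
estimated = [42, 48, 54, 60, 66]

intercept, slope = fit_line(actual, estimated)
print(f"slope = {slope:.2f}, intercept = {intercept:.1f}")
print(f"crossover age = {crossover_age(intercept, slope):.1f}")
```

With these made-up values the slope is 0.5 and the intercept is 27, so the crossover falls at 27 / (1 - 0.5) = 54. Real listener data would be noisier, and a different model (for example, a mixed-effects fit per listener) could give a different crossover.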
Conclusions: Online crowdsourcing may be a feasible modality for auditory-perceptual voice ratings with the potential to add low-cost, high-number options to validate and enhance clinical and laboratory-based studies by (a) including a wider diversity of participants and (b) providing the means for rapidly recruiting more participants. Further research investigating crowdsourced ratings of the complex parameters of voice quality using more listeners is needed to continue supporting this methodology as a tool for perceptual voice research.
About the journal:
Mission: JSLHR publishes peer-reviewed research and other scholarly articles on the normal and disordered processes in speech, language, hearing, and related areas such as cognition, oral-motor function, and swallowing. The journal is an international outlet for both basic research on communication processes and clinical research pertaining to screening, diagnosis, and management of communication disorders as well as the etiologies and characteristics of these disorders. JSLHR seeks to advance evidence-based practice by disseminating the results of new studies as well as providing a forum for critical reviews and meta-analyses of previously published work.
Scope: The broad field of communication sciences and disorders, including speech production and perception; anatomy and physiology of speech and voice; genetics, biomechanics, and other basic sciences pertaining to human communication; mastication and swallowing; speech disorders; voice disorders; development of speech, language, or hearing in children; normal language processes; language disorders; disorders of hearing and balance; psychoacoustics; and anatomy and physiology of hearing.