Jenna L Cramer, Ashley Reynard, Vanessa Torres, Jeremy J Donai
{"title":"Spectral and temporal information and presentation mode effects on individual speaker identification and listening effort.","authors":"Jenna L Cramer, Ashley Reynard, Vanessa Torres, Jeremy J Donai","doi":"10.1121/10.0039369","DOIUrl":null,"url":null,"abstract":"<p><p>Identifying speakers of interest in an auditory scene is a fundamental task that facilitates effective communication. Little is known about the specific contributions of spectral and temporal detail required for identifying a specific speaker of interest by human listeners. This study investigated the relative contributions of spectral and temporal detail for identifying a speaker of interest and perceived effort in doing so. Results showed significant improvements in speaker identification and decreased effort ratings as spectral channels increased. Improved speaker identification performance with increased temporal filter cutoff from 20 Hz to 800 Hz was observed. These results have implications for speech signal processing by amplification devices and automated speaker recognition systems.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":"5 9","pages":""},"PeriodicalIF":1.4000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JASA express letters","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1121/10.0039369","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Identifying speakers of interest in an auditory scene is a fundamental task that facilitates effective communication. Little is known about the specific contributions of spectral and temporal detail required for identifying a specific speaker of interest by human listeners. This study investigated the relative contributions of spectral and temporal detail for identifying a speaker of interest and perceived effort in doing so. Results showed significant improvements in speaker identification and decreased effort ratings as spectral channels increased. Improved speaker identification performance with increased temporal filter cutoff from 20 Hz to 800 Hz was observed. These results have implications for speech signal processing by amplification devices and automated speaker recognition systems.