{"title":"STRAIGHT: A new speech synthesizer for vowel formant discrimination","authors":"Chang Liu, D. Kewley-Port","doi":"10.1121/1.1635431","DOIUrl":null,"url":null,"abstract":"The present study investigated whether a new tool for nearly natural speech synthesis, STRAIGHT [Kawahara et al., Speech Commun. 27, 187–207 (1999)], could be used for fine manipulation of vowel formants, using a psychophysical test of formant discrimination. Thresholds for formant discrimination of F1 and F2 for an /ɛ/ vowel, originally synthesized by the KLTSYN [Klatt, J. Acoust. Soc. Am. 67, 971–995 (1980)] and then resynthesized by STRAIGHT, were estimated. Thresholds for vowels generated by KLTSYN and by STRAIGHT were not significantly different. This result validates that STRAIGHT resynthesis can finely manipulate formant frequencies from natural speech for use in speech perception experiments.","PeriodicalId":87384,"journal":{"name":"Acoustics research letters online : ARLO","volume":"7 1","pages":"31-36"},"PeriodicalIF":0.0000,"publicationDate":"2004-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"31","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acoustics research letters online : ARLO","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1121/1.1635431","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 31
Abstract
The present study investigated whether a new tool for nearly natural speech synthesis, STRAIGHT [Kawahara et al., Speech Commun. 27, 187–207 (1999)], could be used for fine manipulation of vowel formants, using a psychophysical test of formant discrimination. Thresholds for formant discrimination of F1 and F2 for an /ɛ/ vowel, originally synthesized by the KLTSYN [Klatt, J. Acoust. Soc. Am. 67, 971–995 (1980)] and then resynthesized by STRAIGHT, were estimated. Thresholds for vowels generated by KLTSYN and by STRAIGHT were not significantly different. This result validates that STRAIGHT resynthesis can finely manipulate formant frequencies from natural speech for use in speech perception experiments.