{"title":"澳洲英语元音口音的影响及其与发音道参数的关系","authors":"D. Dersch, Chris Cléirigh, Julie Vonwiller","doi":"10.21437/ICSLP.1998-208","DOIUrl":null,"url":null,"abstract":"In this paper we analyse and compare a low dimensional linguistic representation of vowels with high dimensional prototypical vowel templates derived from a native Australian English speaker. We further perform the same analysis on Lebanese and Vietnamese accented English to investigate how di(cid:11)erences due to accents impact on such a representation. In a low dimensional linguistic representation a vowel is characterised by articulatory tract parameters. To simplify the problem, the study is restricted to vowels that, notionally at least, involve a steady state articulation i.e. a stable target con(cid:12)guration of tongue, lips and jaw between preceding and following articulatory transitions. Vowels are represented by the horizontal and vertical position of the part of the tongue involved in the key articulation of a particular vowel, e.g., high or low and front or back. To this is added lip posture, spread or rounded. Prototypical vowel templates are derived as follows. The sound pressure signal is parametrized by 12 mel-frequency cepstrum coe(cid:14)cients. At the centre of each phonetically labelled segment, 180 dimensional phone templates are extracted. For the group of short (/I/, /E/, /A/, /O/, /V/, /U/, /@/) and long vowels (/i:/, /e:/, /a:/, /o:/, /u:/, /@:/) we obtain vowel clusters by averaging over all templates of each vowel class and accent. The speech materiaThe speech material is taken from the Australian National Database Of Spoken Language (AN-DOSL). For a comparison of high dimensional vowel clusters derived from speech samples with low dimensional prototypical vowels in the articulatory tract representation we perform a reduction in dimension by a multidimensional scaling transformation in a two dimensional space. Here, a linear transformation maps a high dimensional space on a lower dimensional sub space by optimising the relative distances between data vectors. As an important result we (cid:12)nd. i) /@/ and /@:/ are surrounded by the remaining vowels; ii) the overall structure and the relative distances between the prototypical vowels are very similar. Varia-tions in the structure can be explained by the in(cid:13)uence of native Australian English, Lebanese Arabic and South Vietnamese accents.","PeriodicalId":117113,"journal":{"name":"5th International Conference on Spoken Language Processing (ICSLP 1998)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The influence of accents in australian English vowels and their relation to articulatory tract parameters\",\"authors\":\"D. Dersch, Chris Cléirigh, Julie Vonwiller\",\"doi\":\"10.21437/ICSLP.1998-208\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we analyse and compare a low dimensional linguistic representation of vowels with high dimensional prototypical vowel templates derived from a native Australian English speaker. We further perform the same analysis on Lebanese and Vietnamese accented English to investigate how di(cid:11)erences due to accents impact on such a representation. In a low dimensional linguistic representation a vowel is characterised by articulatory tract parameters. To simplify the problem, the study is restricted to vowels that, notionally at least, involve a steady state articulation i.e. a stable target con(cid:12)guration of tongue, lips and jaw between preceding and following articulatory transitions. Vowels are represented by the horizontal and vertical position of the part of the tongue involved in the key articulation of a particular vowel, e.g., high or low and front or back. To this is added lip posture, spread or rounded. Prototypical vowel templates are derived as follows. The sound pressure signal is parametrized by 12 mel-frequency cepstrum coe(cid:14)cients. At the centre of each phonetically labelled segment, 180 dimensional phone templates are extracted. For the group of short (/I/, /E/, /A/, /O/, /V/, /U/, /@/) and long vowels (/i:/, /e:/, /a:/, /o:/, /u:/, /@:/) we obtain vowel clusters by averaging over all templates of each vowel class and accent. The speech materiaThe speech material is taken from the Australian National Database Of Spoken Language (AN-DOSL). For a comparison of high dimensional vowel clusters derived from speech samples with low dimensional prototypical vowels in the articulatory tract representation we perform a reduction in dimension by a multidimensional scaling transformation in a two dimensional space. Here, a linear transformation maps a high dimensional space on a lower dimensional sub space by optimising the relative distances between data vectors. As an important result we (cid:12)nd. i) /@/ and /@:/ are surrounded by the remaining vowels; ii) the overall structure and the relative distances between the prototypical vowels are very similar. Varia-tions in the structure can be explained by the in(cid:13)uence of native Australian English, Lebanese Arabic and South Vietnamese accents.\",\"PeriodicalId\":117113,\"journal\":{\"name\":\"5th International Conference on Spoken Language Processing (ICSLP 1998)\",\"volume\":\"64 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-11-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"5th International Conference on Spoken Language Processing (ICSLP 1998)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21437/ICSLP.1998-208\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"5th International Conference on Spoken Language Processing (ICSLP 1998)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21437/ICSLP.1998-208","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The influence of accents in australian English vowels and their relation to articulatory tract parameters
In this paper we analyse and compare a low dimensional linguistic representation of vowels with high dimensional prototypical vowel templates derived from a native Australian English speaker. We further perform the same analysis on Lebanese and Vietnamese accented English to investigate how di(cid:11)erences due to accents impact on such a representation. In a low dimensional linguistic representation a vowel is characterised by articulatory tract parameters. To simplify the problem, the study is restricted to vowels that, notionally at least, involve a steady state articulation i.e. a stable target con(cid:12)guration of tongue, lips and jaw between preceding and following articulatory transitions. Vowels are represented by the horizontal and vertical position of the part of the tongue involved in the key articulation of a particular vowel, e.g., high or low and front or back. To this is added lip posture, spread or rounded. Prototypical vowel templates are derived as follows. The sound pressure signal is parametrized by 12 mel-frequency cepstrum coe(cid:14)cients. At the centre of each phonetically labelled segment, 180 dimensional phone templates are extracted. For the group of short (/I/, /E/, /A/, /O/, /V/, /U/, /@/) and long vowels (/i:/, /e:/, /a:/, /o:/, /u:/, /@:/) we obtain vowel clusters by averaging over all templates of each vowel class and accent. The speech materiaThe speech material is taken from the Australian National Database Of Spoken Language (AN-DOSL). For a comparison of high dimensional vowel clusters derived from speech samples with low dimensional prototypical vowels in the articulatory tract representation we perform a reduction in dimension by a multidimensional scaling transformation in a two dimensional space. Here, a linear transformation maps a high dimensional space on a lower dimensional sub space by optimising the relative distances between data vectors. As an important result we (cid:12)nd. i) /@/ and /@:/ are surrounded by the remaining vowels; ii) the overall structure and the relative distances between the prototypical vowels are very similar. Varia-tions in the structure can be explained by the in(cid:13)uence of native Australian English, Lebanese Arabic and South Vietnamese accents.