The role of phonological processes and acoustic confusability in phone errors in children's ASR
Authors: Evangelia Fringi, J. Lehman, M. Russell
Venue: The ... Workshop on Child, Computer and Interaction, pages 10-15
Published: 2016-09-06
DOI: https://doi.org/10.21437/WOCCI.2016-2
This paper examines the extent to which computer speech recognition errors for children's speech can be attributed to common phonological effects associated with language acquisition. Recognition results are presented for three corpora of children's speech: two comprising recordings of American English spoken by five- to nine-year-olds, and one comprising recordings of British English spoken by children aged five and six. The results are compared with adult reference confusion matrices based on TIMIT for the first two experiments, and with confusion matrices for British adults and for children with good speech for the third. The results appear to be influenced by three factors: (i) confusions that are predictable from phonological factors associated with language acquisition also arise from acoustic confusability (e.g. /k/ → /t/), (ii) the frequency of phonological errors is expected to decrease with increasing age, and (iii) an accurate recogniser is more likely than a less accurate one to detect a phonological error when it occurs. Overall, the percentage of errors attributable to phonological processes remains approximately constant in each experiment. However, the proportion of these errors that differ significantly from the reference patterns increases with recognition accuracy and is greater for children who are judged to have poor speech.
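
The comparison described above can be illustrated with a minimal Python sketch. It is not the authors' procedure: the abstract does not say which significance test was used, so the pooled two-proportion z-test here is an assumption, as are the set of phonological-process pairs (only /k/ → /t/ comes from the abstract), the function names, and the toy counts, which are invented purely for illustration.

```python
from math import sqrt, erfc

# Hypothetical phonological-process substitutions as (target, recognised).
# ("k", "t") is the example given in the abstract; the others are assumptions.
PROCESS_PAIRS = {("k", "t"), ("g", "d"), ("r", "w")}


def process_share(confusions):
    """Fraction of substitution errors that match a phonological-process pair.

    `confusions` maps (target, recognised) phone pairs to counts.
    """
    errors = {pair: n for pair, n in confusions.items() if pair[0] != pair[1]}
    total = sum(errors.values())
    if total == 0:
        return 0.0
    matched = sum(n for pair, n in errors.items() if pair in PROCESS_PAIRS)
    return matched / total


def confusion_rate(confusions, pair):
    """Return (count of target -> recognised, total tokens of the target)."""
    target = pair[0]
    total = sum(n for (t, _), n in confusions.items() if t == target)
    return confusions.get(pair, 0), total


def two_proportion_p_value(k1, n1, k2, n2):
    """Two-sided p-value of a pooled two-proportion z-test (n1, n2 > 0)."""
    pooled = (k1 + k2) / (n1 + n2)
    se = sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    if se == 0:
        return 1.0
    z = (k1 / n1 - k2 / n2) / se
    # erfc(|z| / sqrt(2)) equals 2 * (1 - Phi(|z|)) for a standard normal.
    return erfc(abs(z) / sqrt(2))


# Toy confusion counts, invented for illustration only.
child = {("k", "k"): 70, ("k", "t"): 25, ("k", "p"): 5,
         ("r", "r"): 80, ("r", "w"): 15, ("r", "l"): 5}
reference = {("k", "k"): 90, ("k", "t"): 6, ("k", "p"): 4,
             ("r", "r"): 95, ("r", "w"): 2, ("r", "l"): 3}

share = process_share(child)
k_child, n_child = confusion_rate(child, ("k", "t"))
k_ref, n_ref = confusion_rate(reference, ("k", "t"))
p_value = two_proportion_p_value(k_child, n_child, k_ref, n_ref)

print(f"share of errors matching a process pair: {share:.2f}")
print(f"/k/ -> /t/ child vs reference p-value: {p_value:.4f}")
```

In this sketch, `process_share` corresponds to the "percentage of errors attributable to phonological processes", while the p-value flags whether a single confusion such as /k/ → /t/ departs from the reference pattern. A real analysis would also need to account for multiple comparisons across phone pairs.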