Radha Krishna Guntur, Kr Ramakrishnan, V. K. Mittal
{"title":"外语口音自动分类","authors":"Radha Krishna Guntur, Kr Ramakrishnan, V. K. Mittal","doi":"10.1109/GCAT52182.2021.9587650","DOIUrl":null,"url":null,"abstract":"Automatic accent classification using a database developed with both L1 and L2 language data has been proposed. Speech samples were collected from native Indian speakers speaking in their mother tongue namely Kannada, Tamil, or Telugu, and from non-native English speakers with one of the above as the first language. The vocal tract characteristics were used in the present study. The MFCC features extracted from both native speech and non-native speech were extensively analyzed. Performance validation in Regional Nativity Identification has been investigated using both native South Indian speech, and non-native English speech by the compatriots of the linguistic groups. Detecting regional identity using MFCC features with GMMUBM / i-vector modeling has been proposed. The challenges of second language speech recognition have been addressed by leveraging native, and non-native speech, which produced an SVM classification score of 86.1%, and the Area Under Curve (AUC) is found to be well above 90% for all three languages.","PeriodicalId":436231,"journal":{"name":"2021 2nd Global Conference for Advancement in Technology (GCAT)","volume":"2013 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automatic Classification of Foreign Language Accent\",\"authors\":\"Radha Krishna Guntur, Kr Ramakrishnan, V. K. Mittal\",\"doi\":\"10.1109/GCAT52182.2021.9587650\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic accent classification using a database developed with both L1 and L2 language data has been proposed. Speech samples were collected from native Indian speakers speaking in their mother tongue namely Kannada, Tamil, or Telugu, and from non-native English speakers with one of the above as the first language. The vocal tract characteristics were used in the present study. The MFCC features extracted from both native speech and non-native speech were extensively analyzed. Performance validation in Regional Nativity Identification has been investigated using both native South Indian speech, and non-native English speech by the compatriots of the linguistic groups. Detecting regional identity using MFCC features with GMMUBM / i-vector modeling has been proposed. The challenges of second language speech recognition have been addressed by leveraging native, and non-native speech, which produced an SVM classification score of 86.1%, and the Area Under Curve (AUC) is found to be well above 90% for all three languages.\",\"PeriodicalId\":436231,\"journal\":{\"name\":\"2021 2nd Global Conference for Advancement in Technology (GCAT)\",\"volume\":\"2013 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 2nd Global Conference for Advancement in Technology (GCAT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/GCAT52182.2021.9587650\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd Global Conference for Advancement in Technology (GCAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GCAT52182.2021.9587650","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic Classification of Foreign Language Accent
Automatic accent classification using a database developed with both L1 and L2 language data has been proposed. Speech samples were collected from native Indian speakers speaking in their mother tongue namely Kannada, Tamil, or Telugu, and from non-native English speakers with one of the above as the first language. The vocal tract characteristics were used in the present study. The MFCC features extracted from both native speech and non-native speech were extensively analyzed. Performance validation in Regional Nativity Identification has been investigated using both native South Indian speech, and non-native English speech by the compatriots of the linguistic groups. Detecting regional identity using MFCC features with GMMUBM / i-vector modeling has been proposed. The challenges of second language speech recognition have been addressed by leveraging native, and non-native speech, which produced an SVM classification score of 86.1%, and the Area Under Curve (AUC) is found to be well above 90% for all three languages.