{"title":"母语和非母语英语语音分类:口音转换的前提","authors":"Abhinav Sharma, Muskaan Bhargava, A. Khanna","doi":"10.1109/icecct52121.2021.9616718","DOIUrl":null,"url":null,"abstract":"This research has been carried out with an intent of classifying if the English speech given as input is either a native or non-native accent with the help of Machine Learning classification algorithms on Mel Frequency Cepstral Coefficients. As the world has been evolving, so is the technology around us which has given rise to numerous voice-based tools. But the efficiency of these tools depends greatly on various unpredictable and pesky factors such as audio tempo and pronunciation which substantially differ throughout different accents and dialects. Hence, we chose to move ahead with working towards finding a method to best curb the problem of reduced accuracy in these technologies due to the said differences in accents and dialects. This has been carried out by applying five different machine learning classification models namely Gaussian Mixture Model, K-Nearest Neighbor, Logistic Regression, Support Vector Machine and Neural Network with an observed accuracy of 53%, 95%, 95%, 98% and 98% respectively.","PeriodicalId":155129,"journal":{"name":"2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Native and Non-Native English Speech Classification: A premise to Accent Conversion\",\"authors\":\"Abhinav Sharma, Muskaan Bhargava, A. Khanna\",\"doi\":\"10.1109/icecct52121.2021.9616718\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This research has been carried out with an intent of classifying if the English speech given as input is either a native or non-native accent with the help of Machine Learning classification algorithms on Mel Frequency Cepstral Coefficients. As the world has been evolving, so is the technology around us which has given rise to numerous voice-based tools. But the efficiency of these tools depends greatly on various unpredictable and pesky factors such as audio tempo and pronunciation which substantially differ throughout different accents and dialects. Hence, we chose to move ahead with working towards finding a method to best curb the problem of reduced accuracy in these technologies due to the said differences in accents and dialects. This has been carried out by applying five different machine learning classification models namely Gaussian Mixture Model, K-Nearest Neighbor, Logistic Regression, Support Vector Machine and Neural Network with an observed accuracy of 53%, 95%, 95%, 98% and 98% respectively.\",\"PeriodicalId\":155129,\"journal\":{\"name\":\"2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/icecct52121.2021.9616718\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icecct52121.2021.9616718","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Native and Non-Native English Speech Classification: A premise to Accent Conversion
This research has been carried out with an intent of classifying if the English speech given as input is either a native or non-native accent with the help of Machine Learning classification algorithms on Mel Frequency Cepstral Coefficients. As the world has been evolving, so is the technology around us which has given rise to numerous voice-based tools. But the efficiency of these tools depends greatly on various unpredictable and pesky factors such as audio tempo and pronunciation which substantially differ throughout different accents and dialects. Hence, we chose to move ahead with working towards finding a method to best curb the problem of reduced accuracy in these technologies due to the said differences in accents and dialects. This has been carried out by applying five different machine learning classification models namely Gaussian Mixture Model, K-Nearest Neighbor, Logistic Regression, Support Vector Machine and Neural Network with an observed accuracy of 53%, 95%, 95%, 98% and 98% respectively.