{"title":"基于支持向量机和k近邻的孟加拉语元音感知空间分类","authors":"Sourin Dey, Md. Ashraful Alam","doi":"10.1109/ICCITECHN.2018.8631948","DOIUrl":null,"url":null,"abstract":"In the emerging field of speech processing and Automatic Speech Recognition (ASR), vowel perceptual space classification has a vital role for speech intelligibility. In this paper, formant based vowel perceptual space classification is implemented for Bangla vowels. A dataset of vowel signals for 50 speakers has been prepared. The first and second formants of vowels have been extracted from segmented recorded data of different speakers. These two formants have been employed to classify the Bangla vowels perceptual space. Two algorithms namely Support Vector Machine (SVM) and K-Nearest Neighbors (KNN) are used to classify the vowels perceptual space using formants. SVM linear kernel has turned up to be efficient with 84.3% classification accuracy and SVM radial basis function (rbf) kernel has shown to be 100% accurate. KNN has exhibited maximum of 95% classification accuracy.","PeriodicalId":355984,"journal":{"name":"2018 21st International Conference of Computer and Information Technology (ICCIT)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Formant Based Bangla Vowel Perceptual Space Classification Using Support Vector Machine and K-Nearest Neighbor Method\",\"authors\":\"Sourin Dey, Md. Ashraful Alam\",\"doi\":\"10.1109/ICCITECHN.2018.8631948\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the emerging field of speech processing and Automatic Speech Recognition (ASR), vowel perceptual space classification has a vital role for speech intelligibility. In this paper, formant based vowel perceptual space classification is implemented for Bangla vowels. A dataset of vowel signals for 50 speakers has been prepared. The first and second formants of vowels have been extracted from segmented recorded data of different speakers. These two formants have been employed to classify the Bangla vowels perceptual space. Two algorithms namely Support Vector Machine (SVM) and K-Nearest Neighbors (KNN) are used to classify the vowels perceptual space using formants. SVM linear kernel has turned up to be efficient with 84.3% classification accuracy and SVM radial basis function (rbf) kernel has shown to be 100% accurate. KNN has exhibited maximum of 95% classification accuracy.\",\"PeriodicalId\":355984,\"journal\":{\"name\":\"2018 21st International Conference of Computer and Information Technology (ICCIT)\",\"volume\":\"58 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 21st International Conference of Computer and Information Technology (ICCIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCITECHN.2018.8631948\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 21st International Conference of Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCITECHN.2018.8631948","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Formant Based Bangla Vowel Perceptual Space Classification Using Support Vector Machine and K-Nearest Neighbor Method
In the emerging field of speech processing and Automatic Speech Recognition (ASR), vowel perceptual space classification has a vital role for speech intelligibility. In this paper, formant based vowel perceptual space classification is implemented for Bangla vowels. A dataset of vowel signals for 50 speakers has been prepared. The first and second formants of vowels have been extracted from segmented recorded data of different speakers. These two formants have been employed to classify the Bangla vowels perceptual space. Two algorithms namely Support Vector Machine (SVM) and K-Nearest Neighbors (KNN) are used to classify the vowels perceptual space using formants. SVM linear kernel has turned up to be efficient with 84.3% classification accuracy and SVM radial basis function (rbf) kernel has shown to be 100% accurate. KNN has exhibited maximum of 95% classification accuracy.