Clayder Gonzalez-Cadenillas, Nils Murrugarra-Llerena
{"title":"基于低成本单片机的孤立词识别","authors":"Clayder Gonzalez-Cadenillas, Nils Murrugarra-Llerena","doi":"10.1109/SBESC.2013.28","DOIUrl":null,"url":null,"abstract":"Currently, the field of automatic speech recognition is being widely used in commercial electronic devices such as TVs, phones, game consoles and computers. Thus, this article presents an evaluation of different isolated words recognition techniques on an embedded system using in the microcontroller dsPIC30F4013. So, the feature extraction phase is based on an adaptation of the Mel-frequency Cepstral Coefficients (MFCC) and the automatic recognition phase is based on the following techniques: Dynamic Time Warping (DTW), Artificial Neural Networks (ANN) and Principal Component Analysis (PCA). Related to the experiments setup, voice commands were evaluated in 3 different scenarios and the best accuracy rate was reached by a combination of PCA and ANN. It is also important to note that this implementation was carried out with the capacity constraints of the mentioned circuit.","PeriodicalId":359419,"journal":{"name":"2013 III Brazilian Symposium on Computing Systems Engineering","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Isolated Words Recognition Using a Low Cost Microcontroller\",\"authors\":\"Clayder Gonzalez-Cadenillas, Nils Murrugarra-Llerena\",\"doi\":\"10.1109/SBESC.2013.28\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Currently, the field of automatic speech recognition is being widely used in commercial electronic devices such as TVs, phones, game consoles and computers. Thus, this article presents an evaluation of different isolated words recognition techniques on an embedded system using in the microcontroller dsPIC30F4013. So, the feature extraction phase is based on an adaptation of the Mel-frequency Cepstral Coefficients (MFCC) and the automatic recognition phase is based on the following techniques: Dynamic Time Warping (DTW), Artificial Neural Networks (ANN) and Principal Component Analysis (PCA). Related to the experiments setup, voice commands were evaluated in 3 different scenarios and the best accuracy rate was reached by a combination of PCA and ANN. It is also important to note that this implementation was carried out with the capacity constraints of the mentioned circuit.\",\"PeriodicalId\":359419,\"journal\":{\"name\":\"2013 III Brazilian Symposium on Computing Systems Engineering\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 III Brazilian Symposium on Computing Systems Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SBESC.2013.28\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 III Brazilian Symposium on Computing Systems Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SBESC.2013.28","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Isolated Words Recognition Using a Low Cost Microcontroller
Currently, the field of automatic speech recognition is being widely used in commercial electronic devices such as TVs, phones, game consoles and computers. Thus, this article presents an evaluation of different isolated words recognition techniques on an embedded system using in the microcontroller dsPIC30F4013. So, the feature extraction phase is based on an adaptation of the Mel-frequency Cepstral Coefficients (MFCC) and the automatic recognition phase is based on the following techniques: Dynamic Time Warping (DTW), Artificial Neural Networks (ANN) and Principal Component Analysis (PCA). Related to the experiments setup, voice commands were evaluated in 3 different scenarios and the best accuracy rate was reached by a combination of PCA and ANN. It is also important to note that this implementation was carried out with the capacity constraints of the mentioned circuit.