A. Aibinu, M. Salami, Athaur Rahman Najeeb, J. F. Azeez, S. Rajin
{"title":"评价语音活动检测在孤立约鲁巴语单词识别系统中的效果","authors":"A. Aibinu, M. Salami, Athaur Rahman Najeeb, J. F. Azeez, S. Rajin","doi":"10.1109/ICOM.2011.5937134","DOIUrl":null,"url":null,"abstract":"This paper discusses and evaluates the effect of voice Activity Detection (VAD) in an isolated Yoruba word recognition system (IYWRS). The word database used in this paper are collected from 22 speakers by repeating the numbers 1 to 9 three times each. A hybrid configuration of Mel-Frequency Cepstral coefficient (MFCC) and Linear Predictive Coding (LPC) have been used to extract the features of the speech samples. Artificial Neural Network algorithms are then used to classify these features. An overall accuracy of about 60% has been achieved from the two proposed feature extraction methods.","PeriodicalId":376337,"journal":{"name":"2011 4th International Conference on Mechatronics (ICOM)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Evaluating the effect of voice activity detection in isolated Yoruba word recognition system\",\"authors\":\"A. Aibinu, M. Salami, Athaur Rahman Najeeb, J. F. Azeez, S. Rajin\",\"doi\":\"10.1109/ICOM.2011.5937134\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper discusses and evaluates the effect of voice Activity Detection (VAD) in an isolated Yoruba word recognition system (IYWRS). The word database used in this paper are collected from 22 speakers by repeating the numbers 1 to 9 three times each. A hybrid configuration of Mel-Frequency Cepstral coefficient (MFCC) and Linear Predictive Coding (LPC) have been used to extract the features of the speech samples. Artificial Neural Network algorithms are then used to classify these features. An overall accuracy of about 60% has been achieved from the two proposed feature extraction methods.\",\"PeriodicalId\":376337,\"journal\":{\"name\":\"2011 4th International Conference on Mechatronics (ICOM)\",\"volume\":\"83 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-05-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 4th International Conference on Mechatronics (ICOM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOM.2011.5937134\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 4th International Conference on Mechatronics (ICOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOM.2011.5937134","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Evaluating the effect of voice activity detection in isolated Yoruba word recognition system
This paper discusses and evaluates the effect of voice Activity Detection (VAD) in an isolated Yoruba word recognition system (IYWRS). The word database used in this paper are collected from 22 speakers by repeating the numbers 1 to 9 three times each. A hybrid configuration of Mel-Frequency Cepstral coefficient (MFCC) and Linear Predictive Coding (LPC) have been used to extract the features of the speech samples. Artificial Neural Network algorithms are then used to classify these features. An overall accuracy of about 60% has been achieved from the two proposed feature extraction methods.