Melissa Ramlrez, M. Sotaquirá, Alberto De La Cruz, Esther Maria, G. Avellaneda, Ana Ochoa
{"title":"帮助视障儿童学习盲文的自动语音识别系统","authors":"Melissa Ramlrez, M. Sotaquirá, Alberto De La Cruz, Esther Maria, G. Avellaneda, Ana Ochoa","doi":"10.1109/STSIVA.2016.7743335","DOIUrl":null,"url":null,"abstract":"We present an automatic speech recognition (ASR) system which, along with a haptic interface, is aimed at helping preschool children to learn Braille. The ASR algorithm extracts a set of Mel-Frequency Cepstral Coefficients (MFCC) from the speech signal, followed by a Dynamic Time Warping (DTW) approach, thus allowing to recognize vowels pronounced by the user. The ASR algorithm was tested on 9 subjects and its sensitivity was measured in terms of the percentage of true positives. The highest accuracy values were obtained for the a, e, o and u vowels (with hit ratios of 88.8% in all cases), whereas the i vowel exhibited the lowest sensitivity (77.7%). Validation of user interaction with the haptic system is currently underway, and additional testing is needed to determine the potential benefits that this system offers in the context of preschool education in Colombia.","PeriodicalId":373420,"journal":{"name":"2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"An automatic speech recognition system for helping visually impaired children to learn Braille\",\"authors\":\"Melissa Ramlrez, M. Sotaquirá, Alberto De La Cruz, Esther Maria, G. Avellaneda, Ana Ochoa\",\"doi\":\"10.1109/STSIVA.2016.7743335\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present an automatic speech recognition (ASR) system which, along with a haptic interface, is aimed at helping preschool children to learn Braille. The ASR algorithm extracts a set of Mel-Frequency Cepstral Coefficients (MFCC) from the speech signal, followed by a Dynamic Time Warping (DTW) approach, thus allowing to recognize vowels pronounced by the user. The ASR algorithm was tested on 9 subjects and its sensitivity was measured in terms of the percentage of true positives. The highest accuracy values were obtained for the a, e, o and u vowels (with hit ratios of 88.8% in all cases), whereas the i vowel exhibited the lowest sensitivity (77.7%). Validation of user interaction with the haptic system is currently underway, and additional testing is needed to determine the potential benefits that this system offers in the context of preschool education in Colombia.\",\"PeriodicalId\":373420,\"journal\":{\"name\":\"2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/STSIVA.2016.7743335\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/STSIVA.2016.7743335","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An automatic speech recognition system for helping visually impaired children to learn Braille
We present an automatic speech recognition (ASR) system which, along with a haptic interface, is aimed at helping preschool children to learn Braille. The ASR algorithm extracts a set of Mel-Frequency Cepstral Coefficients (MFCC) from the speech signal, followed by a Dynamic Time Warping (DTW) approach, thus allowing to recognize vowels pronounced by the user. The ASR algorithm was tested on 9 subjects and its sensitivity was measured in terms of the percentage of true positives. The highest accuracy values were obtained for the a, e, o and u vowels (with hit ratios of 88.8% in all cases), whereas the i vowel exhibited the lowest sensitivity (77.7%). Validation of user interaction with the haptic system is currently underway, and additional testing is needed to determine the potential benefits that this system offers in the context of preschool education in Colombia.