帮助视障儿童学习盲文的自动语音识别系统

Melissa Ramlrez, M. Sotaquirá, Alberto De La Cruz, Esther Maria, G. Avellaneda, Ana Ochoa
{"title":"帮助视障儿童学习盲文的自动语音识别系统","authors":"Melissa Ramlrez, M. Sotaquirá, Alberto De La Cruz, Esther Maria, G. Avellaneda, Ana Ochoa","doi":"10.1109/STSIVA.2016.7743335","DOIUrl":null,"url":null,"abstract":"We present an automatic speech recognition (ASR) system which, along with a haptic interface, is aimed at helping preschool children to learn Braille. The ASR algorithm extracts a set of Mel-Frequency Cepstral Coefficients (MFCC) from the speech signal, followed by a Dynamic Time Warping (DTW) approach, thus allowing to recognize vowels pronounced by the user. The ASR algorithm was tested on 9 subjects and its sensitivity was measured in terms of the percentage of true positives. The highest accuracy values were obtained for the a, e, o and u vowels (with hit ratios of 88.8% in all cases), whereas the i vowel exhibited the lowest sensitivity (77.7%). Validation of user interaction with the haptic system is currently underway, and additional testing is needed to determine the potential benefits that this system offers in the context of preschool education in Colombia.","PeriodicalId":373420,"journal":{"name":"2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"An automatic speech recognition system for helping visually impaired children to learn Braille\",\"authors\":\"Melissa Ramlrez, M. Sotaquirá, Alberto De La Cruz, Esther Maria, G. Avellaneda, Ana Ochoa\",\"doi\":\"10.1109/STSIVA.2016.7743335\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present an automatic speech recognition (ASR) system which, along with a haptic interface, is aimed at helping preschool children to learn Braille. The ASR algorithm extracts a set of Mel-Frequency Cepstral Coefficients (MFCC) from the speech signal, followed by a Dynamic Time Warping (DTW) approach, thus allowing to recognize vowels pronounced by the user. The ASR algorithm was tested on 9 subjects and its sensitivity was measured in terms of the percentage of true positives. The highest accuracy values were obtained for the a, e, o and u vowels (with hit ratios of 88.8% in all cases), whereas the i vowel exhibited the lowest sensitivity (77.7%). Validation of user interaction with the haptic system is currently underway, and additional testing is needed to determine the potential benefits that this system offers in the context of preschool education in Colombia.\",\"PeriodicalId\":373420,\"journal\":{\"name\":\"2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/STSIVA.2016.7743335\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/STSIVA.2016.7743335","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

摘要

我们提出了一种自动语音识别(ASR)系统,该系统与触觉界面一起,旨在帮助学龄前儿童学习盲文。ASR算法从语音信号中提取一组Mel-Frequency Cepstral系数(MFCC),然后采用动态时间扭曲(DTW)方法,从而允许识别用户发音的元音。对9名受试者进行ASR算法测试,并以真阳性百分比衡量其敏感性。a、e、o和u元音的准确率最高(88.8%),而i元音的灵敏度最低(77.7%)。用户与触觉系统交互的验证目前正在进行中,需要进行额外的测试,以确定该系统在哥伦比亚学前教育背景下提供的潜在好处。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An automatic speech recognition system for helping visually impaired children to learn Braille
We present an automatic speech recognition (ASR) system which, along with a haptic interface, is aimed at helping preschool children to learn Braille. The ASR algorithm extracts a set of Mel-Frequency Cepstral Coefficients (MFCC) from the speech signal, followed by a Dynamic Time Warping (DTW) approach, thus allowing to recognize vowels pronounced by the user. The ASR algorithm was tested on 9 subjects and its sensitivity was measured in terms of the percentage of true positives. The highest accuracy values were obtained for the a, e, o and u vowels (with hit ratios of 88.8% in all cases), whereas the i vowel exhibited the lowest sensitivity (77.7%). Validation of user interaction with the haptic system is currently underway, and additional testing is needed to determine the potential benefits that this system offers in the context of preschool education in Colombia.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信