LabVIEW and digital signal processor implementation of a channel vocoder based model of a cochlear implant

G. Rachel, S. J. J. Singh, P. Vijayalakshmi
{"title":"LabVIEW and digital signal processor implementation of a channel vocoder based model of a cochlear implant","authors":"G. Rachel, S. J. J. Singh, P. Vijayalakshmi","doi":"10.1109/ICRTIT.2013.6844195","DOIUrl":null,"url":null,"abstract":"A cochlear implant is a prosthetic device used to mimic the function of a cochlea in a person with profound and bilateral hearing loss caused by a damaged inner ear. The current work revolves around the design of real time channel vocoder based model of a cochlear implant in LabVIEW and the TMS320C6713 DSK. First, a uniform band 16-channel vocoder is designed for the analysis and synthesis of English vowels, where filters of 400 Hz bandwidth, with cut off frequencies up to 6200Hz are used, based on MATLAB analysis performed previously. To extend the analysis to words and sentences, short time features, namely, short time energy, short time zero crossing rate and main lobe width of the short time autocorrelation function, are extracted and Gaussian Mixture Modelling (GMM) is used to classify the speech segments as voiced or unvoiced. In an attempt to make the synthetic speech sound natural, the synthesis in the channel vocoder is done using a train of glottal pulses instead of a train of impulses. The intelligibility of the synthetic speech is measured by the Mean Opinion Score (MOS). For a channel vocoder where the synthesis section uses a train of glottal pulses, an MOS of 3.6 is obtained as against 3.5 when a train of impulses is used. The lab model of the cochlear implant, that is, the analysis section of the 16-channel vocoder is then realised in the TMS320C6713 DSK. Individual channel outputs are obtained by programming the DIP switches of the DSK and a DSK_AUDIO16_BASE, a 16-channel audio daughter card, is interfaced with the DSK to obtain the outputs from multiple channels simultaneously.","PeriodicalId":113531,"journal":{"name":"2013 International Conference on Recent Trends in Information Technology (ICRTIT)","volume":"39 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Recent Trends in Information Technology (ICRTIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRTIT.2013.6844195","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

A cochlear implant is a prosthetic device used to mimic the function of a cochlea in a person with profound and bilateral hearing loss caused by a damaged inner ear. The current work revolves around the design of real time channel vocoder based model of a cochlear implant in LabVIEW and the TMS320C6713 DSK. First, a uniform band 16-channel vocoder is designed for the analysis and synthesis of English vowels, where filters of 400 Hz bandwidth, with cut off frequencies up to 6200Hz are used, based on MATLAB analysis performed previously. To extend the analysis to words and sentences, short time features, namely, short time energy, short time zero crossing rate and main lobe width of the short time autocorrelation function, are extracted and Gaussian Mixture Modelling (GMM) is used to classify the speech segments as voiced or unvoiced. In an attempt to make the synthetic speech sound natural, the synthesis in the channel vocoder is done using a train of glottal pulses instead of a train of impulses. The intelligibility of the synthetic speech is measured by the Mean Opinion Score (MOS). For a channel vocoder where the synthesis section uses a train of glottal pulses, an MOS of 3.6 is obtained as against 3.5 when a train of impulses is used. The lab model of the cochlear implant, that is, the analysis section of the 16-channel vocoder is then realised in the TMS320C6713 DSK. Individual channel outputs are obtained by programming the DIP switches of the DSK and a DSK_AUDIO16_BASE, a 16-channel audio daughter card, is interfaced with the DSK to obtain the outputs from multiple channels simultaneously.
LabVIEW和数字信号处理器实现了一种基于通道声码器的人工耳蜗模型
人工耳蜗是一种假体装置,用于模仿由内耳受损引起的重度和双侧听力损失的人的耳蜗功能。目前的工作围绕着基于LabVIEW和TMS320C6713 DSK的人工耳蜗实时通道声码器模型的设计展开。首先,基于之前的MATLAB分析,设计了一个统一频带的16通道声码器,用于分析和合成英语元音,其中滤波器带宽为400 Hz,截止频率高达6200Hz。为了将分析扩展到单词和句子,提取短时间特征,即短时间能量、短时间过零率和短时间自相关函数的主瓣宽度,并使用高斯混合建模(GMM)将语音片段分类为浊音或非浊音。为了使合成语音听起来自然,通道声码器中的合成使用声门脉冲序列而不是脉冲序列来完成。合成语音的可理解性用平均意见评分(Mean Opinion Score, MOS)来衡量。对于通道声码器,其中合成部分使用声门脉冲序列,当使用脉冲序列时,获得3.6的MOS,而不是3.5。然后在TMS320C6713 DSK中实现人工耳蜗的实验室模型,即16通道声码器的分析部分。通过编程DSK的拨码开关获得单个通道输出,DSK_AUDIO16_BASE,一个16通道音频子卡,与DSK接口,同时获得多个通道的输出。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信