A biophysical model of the human cochlea for speech stimulus using STFT

L. Golipour, S. Gazor
{"title":"A biophysical model of the human cochlea for speech stimulus using STFT","authors":"L. Golipour, S. Gazor","doi":"10.1109/ISSPIT.2005.1577068","DOIUrl":null,"url":null,"abstract":"This paper presents a new approach to an auditory model which matches closely the response patterns in physiological data for single tone inputs. Including the biophysical complexity of the wave motion in the cochlea and considering all terms of the motion equation, the new model can evaluate the auditory spectrum using the Basilar membrane (BM) displacement and the inner hair cells' (IHCs) firing rate for all input signals, including speech. We employ the partial differential motion equations of the BM and its parameters measured for the human auditory system, and design an algorithm which uses short time Fourier transform (STFT) to compute the output for speech stimulus. The idea is to isolate the input signal in the vicinity of a time-window and try to follow the changes in its frequencies and their influences on the signal perceived by the auditory system. The new model includes the nonlinearity action of outer hair cells (OHCs) and provides a new auditory spectrum for speech inputs in the real time domain which reflects a proper view of propagating signal in the cochlea. Despite most of the previous models this model can track the effects of high formant frequencies in the human cochlea as well. This model is a new signal processing tool for studying the response of the auditory system to transient signals which is highly demanded in various speech enhancement and audio coding algorithms","PeriodicalId":421826,"journal":{"name":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","volume":"141 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPIT.2005.1577068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

This paper presents a new approach to an auditory model which matches closely the response patterns in physiological data for single tone inputs. Including the biophysical complexity of the wave motion in the cochlea and considering all terms of the motion equation, the new model can evaluate the auditory spectrum using the Basilar membrane (BM) displacement and the inner hair cells' (IHCs) firing rate for all input signals, including speech. We employ the partial differential motion equations of the BM and its parameters measured for the human auditory system, and design an algorithm which uses short time Fourier transform (STFT) to compute the output for speech stimulus. The idea is to isolate the input signal in the vicinity of a time-window and try to follow the changes in its frequencies and their influences on the signal perceived by the auditory system. The new model includes the nonlinearity action of outer hair cells (OHCs) and provides a new auditory spectrum for speech inputs in the real time domain which reflects a proper view of propagating signal in the cochlea. Despite most of the previous models this model can track the effects of high formant frequencies in the human cochlea as well. This model is a new signal processing tool for studying the response of the auditory system to transient signals which is highly demanded in various speech enhancement and audio coding algorithms
基于STFT的人耳蜗语音刺激生物物理模型
本文提出了一种新的听觉模型,该模型与单音输入的生理数据的反应模式密切匹配。考虑到耳蜗波运动的生物物理复杂性,并考虑运动方程的所有项,该模型可以利用基底膜(BM)位移和内毛细胞(IHCs)放电率对包括语音在内的所有输入信号进行听觉频谱评估。利用人类听觉系统中脑机的偏微分运动方程及其测量参数,设计了一种利用短时傅立叶变换(STFT)计算语音刺激输出的算法。这个想法是在时间窗口附近隔离输入信号,并尝试跟踪其频率的变化及其对听觉系统感知的信号的影响。该模型包含了外毛细胞的非线性作用,为语音输入提供了一种新的实时听觉频谱,反映了信号在耳蜗内传播的正确观点。尽管大多数以前的模型,这个模型可以跟踪高峰频率在人类耳蜗的影响。该模型是一种新的信号处理工具,用于研究听觉系统对瞬态信号的响应,这在各种语音增强和音频编码算法中都有很高的要求
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信