{"title":"An Adaptive Q Cochlear Filter in Phoneme Recognition","authors":"T. Hirahara","doi":"10.1109/ASPAA.1991.634091","DOIUrl":null,"url":null,"abstract":"Introduction It has been expected that speech recognition performance can be improved by replacing a traditional front-end with a model of the auditory periphery. The underlying assumption is that if a model could be designed properly, it should generate a more efficient representation compared to traditional physical spectrum representations. From this viewpoint, several works have been reported [11-[41. However, these studies do not always show an auditory front-end to be superior to a traditional front-end. Some auditory front-ends are superior only for noisy speech, but many show little, if any, superiority in processing clean speech. We also have been developing an auditory model characterized by an adaptive Q cochlear filter not only for the front-end of a speech recognition system but also for a general purpose spectral1 analyzer in speech research [SI. In this paper, several auditory front-ends based on the adaptive Q cochlear filter and its relatives are tested in speaker dependent phoneme recognition using different stochastic pattern classifier!;, a shift invariant multi template matching system using LVQ2-trained codebook, and a VQ-HMM system. Further, we will discuss problems of using an auditory model as a frontend of an automatic speech recognition system. 2. Adaptive Q Cochlear Filter An adaptive Q cochlear filter (AQF) is a computational filter that functionally simulates the nonlinear filtering characteristics of the basilar membrane vibrating system. The AQF consists of three parts: (1) cascaded second-order notch filters (NOTCH), (2) second-order band pass filters (BPF) connected to each NOTCH output, and (3) adaptive Q circuits connected to each BPF output. The adaptive Q circuit consists of a second-order low-pass filter (LPF) in","PeriodicalId":146017,"journal":{"name":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASPAA.1991.634091","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction It has been expected that speech recognition performance can be improved by replacing a traditional front-end with a model of the auditory periphery. The underlying assumption is that if a model could be designed properly, it should generate a more efficient representation compared to traditional physical spectrum representations. From this viewpoint, several works have been reported [11-[41. However, these studies do not always show an auditory front-end to be superior to a traditional front-end. Some auditory front-ends are superior only for noisy speech, but many show little, if any, superiority in processing clean speech. We also have been developing an auditory model characterized by an adaptive Q cochlear filter not only for the front-end of a speech recognition system but also for a general purpose spectral1 analyzer in speech research [SI. In this paper, several auditory front-ends based on the adaptive Q cochlear filter and its relatives are tested in speaker dependent phoneme recognition using different stochastic pattern classifier!;, a shift invariant multi template matching system using LVQ2-trained codebook, and a VQ-HMM system. Further, we will discuss problems of using an auditory model as a frontend of an automatic speech recognition system. 2. Adaptive Q Cochlear Filter An adaptive Q cochlear filter (AQF) is a computational filter that functionally simulates the nonlinear filtering characteristics of the basilar membrane vibrating system. The AQF consists of three parts: (1) cascaded second-order notch filters (NOTCH), (2) second-order band pass filters (BPF) connected to each NOTCH output, and (3) adaptive Q circuits connected to each BPF output. The adaptive Q circuit consists of a second-order low-pass filter (LPF) in