D. Vásquez, Guillermo Aradilla, R. Gruhn, W. Minker
{"title":"On speeding phoneme recognition in a hierarchical MLP structure","authors":"D. Vásquez, Guillermo Aradilla, R. Gruhn, W. Minker","doi":"10.1109/ASRU.2009.5373278","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a technique for speeding phoneme recognition in a hierarchical structure involving multilayered perceptrons (MLPs). The hierarchical structure consists of two MLP-based layers, where the output of the first layer is used as input for the second layer. In this paper, we efficiently speed up the system by removing the redundant information contained at the output of the first layer. Several techniques are investigated for removing this redundant information based on temporal and phonetic criteria. The best approach reduces the computational time by 57% while keeping a system accuracy comparable to the standard hierarchical approach. This scheme favors the implementation of such hierarchical structures in real-time applications.","PeriodicalId":292194,"journal":{"name":"2009 IEEE Workshop on Automatic Speech Recognition & Understanding","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE Workshop on Automatic Speech Recognition & Understanding","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASRU.2009.5373278","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we propose a technique for speeding phoneme recognition in a hierarchical structure involving multilayered perceptrons (MLPs). The hierarchical structure consists of two MLP-based layers, where the output of the first layer is used as input for the second layer. In this paper, we efficiently speed up the system by removing the redundant information contained at the output of the first layer. Several techniques are investigated for removing this redundant information based on temporal and phonetic criteria. The best approach reduces the computational time by 57% while keeping a system accuracy comparable to the standard hierarchical approach. This scheme favors the implementation of such hierarchical structures in real-time applications.