{"title":"Persian phoneme recognition using long short-term memory neural network","authors":"M. Daneshvar, H. Veisi","doi":"10.1109/IKT.2016.7777777","DOIUrl":null,"url":null,"abstract":"Recently Recurrent Neural Networks (RNNs) have shown impressive performance in sequence classification tasks. In this paper we apply Long Short-Term Memory (LSTM) network on Persian phoneme recognition. For years Hidden Markov Model (HMM) was the dominant technique in speech recognition system but after introducing LSTM, RNNs outperformed HHM-based methods. We apply LSTM and deep LSTM on FARSDAT speech database and find that both LSTM and deep LSTM outperforms HMM in Persian phoneme recognition. Our evaluation show that deep LSTM achieves 17.55% error in FARSDAT phoneme recognition on test set which to our knowledge is the best recorded result.","PeriodicalId":205496,"journal":{"name":"2016 Eighth International Conference on Information and Knowledge Technology (IKT)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Eighth International Conference on Information and Knowledge Technology (IKT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IKT.2016.7777777","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Recently Recurrent Neural Networks (RNNs) have shown impressive performance in sequence classification tasks. In this paper we apply Long Short-Term Memory (LSTM) network on Persian phoneme recognition. For years Hidden Markov Model (HMM) was the dominant technique in speech recognition system but after introducing LSTM, RNNs outperformed HHM-based methods. We apply LSTM and deep LSTM on FARSDAT speech database and find that both LSTM and deep LSTM outperforms HMM in Persian phoneme recognition. Our evaluation show that deep LSTM achieves 17.55% error in FARSDAT phoneme recognition on test set which to our knowledge is the best recorded result.