Title: Text independent language recognition system using DHMM with new features
Authors: M. Sadanandam, V. Prasad, V. Janaki, A. Nagesh
Published in: 2012 IEEE 11th International Conference on Signal Processing (2012-10-01)
DOI: 10.1109/ICOSP.2012.6491537
Citation count: 6
Abstract
Spoken language identification (LID) is the task of recognizing the language of an unknown speech utterance. This paper describes a text-independent language recognition system that combines new features derived from the MFCC features of the speech signal with a common codebook and discrete hidden Markov models (DHMMs). It achieves very good LID performance with less computation time than the state-of-the-art phone-based systems reported in the literature. In this work, the MFCC feature vectors of the speech signal are transformed into new feature vectors. The approach consists of generating a common codebook from the new features and training one DHMM per language. Experiments are carried out on the OGI database and on an Indian-language database covering six languages: Telugu, Tamil, Hindi, Marathi, Malayalam and Kannada.
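
The recognition step of a codebook-plus-DHMM pipeline like the one summarized above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not specify the feature transformation, the codebook size, or the HMM topology, so the function names, the two-codeword codebook, and the two-state models below are all hypothetical. Training (for example, clustering for the codebook and Baum-Welch for each DHMM) is omitted, and the input frames stand in for the already-transformed feature vectors.

```python
import math


def quantize(frames, codebook):
    """Map each feature vector to the index of its nearest codeword
    (squared Euclidean distance) in the shared codebook."""
    symbols = []
    for frame in frames:
        dists = [sum((x - c) ** 2 for x, c in zip(frame, code)) for code in codebook]
        symbols.append(dists.index(min(dists)))
    return symbols


def log_likelihood(symbols, start, trans, emit):
    """Scaled forward algorithm for a discrete HMM.

    Returns log P(symbols | model), where start[i] is the initial state
    probability, trans[i][j] the transition probability, and emit[i][k]
    the probability of emitting codebook symbol k in state i.
    """
    if not symbols:
        raise ValueError("empty symbol sequence")
    n_states = len(start)
    alpha = [start[s] * emit[s][symbols[0]] for s in range(n_states)]
    log_prob = 0.0
    for t in range(len(symbols)):
        if t > 0:
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n_states)) * emit[j][symbols[t]]
                for j in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")
        log_prob += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return log_prob


def identify(frames, codebook, models):
    """Quantize the utterance once, score it under every language model,
    and return the best-scoring language together with all scores."""
    symbols = quantize(frames, codebook)
    scores = {lang: log_likelihood(symbols, *params) for lang, params in models.items()}
    return max(scores, key=scores.get), scores


# Toy data (hypothetical): a 2-codeword codebook and two 2-state DHMMs.
codebook = [[0.0, 0.0], [1.0, 1.0]]
shared_start = [0.5, 0.5]
shared_trans = [[0.9, 0.1], [0.1, 0.9]]
models = {
    "lang_a": (shared_start, shared_trans, [[0.9, 0.1], [0.8, 0.2]]),
    "lang_b": (shared_start, shared_trans, [[0.1, 0.9], [0.2, 0.8]]),
}
frames = [[0.1, -0.1], [0.0, 0.2], [0.9, 1.1], [0.1, 0.0]]

best, scores = identify(frames, codebook, models)
print(best, scores)
```

Because the codebook is shared, an utterance is quantized only once and the resulting symbol sequence is scored by every language model, which is one plausible reason such a design needs less computation than running a separate phone recognizer per language.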