Laishram Rahul, Salam Nandakishor, L. J. Singh, S. K. Dutta
{"title":"Design of Manipuri Keywords Spotting System using HMM","authors":"Laishram Rahul, Salam Nandakishor, L. J. Singh, S. K. Dutta","doi":"10.1109/NCVPRIPG.2013.6776249","DOIUrl":null,"url":null,"abstract":"This paper aims to discuss the implementation of phoneme based Manipuri Keyword Spotting System (MKWSS). Manipuri is a scheduled Indian language of Tibeto-Burman origin. Around 5 hours of read speech are collected from 4 male and 6 female speakers for development of database of MKWSS. The symbols of International Phonetic Alphabet (IPA)(revised in 2005) are used during the transcription of the data. A five state left to right Hidden Markov Model (HMM) with 32 mixture continuous density diagonal covariance Gaussian Mixture Model (GMM) per state is used to build a model for each phonetic unit. We have used HMM tool kit (HTK), version 3.4 for modeling the system. The system can recognize 29 phonemes and a non-speech event (silence) and will detect the present keywords formed by these phonemes. Continuous Speech data have been collected from 5 males and 8 females for analysing the performance of the system. The performance of the system depends on the ability of detection of the keywords. An overall performance of 65.24% is obtained from the phoneme based MKWSS.","PeriodicalId":436402,"journal":{"name":"2013 Fourth National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 Fourth National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCVPRIPG.2013.6776249","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
This paper aims to discuss the implementation of phoneme based Manipuri Keyword Spotting System (MKWSS). Manipuri is a scheduled Indian language of Tibeto-Burman origin. Around 5 hours of read speech are collected from 4 male and 6 female speakers for development of database of MKWSS. The symbols of International Phonetic Alphabet (IPA)(revised in 2005) are used during the transcription of the data. A five state left to right Hidden Markov Model (HMM) with 32 mixture continuous density diagonal covariance Gaussian Mixture Model (GMM) per state is used to build a model for each phonetic unit. We have used HMM tool kit (HTK), version 3.4 for modeling the system. The system can recognize 29 phonemes and a non-speech event (silence) and will detect the present keywords formed by these phonemes. Continuous Speech data have been collected from 5 males and 8 females for analysing the performance of the system. The performance of the system depends on the ability of detection of the keywords. An overall performance of 65.24% is obtained from the phoneme based MKWSS.