{"title":"Experiments of GMM based speaker identification","authors":"P. Qi, Lu Wang","doi":"10.1109/URAI.2011.6145927","DOIUrl":null,"url":null,"abstract":"In human-robot interaction areas, the robot is often expected to recognize the identity of the speaker in some specific scenarios. It is a kind of biometric modality, and in general using statistical model is a classical and powerful method dealing with speaker identification problem. In this paper, we apply the Gaussian mixture model (GMM) on the speech feature distribution modeling and build the speaker identification system under MATLAB platform. Experiments are conducted on practical speech database and we also further give some insights into feature extraction, different length input utterances analysis and the impostor situation.","PeriodicalId":385925,"journal":{"name":"2011 8th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI)","volume":"484 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 8th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/URAI.2011.6145927","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
In human-robot interaction areas, the robot is often expected to recognize the identity of the speaker in some specific scenarios. It is a kind of biometric modality, and in general using statistical model is a classical and powerful method dealing with speaker identification problem. In this paper, we apply the Gaussian mixture model (GMM) on the speech feature distribution modeling and build the speaker identification system under MATLAB platform. Experiments are conducted on practical speech database and we also further give some insights into feature extraction, different length input utterances analysis and the impostor situation.