{"title":"大规模说话人识别任务的资源约束HCRF建模","authors":"W. Hong","doi":"10.1109/GCCE.2016.7800517","DOIUrl":null,"url":null,"abstract":"This paper proposes an efficient algorithm on the training of hidden conditional random fields (HCRFs) for large-scale speaker recognition in which a speaker identification task with around 1000 speakers is investigated. HCRFs are a type of direct models in pattern recognition and thus iterative procedures are usually required to estimate the model parameters. The key method in this paper is to perform the optimization alternatively between the native direct models (HCRFs) and their approximated generative models (i.e., the equivalent HMMs of HCRFs). By this framework, a constrained optimization method is proposed which makes the training process of speaker models consumes less computation resources than the one by the conventional HCRF training scheme. The experimental results indicate that the proposed method enjoys potential benefits for development in resource-constrained speaker recognition.","PeriodicalId":416104,"journal":{"name":"2016 IEEE 5th Global Conference on Consumer Electronics","volume":"508 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A resource-constrained HCRF modeling for a large-scale speaker identification task\",\"authors\":\"W. Hong\",\"doi\":\"10.1109/GCCE.2016.7800517\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes an efficient algorithm on the training of hidden conditional random fields (HCRFs) for large-scale speaker recognition in which a speaker identification task with around 1000 speakers is investigated. HCRFs are a type of direct models in pattern recognition and thus iterative procedures are usually required to estimate the model parameters. The key method in this paper is to perform the optimization alternatively between the native direct models (HCRFs) and their approximated generative models (i.e., the equivalent HMMs of HCRFs). By this framework, a constrained optimization method is proposed which makes the training process of speaker models consumes less computation resources than the one by the conventional HCRF training scheme. The experimental results indicate that the proposed method enjoys potential benefits for development in resource-constrained speaker recognition.\",\"PeriodicalId\":416104,\"journal\":{\"name\":\"2016 IEEE 5th Global Conference on Consumer Electronics\",\"volume\":\"508 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE 5th Global Conference on Consumer Electronics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/GCCE.2016.7800517\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE 5th Global Conference on Consumer Electronics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GCCE.2016.7800517","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A resource-constrained HCRF modeling for a large-scale speaker identification task
This paper proposes an efficient algorithm on the training of hidden conditional random fields (HCRFs) for large-scale speaker recognition in which a speaker identification task with around 1000 speakers is investigated. HCRFs are a type of direct models in pattern recognition and thus iterative procedures are usually required to estimate the model parameters. The key method in this paper is to perform the optimization alternatively between the native direct models (HCRFs) and their approximated generative models (i.e., the equivalent HMMs of HCRFs). By this framework, a constrained optimization method is proposed which makes the training process of speaker models consumes less computation resources than the one by the conventional HCRF training scheme. The experimental results indicate that the proposed method enjoys potential benefits for development in resource-constrained speaker recognition.