{"title":"连接模型与传统模型相结合的文本独立说话人识别系统","authors":"Younès Bennani","doi":"10.1109/NNSP.1992.253700","DOIUrl":null,"url":null,"abstract":"Several techniques have been used for speaker identification which have different characteristics and capabilities. The respective merits of three different systems respectively employing neural networks, hidden Markov models, and multivariate autoregressive models are compared. A novel text-independent speaker identification system based on the cooperation of these different techniques is presented. This system outperforms previous models and can handle a large number of speakers. It is argued that modular architectures present significant advantages, such as their learning speed, their generalization and representation capabilities, and their ability to satisfy constraints imposed by hardware limitations.<<ETX>>","PeriodicalId":438250,"journal":{"name":"Neural Networks for Signal Processing II Proceedings of the 1992 IEEE Workshop","volume":"39 992 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1992-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Text-independent talker identification system combining connectionist and conventional models\",\"authors\":\"Younès Bennani\",\"doi\":\"10.1109/NNSP.1992.253700\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Several techniques have been used for speaker identification which have different characteristics and capabilities. The respective merits of three different systems respectively employing neural networks, hidden Markov models, and multivariate autoregressive models are compared. A novel text-independent speaker identification system based on the cooperation of these different techniques is presented. This system outperforms previous models and can handle a large number of speakers. It is argued that modular architectures present significant advantages, such as their learning speed, their generalization and representation capabilities, and their ability to satisfy constraints imposed by hardware limitations.<<ETX>>\",\"PeriodicalId\":438250,\"journal\":{\"name\":\"Neural Networks for Signal Processing II Proceedings of the 1992 IEEE Workshop\",\"volume\":\"39 992 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1992-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neural Networks for Signal Processing II Proceedings of the 1992 IEEE Workshop\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NNSP.1992.253700\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks for Signal Processing II Proceedings of the 1992 IEEE Workshop","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NNSP.1992.253700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Text-independent talker identification system combining connectionist and conventional models
Several techniques have been used for speaker identification which have different characteristics and capabilities. The respective merits of three different systems respectively employing neural networks, hidden Markov models, and multivariate autoregressive models are compared. A novel text-independent speaker identification system based on the cooperation of these different techniques is presented. This system outperforms previous models and can handle a large number of speakers. It is argued that modular architectures present significant advantages, such as their learning speed, their generalization and representation capabilities, and their ability to satisfy constraints imposed by hardware limitations.<>