Text-independent talker identification system combining connectionist and conventional models

Neural Networks for Signal Processing II: Proceedings of the 1992 IEEE Workshop. Pub Date: 1992-08-31. DOI: 10.1109/NNSP.1992.253700

Younès Bennani

Citations: 10

Abstract

Text-independent talker identification system combining connectionist and conventional models

Several techniques have been used for speaker identification which have different characteristics and capabilities. The respective merits of three different systems respectively employing neural networks, hidden Markov models, and multivariate autoregressive models are compared. A novel text-independent speaker identification system based on the cooperation of these different techniques is presented. This system outperforms previous models and can handle a large number of speakers. It is argued that modular architectures present significant advantages, such as their learning speed, their generalization and representation capabilities, and their ability to satisfy constraints imposed by hardware limitations.
