自组织映射与联想记忆模型混合分类器的说话人识别

M. Inal, Y.S. Fatihoglu
{"title":"自组织映射与联想记忆模型混合分类器的说话人识别","authors":"M. Inal, Y.S. Fatihoglu","doi":"10.1109/NEUREL.2002.1057970","DOIUrl":null,"url":null,"abstract":"In this study, self organizing map (SOM) and associative memory model (AMM) artificial neural networks (ANN) are used as hybrid classifier for several speaker recognition experiments. These include text dependent closed-set speaker identification and speaker verification of Turkish speaker set and text independent closed-set speaker identification of a subset of the TIMIT database. Turkish speaker set constitutes 10 speakers with their name and surname. Each utterance is repeated 8 times, 5 of them are used in training and. remaining in the test stages. The subset of the TIMIT database consists 38 speakers from New England region. Each speaker's 10 different utterances are equally selected for using in training and test session. Mel frequency cepstral coefficients (MFCC) method is used for feature extraction of the training and test vectors. When the study is compared with different studies for the same databases, this study gives good results as much as the others.","PeriodicalId":347066,"journal":{"name":"6th Seminar on Neural Network Applications in Electrical Engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Self organizing map and associative memory model hybrid classifier for speaker recognition\",\"authors\":\"M. Inal, Y.S. Fatihoglu\",\"doi\":\"10.1109/NEUREL.2002.1057970\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this study, self organizing map (SOM) and associative memory model (AMM) artificial neural networks (ANN) are used as hybrid classifier for several speaker recognition experiments. These include text dependent closed-set speaker identification and speaker verification of Turkish speaker set and text independent closed-set speaker identification of a subset of the TIMIT database. Turkish speaker set constitutes 10 speakers with their name and surname. Each utterance is repeated 8 times, 5 of them are used in training and. remaining in the test stages. The subset of the TIMIT database consists 38 speakers from New England region. Each speaker's 10 different utterances are equally selected for using in training and test session. Mel frequency cepstral coefficients (MFCC) method is used for feature extraction of the training and test vectors. When the study is compared with different studies for the same databases, this study gives good results as much as the others.\",\"PeriodicalId\":347066,\"journal\":{\"name\":\"6th Seminar on Neural Network Applications in Electrical Engineering\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-12-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"6th Seminar on Neural Network Applications in Electrical Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NEUREL.2002.1057970\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"6th Seminar on Neural Network Applications in Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NEUREL.2002.1057970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15

摘要

本研究将自组织映射(SOM)和联想记忆模型(AMM)人工神经网络(ANN)作为混合分类器,进行了多个说话人识别实验。这些包括文本依赖的闭集说话人识别和土耳其语说话人集的说话人验证,以及TIMIT数据库子集的文本独立闭集说话人识别。土耳其语组由10个具有其姓名和姓氏的发言者组成。每句话重复8次,其中5次用于训练和。留在测试阶段。TIMIT数据库的子集包括来自新英格兰地区的38名发言者。每位演讲者的10种不同的话语被同等地选择用于训练和测试环节。采用Mel频率倒谱系数(MFCC)方法对训练向量和测试向量进行特征提取。当将该研究与相同数据库的不同研究进行比较时,该研究的结果与其他研究一样好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Self organizing map and associative memory model hybrid classifier for speaker recognition
In this study, self organizing map (SOM) and associative memory model (AMM) artificial neural networks (ANN) are used as hybrid classifier for several speaker recognition experiments. These include text dependent closed-set speaker identification and speaker verification of Turkish speaker set and text independent closed-set speaker identification of a subset of the TIMIT database. Turkish speaker set constitutes 10 speakers with their name and surname. Each utterance is repeated 8 times, 5 of them are used in training and. remaining in the test stages. The subset of the TIMIT database consists 38 speakers from New England region. Each speaker's 10 different utterances are equally selected for using in training and test session. Mel frequency cepstral coefficients (MFCC) method is used for feature extraction of the training and test vectors. When the study is compared with different studies for the same databases, this study gives good results as much as the others.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信