{"title":"自组织映射与联想记忆模型混合分类器的说话人识别","authors":"M. Inal, Y.S. Fatihoglu","doi":"10.1109/NEUREL.2002.1057970","DOIUrl":null,"url":null,"abstract":"In this study, self organizing map (SOM) and associative memory model (AMM) artificial neural networks (ANN) are used as hybrid classifier for several speaker recognition experiments. These include text dependent closed-set speaker identification and speaker verification of Turkish speaker set and text independent closed-set speaker identification of a subset of the TIMIT database. Turkish speaker set constitutes 10 speakers with their name and surname. Each utterance is repeated 8 times, 5 of them are used in training and. remaining in the test stages. The subset of the TIMIT database consists 38 speakers from New England region. Each speaker's 10 different utterances are equally selected for using in training and test session. Mel frequency cepstral coefficients (MFCC) method is used for feature extraction of the training and test vectors. When the study is compared with different studies for the same databases, this study gives good results as much as the others.","PeriodicalId":347066,"journal":{"name":"6th Seminar on Neural Network Applications in Electrical Engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2002-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Self organizing map and associative memory model hybrid classifier for speaker recognition\",\"authors\":\"M. Inal, Y.S. Fatihoglu\",\"doi\":\"10.1109/NEUREL.2002.1057970\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this study, self organizing map (SOM) and associative memory model (AMM) artificial neural networks (ANN) are used as hybrid classifier for several speaker recognition experiments. These include text dependent closed-set speaker identification and speaker verification of Turkish speaker set and text independent closed-set speaker identification of a subset of the TIMIT database. Turkish speaker set constitutes 10 speakers with their name and surname. Each utterance is repeated 8 times, 5 of them are used in training and. remaining in the test stages. The subset of the TIMIT database consists 38 speakers from New England region. Each speaker's 10 different utterances are equally selected for using in training and test session. Mel frequency cepstral coefficients (MFCC) method is used for feature extraction of the training and test vectors. When the study is compared with different studies for the same databases, this study gives good results as much as the others.\",\"PeriodicalId\":347066,\"journal\":{\"name\":\"6th Seminar on Neural Network Applications in Electrical Engineering\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-12-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"6th Seminar on Neural Network Applications in Electrical Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NEUREL.2002.1057970\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"6th Seminar on Neural Network Applications in Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NEUREL.2002.1057970","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Self organizing map and associative memory model hybrid classifier for speaker recognition
In this study, self organizing map (SOM) and associative memory model (AMM) artificial neural networks (ANN) are used as hybrid classifier for several speaker recognition experiments. These include text dependent closed-set speaker identification and speaker verification of Turkish speaker set and text independent closed-set speaker identification of a subset of the TIMIT database. Turkish speaker set constitutes 10 speakers with their name and surname. Each utterance is repeated 8 times, 5 of them are used in training and. remaining in the test stages. The subset of the TIMIT database consists 38 speakers from New England region. Each speaker's 10 different utterances are equally selected for using in training and test session. Mel frequency cepstral coefficients (MFCC) method is used for feature extraction of the training and test vectors. When the study is compared with different studies for the same databases, this study gives good results as much as the others.