{"title":"自动说话人验证的三种判别模型的比较","authors":"S. Slomka, P. Castellano, S. Sridharan","doi":"10.1109/ISSPA.1996.615747","DOIUrl":null,"url":null,"abstract":"Automatic Speaker Rewgnition (ASR) is composed of Automatic Speaker Identification (ASI) and Verification (ASV). In either case, it is a 4 step process consisting of speech data collection, preprocessing of the speech signal (enrolment), pattern matching and result adjudication. In the second ASR step, parametrised speech, representative of each speaker taken into account, in a given problem, is produced. The pattern matching step is performed through the use of a discrimination model which may consist of a single classifier or an architecture incorporating several classifiers. In closed set ASK the reference speaker whose speech most closely matches the unknown speech is retained. In ASV, a speaker is accepted only if matching exceeds a preset threshold. The present study compares the speaker discrimination performance of three speaker discrimination models which are the","PeriodicalId":359344,"journal":{"name":"Fourth International Symposium on Signal Processing and Its Applications","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Comparison of Three Discriminant Models for Automatic Speaker Verification\",\"authors\":\"S. Slomka, P. Castellano, S. Sridharan\",\"doi\":\"10.1109/ISSPA.1996.615747\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic Speaker Rewgnition (ASR) is composed of Automatic Speaker Identification (ASI) and Verification (ASV). In either case, it is a 4 step process consisting of speech data collection, preprocessing of the speech signal (enrolment), pattern matching and result adjudication. In the second ASR step, parametrised speech, representative of each speaker taken into account, in a given problem, is produced. The pattern matching step is performed through the use of a discrimination model which may consist of a single classifier or an architecture incorporating several classifiers. In closed set ASK the reference speaker whose speech most closely matches the unknown speech is retained. In ASV, a speaker is accepted only if matching exceeds a preset threshold. The present study compares the speaker discrimination performance of three speaker discrimination models which are the\",\"PeriodicalId\":359344,\"journal\":{\"name\":\"Fourth International Symposium on Signal Processing and Its Applications\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fourth International Symposium on Signal Processing and Its Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISSPA.1996.615747\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fourth International Symposium on Signal Processing and Its Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPA.1996.615747","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Comparison of Three Discriminant Models for Automatic Speaker Verification
Automatic Speaker Rewgnition (ASR) is composed of Automatic Speaker Identification (ASI) and Verification (ASV). In either case, it is a 4 step process consisting of speech data collection, preprocessing of the speech signal (enrolment), pattern matching and result adjudication. In the second ASR step, parametrised speech, representative of each speaker taken into account, in a given problem, is produced. The pattern matching step is performed through the use of a discrimination model which may consist of a single classifier or an architecture incorporating several classifiers. In closed set ASK the reference speaker whose speech most closely matches the unknown speech is retained. In ASV, a speaker is accepted only if matching exceeds a preset threshold. The present study compares the speaker discrimination performance of three speaker discrimination models which are the