N. Jakovljević, Tijana Delic, Simona V. Etinski, D. Mišković, T. Lončar-Turukalo
{"title":"基于PLDA和深度神经网络的多目标说话人检测与识别系统","authors":"N. Jakovljević, Tijana Delic, Simona V. Etinski, D. Mišković, T. Lončar-Turukalo","doi":"10.1109/TELFOR.2018.8612052","DOIUrl":null,"url":null,"abstract":"The paper describes a multi-target speaker detection and identification system based on a fusion of probabilistic linear discriminant analysis (PLDA) and deep neural network (DNN). PLDA is the state-of-the-art approach used in speaker recognition, thus we selected it as our baseline. We tried to develop a DNN based approach, that would be more accurate than the baseline, but only better discrimination between blacklist and background speakers was achieved. The fusion of PLDA and DNN improved performance of the baseline system.","PeriodicalId":229131,"journal":{"name":"2018 26th Telecommunications Forum (TELFOR)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Multi-Target Speaker Detection and Identification System Based on Combination of PLDA and DNN\",\"authors\":\"N. Jakovljević, Tijana Delic, Simona V. Etinski, D. Mišković, T. Lončar-Turukalo\",\"doi\":\"10.1109/TELFOR.2018.8612052\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper describes a multi-target speaker detection and identification system based on a fusion of probabilistic linear discriminant analysis (PLDA) and deep neural network (DNN). PLDA is the state-of-the-art approach used in speaker recognition, thus we selected it as our baseline. We tried to develop a DNN based approach, that would be more accurate than the baseline, but only better discrimination between blacklist and background speakers was achieved. The fusion of PLDA and DNN improved performance of the baseline system.\",\"PeriodicalId\":229131,\"journal\":{\"name\":\"2018 26th Telecommunications Forum (TELFOR)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 26th Telecommunications Forum (TELFOR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TELFOR.2018.8612052\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 26th Telecommunications Forum (TELFOR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TELFOR.2018.8612052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Multi-Target Speaker Detection and Identification System Based on Combination of PLDA and DNN
The paper describes a multi-target speaker detection and identification system based on a fusion of probabilistic linear discriminant analysis (PLDA) and deep neural network (DNN). PLDA is the state-of-the-art approach used in speaker recognition, thus we selected it as our baseline. We tried to develop a DNN based approach, that would be more accurate than the baseline, but only better discrimination between blacklist and background speakers was achieved. The fusion of PLDA and DNN improved performance of the baseline system.