A system for speaker detection and tracking in audio broadcast news
Authors: Dabbabi Karim, Chérif Adnen, Hajji Salah
Venue: 2017 International Conference on Engineering & MIS (ICEMIS)
Published: 2017-05-01
DOI: 10.1109/ICEMIS.2017.8272968 (https://doi.org/10.1109/ICEMIS.2017.8272968)
Citation count: 0
Abstract
This paper presents a system for speaker-based audio indexing and speaker tracking in broadcast news audio. Producing speaker-based indexing information for a continuous audio stream is treated as a multistage process made up of several tasks. The main building blocks of such an indexing system are audio segmentation, speaker detection, speaker clustering, and speaker identification. For the speaker identification stage, the proposed system considers three probabilistic linear discriminant analysis (PLDA) variants (standard, simplified, and two-covariance) as well as a Gaussian mixture model (GMM). The evaluation is performed on audio data from the broadcast news domain, and the results show that the two-covariance PLDA model outperforms the other algorithms considered.
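To make the two-covariance PLDA idea concrete, the following is a minimal sketch of how a pair of speaker feature vectors could be scored under that model. It is not the authors' implementation. It assumes the usual two-covariance formulation, in which a speaker mean is drawn from N(mu, B) and each observation is drawn from N(speaker mean, W). The function names (`gaussian_logpdf`, `two_cov_plda_llr`) and the toy values of `mu`, `B`, and `W` are hypothetical and chosen only for illustration. In practice these parameters would be estimated from training data.

```python
import numpy as np


def gaussian_logpdf(x, mean, cov):
    """Log density of a multivariate normal N(mean, cov) evaluated at x."""
    diff = x - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = diff @ np.linalg.solve(cov, diff)  # squared Mahalanobis distance
    return -0.5 * (x.size * np.log(2.0 * np.pi) + logdet + maha)


def two_cov_plda_llr(x1, x2, mu, B, W):
    """Log-likelihood ratio of 'same speaker' vs. 'different speakers'.

    Assumed model: speaker mean y ~ N(mu, B), observation x | y ~ N(y, W).
    B is the between-speaker covariance, W the within-speaker covariance.
    """
    T = B + W  # total covariance of a single observation
    stacked = np.concatenate([x1, x2])
    mean2 = np.concatenate([mu, mu])
    # Same speaker: both observations share y, so they co-vary through B.
    cov_same = np.block([[T, B], [B, T]])
    # Different speakers: independent speaker means, no cross-covariance.
    zeros = np.zeros_like(B)
    cov_diff = np.block([[T, zeros], [zeros, T]])
    return (gaussian_logpdf(stacked, mean2, cov_same)
            - gaussian_logpdf(stacked, mean2, cov_diff))


# Toy parameters (illustrative assumptions, not values from the paper).
mu = np.zeros(2)
B = 4.0 * np.eye(2)   # speakers differ a lot from one another
W = 0.5 * np.eye(2)   # each speaker's observations vary little

x_a = np.array([2.0, -1.0])
x_b = np.array([2.1, -0.9])   # close to x_a
x_c = np.array([-2.0, 1.0])   # far from x_a

llr_close = two_cov_plda_llr(x_a, x_b, mu, B, W)
llr_far = two_cov_plda_llr(x_a, x_c, mu, B, W)
print(llr_close, llr_far)
```

With these toy parameters the nearby pair receives a positive log-likelihood ratio and the distant pair a negative one. An identification stage could compare a test segment against each enrolled speaker in this way and pick the highest-scoring one.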