{"title":"A new speaker change detection method in a speaker identification system for two-speakers segmentation","authors":"M. Bazyar, R. Sudirman","doi":"10.1109/ISCAIE.2014.7010226","DOIUrl":null,"url":null,"abstract":"Speaker change detection is done in many speaker and speech identification applications that the speech is from two speakers. However, the standard metric-based methods performance is not suitable and stable owing to the amid window distance calculation stability. Therefore, a new method is proposed to improve the stability and enhance the performance of the system according to speakers' characteristics using between window correlations. Moreover, reference speaker models set that shows the space of the entire speaker model are trained in this approach. A metric is defined as the between window correlation of scores likelihood vectors versus the reference models. The Peak and Valley information and gender information are also used. In this paper, we look at telephone conversations where it is known a priori that there are two speakers, but the identity of the speakers is not known. Experiments over Farsdat Database show better performance In comparison with the GLR and the BIC approaches. This new approach has more effect rather than the GLR and the BIC approaches in the broader value of defined thresholds.","PeriodicalId":385258,"journal":{"name":"2014 IEEE Symposium on Computer Applications and Industrial Electronics (ISCAIE)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Symposium on Computer Applications and Industrial Electronics (ISCAIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCAIE.2014.7010226","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Speaker change detection is done in many speaker and speech identification applications that the speech is from two speakers. However, the standard metric-based methods performance is not suitable and stable owing to the amid window distance calculation stability. Therefore, a new method is proposed to improve the stability and enhance the performance of the system according to speakers' characteristics using between window correlations. Moreover, reference speaker models set that shows the space of the entire speaker model are trained in this approach. A metric is defined as the between window correlation of scores likelihood vectors versus the reference models. The Peak and Valley information and gender information are also used. In this paper, we look at telephone conversations where it is known a priori that there are two speakers, but the identity of the speakers is not known. Experiments over Farsdat Database show better performance In comparison with the GLR and the BIC approaches. This new approach has more effect rather than the GLR and the BIC approaches in the broader value of defined thresholds.