{"title":"使用本地说话人建模更快的BIC分割","authors":"R. Travadi, G. Saha","doi":"10.1109/NCC.2012.6176884","DOIUrl":null,"url":null,"abstract":"Segmentation is typically the most computationally expensive step involved in majority of speaker diarization systems. Bayesian Information Criterion (BIC) is a very widely adopted method for segmentation of audio data. While BIC returns fairly good results in terms of segmentation performance, it suffers from the problem of enormous complexity. Moreover, BIC based diarization systems encounter the worst case complexity when there is no change point in the input audio stream at all. Many audio streams contain fairly large segments separated by a very few change points. In such cases, it becomes impractical to employ BIC segmentation because of its complexity. In this paper, we have proposed a modification to the baseline BIC segmentation scheme, which makes use of local search information to reduce the overall complexity of the segmentation procedure. The results have been tested on several audio streams from broadcast news and the diarization runtime has been found to get reduced by a factor of 3.45, with a marginally better segmentation performance.","PeriodicalId":178278,"journal":{"name":"2012 National Conference on Communications (NCC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Faster BIC segmentation using local speaker modeling\",\"authors\":\"R. Travadi, G. Saha\",\"doi\":\"10.1109/NCC.2012.6176884\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Segmentation is typically the most computationally expensive step involved in majority of speaker diarization systems. Bayesian Information Criterion (BIC) is a very widely adopted method for segmentation of audio data. While BIC returns fairly good results in terms of segmentation performance, it suffers from the problem of enormous complexity. Moreover, BIC based diarization systems encounter the worst case complexity when there is no change point in the input audio stream at all. Many audio streams contain fairly large segments separated by a very few change points. In such cases, it becomes impractical to employ BIC segmentation because of its complexity. In this paper, we have proposed a modification to the baseline BIC segmentation scheme, which makes use of local search information to reduce the overall complexity of the segmentation procedure. The results have been tested on several audio streams from broadcast news and the diarization runtime has been found to get reduced by a factor of 3.45, with a marginally better segmentation performance.\",\"PeriodicalId\":178278,\"journal\":{\"name\":\"2012 National Conference on Communications (NCC)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-04-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 National Conference on Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC.2012.6176884\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2012.6176884","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Faster BIC segmentation using local speaker modeling
Segmentation is typically the most computationally expensive step involved in majority of speaker diarization systems. Bayesian Information Criterion (BIC) is a very widely adopted method for segmentation of audio data. While BIC returns fairly good results in terms of segmentation performance, it suffers from the problem of enormous complexity. Moreover, BIC based diarization systems encounter the worst case complexity when there is no change point in the input audio stream at all. Many audio streams contain fairly large segments separated by a very few change points. In such cases, it becomes impractical to employ BIC segmentation because of its complexity. In this paper, we have proposed a modification to the baseline BIC segmentation scheme, which makes use of local search information to reduce the overall complexity of the segmentation procedure. The results have been tested on several audio streams from broadcast news and the diarization runtime has been found to get reduced by a factor of 3.45, with a marginally better segmentation performance.