伪标签质量对自监督说话人验证任务的影响分析

2023 11th International Workshop on Biometrics and Forensics (IWBF) Pub Date : 2023-04-19 DOI:10.1109/IWBF57495.2023.10157651

A. Fathan, J. Alam

{"title":"伪标签质量对自监督说话人验证任务的影响分析","authors":"A. Fathan, J. Alam","doi":"10.1109/IWBF57495.2023.10157651","DOIUrl":null,"url":null,"abstract":"One of the most widely used self-supervised (SS) speaker verification (SV) system training methods is to optimize the speaker embedding network in a discriminative fashion using clustering algorithm (CA)-driven Pseudo-Labels (PLs). Although the PL-based SS training scheme showed impressive performance, recent studies have shown that label noise can significantly impact performance. In this paper, we have explored various PLs driven by different CAs and conducted a fine-grained analysis of the relationship between the quality of the PLs and the SV performance. Experimentally, we shed light on several previously overlooked aspects of the PLs that can impact SV performance. Moreover, we could observe that the SS-SV performance is heavily dependent on multiple qualitative aspects of the CA used to generate the PLs. Furthermore, we show that SV performance can be severely degraded from overfitting the noisy PLs and that the mixup strategy can mitigate the memorization effects of label noise.","PeriodicalId":273412,"journal":{"name":"2023 11th International Workshop on Biometrics and Forensics (IWBF)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"On the influence of the quality of pseudo-labels on the self-supervised speaker verification task: a thorough analysis\",\"authors\":\"A. Fathan, J. Alam\",\"doi\":\"10.1109/IWBF57495.2023.10157651\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the most widely used self-supervised (SS) speaker verification (SV) system training methods is to optimize the speaker embedding network in a discriminative fashion using clustering algorithm (CA)-driven Pseudo-Labels (PLs). Although the PL-based SS training scheme showed impressive performance, recent studies have shown that label noise can significantly impact performance. In this paper, we have explored various PLs driven by different CAs and conducted a fine-grained analysis of the relationship between the quality of the PLs and the SV performance. Experimentally, we shed light on several previously overlooked aspects of the PLs that can impact SV performance. Moreover, we could observe that the SS-SV performance is heavily dependent on multiple qualitative aspects of the CA used to generate the PLs. Furthermore, we show that SV performance can be severely degraded from overfitting the noisy PLs and that the mixup strategy can mitigate the memorization effects of label noise.\",\"PeriodicalId\":273412,\"journal\":{\"name\":\"2023 11th International Workshop on Biometrics and Forensics (IWBF)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 11th International Workshop on Biometrics and Forensics (IWBF)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IWBF57495.2023.10157651\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 11th International Workshop on Biometrics and Forensics (IWBF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IWBF57495.2023.10157651","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

自监督说话人验证(SV)系统训练中应用最广泛的一种方法是利用聚类算法(CA)驱动的伪标签(PLs)以判别方式优化说话人嵌入网络。尽管基于pl的SS训练方案表现出令人印象深刻的性能，但最近的研究表明，标签噪声会显著影响性能。在本文中，我们探讨了由不同ca驱动的各种PLs，并对PLs质量与SV性能之间的关系进行了细致的分析。通过实验，我们揭示了几个以前被忽视的可能影响SV性能的PLs方面。此外，我们可以观察到，SS-SV的性能严重依赖于用于生成PLs的CA的多个定性方面。此外，我们表明，SV的性能可能会因过度拟合有噪声的PLs而严重下降，并且混合策略可以减轻标签噪声的记忆影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

On the influence of the quality of pseudo-labels on the self-supervised speaker verification task: a thorough analysis

One of the most widely used self-supervised (SS) speaker verification (SV) system training methods is to optimize the speaker embedding network in a discriminative fashion using clustering algorithm (CA)-driven Pseudo-Labels (PLs). Although the PL-based SS training scheme showed impressive performance, recent studies have shown that label noise can significantly impact performance. In this paper, we have explored various PLs driven by different CAs and conducted a fine-grained analysis of the relationship between the quality of the PLs and the SV performance. Experimentally, we shed light on several previously overlooked aspects of the PLs that can impact SV performance. Moreover, we could observe that the SS-SV performance is heavily dependent on multiple qualitative aspects of the CA used to generate the PLs. Furthermore, we show that SV performance can be severely degraded from overfitting the noisy PLs and that the mixup strategy can mitigate the memorization effects of label noise.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 11th International Workshop on Biometrics and Forensics (IWBF)

自引率

0.00%

发文量