测量人类声音的独特性

2017 IEEE Workshop on Information Forensics and Security (WIFS) Pub Date : 1900-01-01 DOI:10.1109/WIFS.2017.8267666

S. Tandogan, H. Sencar, B. Tavlı

{"title":"测量人类声音的独特性","authors":"S. Tandogan, H. Sencar, B. Tavlı","doi":"10.1109/WIFS.2017.8267666","DOIUrl":null,"url":null,"abstract":"The use of voice as a biometrie modality for user authentication and identification has grown very rapidly. It is therefore very important that we understand limitations of such systems which will ultimately depend on the discriminative power of the voice biometric. In this paper, we have contributed towards measuring distinctiveness of voice biometric by both formulating a new measure and creating a new dataset to perform more reliable measurements. For this purpose, we evaluate the prominent approaches in the field and propose a new approach that better incorporates within-user variability and is analytically more tractable. Our newly created dataset includes voice samples extracted from close to two thousand TED Talks videos. Overall our measurements on this dataset revealed a biometric information content of about 60 bits in human voice. Further, tests performed by adding some generic voice effects on the samples show that the distinctiveness reduces by almost 20 bits, implying that when true variability is reflected in user samples resulting entropy may further reduce.","PeriodicalId":305837,"journal":{"name":"2017 IEEE Workshop on Information Forensics and Security (WIFS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Towards measuring uniqueness of human voice\",\"authors\":\"S. Tandogan, H. Sencar, B. Tavlı\",\"doi\":\"10.1109/WIFS.2017.8267666\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of voice as a biometrie modality for user authentication and identification has grown very rapidly. It is therefore very important that we understand limitations of such systems which will ultimately depend on the discriminative power of the voice biometric. In this paper, we have contributed towards measuring distinctiveness of voice biometric by both formulating a new measure and creating a new dataset to perform more reliable measurements. For this purpose, we evaluate the prominent approaches in the field and propose a new approach that better incorporates within-user variability and is analytically more tractable. Our newly created dataset includes voice samples extracted from close to two thousand TED Talks videos. Overall our measurements on this dataset revealed a biometric information content of about 60 bits in human voice. Further, tests performed by adding some generic voice effects on the samples show that the distinctiveness reduces by almost 20 bits, implying that when true variability is reflected in user samples resulting entropy may further reduce.\",\"PeriodicalId\":305837,\"journal\":{\"name\":\"2017 IEEE Workshop on Information Forensics and Security (WIFS)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE Workshop on Information Forensics and Security (WIFS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WIFS.2017.8267666\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Workshop on Information Forensics and Security (WIFS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WIFS.2017.8267666","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

使用语音作为一种生物识别方式进行用户认证和身份识别的发展非常迅速。因此，我们理解这种系统的局限性是非常重要的，它最终将取决于声音生物识别的鉴别能力。在本文中，我们通过制定新的测量方法和创建新的数据集来执行更可靠的测量，为测量语音生物识别的独特性做出了贡献。为此，我们评估了该领域的主要方法，并提出了一种新的方法，该方法更好地结合了用户内部的可变性，并且在分析上更易于处理。我们新创建的数据集包括从近2000个TED演讲视频中提取的语音样本。总的来说，我们对这个数据集的测量揭示了人类声音中大约60位的生物特征信息内容。此外，通过在样本中添加一些通用语音效果进行的测试表明，独特性降低了近20位，这意味着当用户样本中反映出真正的可变性时，结果熵可能会进一步降低。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Towards measuring uniqueness of human voice

The use of voice as a biometrie modality for user authentication and identification has grown very rapidly. It is therefore very important that we understand limitations of such systems which will ultimately depend on the discriminative power of the voice biometric. In this paper, we have contributed towards measuring distinctiveness of voice biometric by both formulating a new measure and creating a new dataset to perform more reliable measurements. For this purpose, we evaluate the prominent approaches in the field and propose a new approach that better incorporates within-user variability and is analytically more tractable. Our newly created dataset includes voice samples extracted from close to two thousand TED Talks videos. Overall our measurements on this dataset revealed a biometric information content of about 60 bits in human voice. Further, tests performed by adding some generic voice effects on the samples show that the distinctiveness reduces by almost 20 bits, implying that when true variability is reflected in user samples resulting entropy may further reduce.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 IEEE Workshop on Information Forensics and Security (WIFS)

自引率

0.00%

发文量