Towards measuring uniqueness of human voice
S. Tandogan, H. Sencar, B. Tavlı
2017 IEEE Workshop on Information Forensics and Security (WIFS)
DOI: 10.1109/WIFS.2017.8267666 (https://doi.org/10.1109/WIFS.2017.8267666)
The use of voice as a biometric modality for user authentication and identification has grown rapidly. It is therefore important to understand the limitations of such systems, which ultimately depend on the discriminative power of the voice biometric. In this paper, we contribute towards measuring the distinctiveness of the voice biometric by formulating a new measure and by creating a new dataset that supports more reliable measurements. To this end, we evaluate the prominent approaches in the field and propose a new approach that better incorporates within-user variability and is analytically more tractable. Our newly created dataset includes voice samples extracted from close to two thousand TED Talks videos. Overall, our measurements on this dataset reveal a biometric information content of about 60 bits in the human voice. Further tests, performed after applying some generic voice effects to the samples, show that the distinctiveness drops by almost 20 bits, implying that the resulting entropy may decrease further when the true variability of user samples is reflected.
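
To give a rough sense of scale for the reported figures, the sketch below converts an information content in bits into an equivalent number of equally likely identities and a birthday-style collision probability. This is only an idealized reading, not the measure proposed in the paper: it assumes each user behaves like an independent uniform draw from 2^bits outcomes, takes 40 bits as a stand-in for "about 60 bits reduced by almost 20", and uses a hypothetical population of one million users. The function names are illustrative.

```python
import math


def equivalent_identities(bits: float) -> float:
    """Number of equally likely outcomes that carry `bits` bits of information."""
    return 2.0 ** bits


def birthday_collision_probability(bits: float, population: int) -> float:
    """Approximate chance that at least two of `population` users coincide.

    Assumption (not from the paper): every user is an independent uniform
    draw from 2**bits outcomes. Uses the standard birthday approximation
    1 - exp(-k(k-1) / (2N)).
    """
    n = equivalent_identities(bits)
    exponent = population * (population - 1) / (2.0 * n)
    # expm1 keeps precision when the probability is very small.
    return -math.expm1(-exponent)


# 60 bits: the figure reported in the abstract.
# 40 bits: assumed value after a drop of roughly 20 bits.
for bits in (60, 40):
    identities = equivalent_identities(bits)
    collision = birthday_collision_probability(bits, 1_000_000)  # hypothetical population
    print(f"{bits} bits: {identities:.3e} identities, collision probability {collision:.3e}")
```

Under these assumptions, a loss of 20 bits shrinks the equivalent identity space by a factor of 2^20, about one million, which is why a drop of this size matters for large user populations.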