{"title":"Towards forensic speaker identification in Spanish using triplet loss","authors":"Emmanuel Maqueda, Javier Alvarez, Iván V. Meza","doi":"10.52591/lxai2020121210","DOIUrl":null,"url":null,"abstract":"This work explores the use of a triplet loss deep network setting for the forensic identification of speakers in Spanish. Within the framework, we train a convolutional network to produce vector representations of speech spectrogram slices. Then we test how similar these vectors are for a given speaker and how dissimilar are compared with other speakers. Based on these metrics we propose the calculation of the Likelihood Radio which is a cornerstone for forensic identification.","PeriodicalId":301818,"journal":{"name":"LatinX in AI at Neural Information Processing Systems Conference 2020","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"LatinX in AI at Neural Information Processing Systems Conference 2020","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52591/lxai2020121210","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This work explores the use of a triplet loss deep network setting for the forensic identification of speakers in Spanish. Within the framework, we train a convolutional network to produce vector representations of speech spectrogram slices. Then we test how similar these vectors are for a given speaker and how dissimilar are compared with other speakers. Based on these metrics we propose the calculation of the Likelihood Radio which is a cornerstone for forensic identification.