Musab T. S. Al-Kaltakchi, R. Al-Nima, Mahmood Alfathe, Mohammed A. M. Abdullah
{"title":"使用i-vector方法的余弦距离评分的说话人验证","authors":"Musab T. S. Al-Kaltakchi, R. Al-Nima, Mahmood Alfathe, Mohammed A. M. Abdullah","doi":"10.1109/CSASE48920.2020.9142088","DOIUrl":null,"url":null,"abstract":"In this paper, a robust yet simple speaker verification system is implemented. The speaker verification system is investigated employing the i-vector approach with the Cosine Distance Scoring (CDS) for system classification. In addition, to measure the system performance, Equal Error Rate (EER), Detection Error Trade-off (DET) Curve, Receiver Operating Characteristic (ROC) curve as well as Detection Cost Function (DCF) were utilized. Experimental results are conducted on the TMIT database using 64 randomly selected speakers. The proposed system utilizes the Mel Frequency Cepstral Coefficients (MFCC) and Power Normalized Cepstral Coefficients (PNCC) for feature extraction. In addition, features normalization methods such as Feature Warping (FW) and Cepstral Mean-Variance Normalization (CMVN) are used in order to mitigate channel effect noise. The speakers are modeled with the i-vector while CDS is used for classification. Experimental results demonstrate that the proposed system achieved promising results while being computationally efficient.","PeriodicalId":254581,"journal":{"name":"2020 International Conference on Computer Science and Software Engineering (CSASE)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Speaker Verification Using Cosine Distance Scoring with i-vector Approach\",\"authors\":\"Musab T. S. Al-Kaltakchi, R. Al-Nima, Mahmood Alfathe, Mohammed A. M. Abdullah\",\"doi\":\"10.1109/CSASE48920.2020.9142088\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a robust yet simple speaker verification system is implemented. The speaker verification system is investigated employing the i-vector approach with the Cosine Distance Scoring (CDS) for system classification. In addition, to measure the system performance, Equal Error Rate (EER), Detection Error Trade-off (DET) Curve, Receiver Operating Characteristic (ROC) curve as well as Detection Cost Function (DCF) were utilized. Experimental results are conducted on the TMIT database using 64 randomly selected speakers. The proposed system utilizes the Mel Frequency Cepstral Coefficients (MFCC) and Power Normalized Cepstral Coefficients (PNCC) for feature extraction. In addition, features normalization methods such as Feature Warping (FW) and Cepstral Mean-Variance Normalization (CMVN) are used in order to mitigate channel effect noise. The speakers are modeled with the i-vector while CDS is used for classification. Experimental results demonstrate that the proposed system achieved promising results while being computationally efficient.\",\"PeriodicalId\":254581,\"journal\":{\"name\":\"2020 International Conference on Computer Science and Software Engineering (CSASE)\",\"volume\":\"75 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Computer Science and Software Engineering (CSASE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSASE48920.2020.9142088\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Computer Science and Software Engineering (CSASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSASE48920.2020.9142088","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speaker Verification Using Cosine Distance Scoring with i-vector Approach
In this paper, a robust yet simple speaker verification system is implemented. The speaker verification system is investigated employing the i-vector approach with the Cosine Distance Scoring (CDS) for system classification. In addition, to measure the system performance, Equal Error Rate (EER), Detection Error Trade-off (DET) Curve, Receiver Operating Characteristic (ROC) curve as well as Detection Cost Function (DCF) were utilized. Experimental results are conducted on the TMIT database using 64 randomly selected speakers. The proposed system utilizes the Mel Frequency Cepstral Coefficients (MFCC) and Power Normalized Cepstral Coefficients (PNCC) for feature extraction. In addition, features normalization methods such as Feature Warping (FW) and Cepstral Mean-Variance Normalization (CMVN) are used in order to mitigate channel effect noise. The speakers are modeled with the i-vector while CDS is used for classification. Experimental results demonstrate that the proposed system achieved promising results while being computationally efficient.