G. Costantini, M. Todisco, R. Perfetti, A. Paoloni, G. Saggio
{"title":"Single-sided objective speech intelligibility assessment based on Sparse signal representation","authors":"G. Costantini, M. Todisco, R. Perfetti, A. Paoloni, G. Saggio","doi":"10.1109/MLSP.2012.6349776","DOIUrl":null,"url":null,"abstract":"Transcription of speech signals, originating from a lawful interception, is particularly important in the forensic phonetics framework. These signals are often degraded and the transcript may not replicate what was actually pronounced. In the absence of the clean signal, the only way to estimate the level of accuracy that can be obtained in the transcription is to develop an objective methodology for intelligibility measurements. In this paper a method based on the Normalized Spectrum Envelope (NSE) and Sparse Non-negative Matrix Factorization (SNMF) is proposed to evaluate the signal intelligibility. The approaches are tested with three different noise types and the results are compared with the speech intelligibility scores measured by subjective tests. The results of the experiments show a high correlation between objective measurements and subjective evaluations. Therefore, the proposed methodology can be successfully used in order to establish whether a given intercepted signal can be transcribed with sufficient reliability.","PeriodicalId":262601,"journal":{"name":"2012 IEEE International Workshop on Machine Learning for Signal Processing","volume":"16 17","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Workshop on Machine Learning for Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MLSP.2012.6349776","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Transcription of speech signals, originating from a lawful interception, is particularly important in the forensic phonetics framework. These signals are often degraded and the transcript may not replicate what was actually pronounced. In the absence of the clean signal, the only way to estimate the level of accuracy that can be obtained in the transcription is to develop an objective methodology for intelligibility measurements. In this paper a method based on the Normalized Spectrum Envelope (NSE) and Sparse Non-negative Matrix Factorization (SNMF) is proposed to evaluate the signal intelligibility. The approaches are tested with three different noise types and the results are compared with the speech intelligibility scores measured by subjective tests. The results of the experiments show a high correlation between objective measurements and subjective evaluations. Therefore, the proposed methodology can be successfully used in order to establish whether a given intercepted signal can be transcribed with sufficient reliability.