{"title":"基于稀疏表示的欺骗性语音检测","authors":"Xiaohe Fan, Heming Zhao, Xueqin Chen, Cheng Fan, Shuxi Chen","doi":"10.1109/CSPA.2016.7515793","DOIUrl":null,"url":null,"abstract":"Generally, the extracted features of distinguishing deceptive speeches always focused on prosodic, vocal tract, lexical and glottal waveform features. The purpose of this paper is to examine the effectiveness of sparse coefficients for deception detection. In this paper, we firstly extract the Mel-Frequency Cepstrum Coefficient (MFCC) and Zero Crossing Rate (ZCR) from speech utterances as the input data of K-SVD algorithm to learn a mixture dictionary. And sparse coefficients are obtained by Orthogonal Matching Pursuit (OMP) algorithm. Then we use those coefficients as features to train Support Vector Machine (SVM) model and test the classifier accuracy based on the trained model. Finally, we present the experimental results of this approach and compare the results with the conventional features consisting of Short-Time, Pitch, Formant, and Duration based on corpus of Soochow University Speech Processing Researches-Deception Speech Detection Corpus (SUSP-DSD). It shows that sparse coefficients perform better than the conventional features in deception detection.","PeriodicalId":314829,"journal":{"name":"2016 IEEE 12th International Colloquium on Signal Processing & Its Applications (CSPA)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Deceptive Speech Detection based on sparse representation\",\"authors\":\"Xiaohe Fan, Heming Zhao, Xueqin Chen, Cheng Fan, Shuxi Chen\",\"doi\":\"10.1109/CSPA.2016.7515793\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Generally, the extracted features of distinguishing deceptive speeches always focused on prosodic, vocal tract, lexical and glottal waveform features. The purpose of this paper is to examine the effectiveness of sparse coefficients for deception detection. In this paper, we firstly extract the Mel-Frequency Cepstrum Coefficient (MFCC) and Zero Crossing Rate (ZCR) from speech utterances as the input data of K-SVD algorithm to learn a mixture dictionary. And sparse coefficients are obtained by Orthogonal Matching Pursuit (OMP) algorithm. Then we use those coefficients as features to train Support Vector Machine (SVM) model and test the classifier accuracy based on the trained model. Finally, we present the experimental results of this approach and compare the results with the conventional features consisting of Short-Time, Pitch, Formant, and Duration based on corpus of Soochow University Speech Processing Researches-Deception Speech Detection Corpus (SUSP-DSD). It shows that sparse coefficients perform better than the conventional features in deception detection.\",\"PeriodicalId\":314829,\"journal\":{\"name\":\"2016 IEEE 12th International Colloquium on Signal Processing & Its Applications (CSPA)\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-03-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE 12th International Colloquium on Signal Processing & Its Applications (CSPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSPA.2016.7515793\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE 12th International Colloquium on Signal Processing & Its Applications (CSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSPA.2016.7515793","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deceptive Speech Detection based on sparse representation
Generally, the extracted features of distinguishing deceptive speeches always focused on prosodic, vocal tract, lexical and glottal waveform features. The purpose of this paper is to examine the effectiveness of sparse coefficients for deception detection. In this paper, we firstly extract the Mel-Frequency Cepstrum Coefficient (MFCC) and Zero Crossing Rate (ZCR) from speech utterances as the input data of K-SVD algorithm to learn a mixture dictionary. And sparse coefficients are obtained by Orthogonal Matching Pursuit (OMP) algorithm. Then we use those coefficients as features to train Support Vector Machine (SVM) model and test the classifier accuracy based on the trained model. Finally, we present the experimental results of this approach and compare the results with the conventional features consisting of Short-Time, Pitch, Formant, and Duration based on corpus of Soochow University Speech Processing Researches-Deception Speech Detection Corpus (SUSP-DSD). It shows that sparse coefficients perform better than the conventional features in deception detection.