{"title":"基于MFCC特征和矢量量化的色情音频检测","authors":"Zhiyi Qu, Jing Yu, Qiang Niu","doi":"10.1109/ICCIS.2010.228","DOIUrl":null,"url":null,"abstract":"Identifying the phonetic characteristics of speakers is an important branch of speech recognition. The audition system of human being is an ideal speaker recognition system. MFCC (Mel-frequency cepstral coefficients) characterizes the auditory features of humans effectively, and thus has been widely used in practice. This paper explores applying MFCC and VQ (vector quantization) algorithms on the pornographic audios detection. Firstly, MFCC of selected pornographic audios are extracted and then encoded into codebooks using VQ algorithm, Secondly, all of the codebooks obtained will be averaged to get an average codebook, Finally, the type of any newly input audio belonging to, either pornographic or non-pornographic, will be determined by measuring the Euclidean distance between the average codebook and its own codebook. Experiment results show that the algorithm can detect pornographic audios effectively.","PeriodicalId":227848,"journal":{"name":"2010 International Conference on Computational and Information Sciences","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Pornographic Audios Detection Using MFCC Features and Vector Quantization\",\"authors\":\"Zhiyi Qu, Jing Yu, Qiang Niu\",\"doi\":\"10.1109/ICCIS.2010.228\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Identifying the phonetic characteristics of speakers is an important branch of speech recognition. The audition system of human being is an ideal speaker recognition system. MFCC (Mel-frequency cepstral coefficients) characterizes the auditory features of humans effectively, and thus has been widely used in practice. This paper explores applying MFCC and VQ (vector quantization) algorithms on the pornographic audios detection. Firstly, MFCC of selected pornographic audios are extracted and then encoded into codebooks using VQ algorithm, Secondly, all of the codebooks obtained will be averaged to get an average codebook, Finally, the type of any newly input audio belonging to, either pornographic or non-pornographic, will be determined by measuring the Euclidean distance between the average codebook and its own codebook. Experiment results show that the algorithm can detect pornographic audios effectively.\",\"PeriodicalId\":227848,\"journal\":{\"name\":\"2010 International Conference on Computational and Information Sciences\",\"volume\":\"50 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 International Conference on Computational and Information Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIS.2010.228\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Computational and Information Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIS.2010.228","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Pornographic Audios Detection Using MFCC Features and Vector Quantization
Identifying the phonetic characteristics of speakers is an important branch of speech recognition. The audition system of human being is an ideal speaker recognition system. MFCC (Mel-frequency cepstral coefficients) characterizes the auditory features of humans effectively, and thus has been widely used in practice. This paper explores applying MFCC and VQ (vector quantization) algorithms on the pornographic audios detection. Firstly, MFCC of selected pornographic audios are extracted and then encoded into codebooks using VQ algorithm, Secondly, all of the codebooks obtained will be averaged to get an average codebook, Finally, the type of any newly input audio belonging to, either pornographic or non-pornographic, will be determined by measuring the Euclidean distance between the average codebook and its own codebook. Experiment results show that the algorithm can detect pornographic audios effectively.