{"title":"使用基于mdct的特征和监督学习检测AAC压缩","authors":"José Juan García-Hernández, W. Gómez-Flores","doi":"10.1080/0952813X.2021.1882003","DOIUrl":null,"url":null,"abstract":"ABSTRACT Audio files are frequent targets of malicious users who seek illegal profit trading with fake-quality content. For increasing the confidence in the integrity of audio files, the detection of fake-quality content is an important task. This paper proposes a method for detecting Advanced Audio Coding (AAC) compression on suspicious WAV files, in which the variance of the Modified Discrete Cosine Transform (MDCT) characterises four compression bitrates: uncompressed, 64 kbps, 128 kbps, and 256 kbps. This scheme takes advantage of the reduction of the variance of the high-frequency MDCT coefficients in compressed signals. Data obtained from MDCT coefficients generate a high-dimensional feature space. Hence, Principal Component Analysis, followed by Linear Discriminant Analysis, is used for projecting the high-dimensional data onto a lower-dimensional space. Besides, six supervised learning algorithms are compared for classifying four compression bitrates. The experiments show that using audio samples with 20 seconds and 1024 MDCT coefficients, an accuracy of 93% is reached with a Bayesian classifier. Collaterally, the detection between uncompressed and compressed signals attains an accuracy of 97% with Multinomial Logistic Regression. In conclusion, the proposed approach can detect previous AAC compression and can be potentially used when it is unfeasible to recover the suspicious signal completely.","PeriodicalId":15677,"journal":{"name":"Journal of Experimental & Theoretical Artificial Intelligence","volume":"22 1","pages":"451 - 468"},"PeriodicalIF":1.7000,"publicationDate":"2021-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Detection of AAC compression using MDCT-based features and supervised learning\",\"authors\":\"José Juan García-Hernández, W. Gómez-Flores\",\"doi\":\"10.1080/0952813X.2021.1882003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT Audio files are frequent targets of malicious users who seek illegal profit trading with fake-quality content. For increasing the confidence in the integrity of audio files, the detection of fake-quality content is an important task. This paper proposes a method for detecting Advanced Audio Coding (AAC) compression on suspicious WAV files, in which the variance of the Modified Discrete Cosine Transform (MDCT) characterises four compression bitrates: uncompressed, 64 kbps, 128 kbps, and 256 kbps. This scheme takes advantage of the reduction of the variance of the high-frequency MDCT coefficients in compressed signals. Data obtained from MDCT coefficients generate a high-dimensional feature space. Hence, Principal Component Analysis, followed by Linear Discriminant Analysis, is used for projecting the high-dimensional data onto a lower-dimensional space. Besides, six supervised learning algorithms are compared for classifying four compression bitrates. The experiments show that using audio samples with 20 seconds and 1024 MDCT coefficients, an accuracy of 93% is reached with a Bayesian classifier. Collaterally, the detection between uncompressed and compressed signals attains an accuracy of 97% with Multinomial Logistic Regression. In conclusion, the proposed approach can detect previous AAC compression and can be potentially used when it is unfeasible to recover the suspicious signal completely.\",\"PeriodicalId\":15677,\"journal\":{\"name\":\"Journal of Experimental & Theoretical Artificial Intelligence\",\"volume\":\"22 1\",\"pages\":\"451 - 468\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2021-01-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Experimental & Theoretical Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1080/0952813X.2021.1882003\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Experimental & Theoretical Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1080/0952813X.2021.1882003","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Detection of AAC compression using MDCT-based features and supervised learning
ABSTRACT Audio files are frequent targets of malicious users who seek illegal profit trading with fake-quality content. For increasing the confidence in the integrity of audio files, the detection of fake-quality content is an important task. This paper proposes a method for detecting Advanced Audio Coding (AAC) compression on suspicious WAV files, in which the variance of the Modified Discrete Cosine Transform (MDCT) characterises four compression bitrates: uncompressed, 64 kbps, 128 kbps, and 256 kbps. This scheme takes advantage of the reduction of the variance of the high-frequency MDCT coefficients in compressed signals. Data obtained from MDCT coefficients generate a high-dimensional feature space. Hence, Principal Component Analysis, followed by Linear Discriminant Analysis, is used for projecting the high-dimensional data onto a lower-dimensional space. Besides, six supervised learning algorithms are compared for classifying four compression bitrates. The experiments show that using audio samples with 20 seconds and 1024 MDCT coefficients, an accuracy of 93% is reached with a Bayesian classifier. Collaterally, the detection between uncompressed and compressed signals attains an accuracy of 97% with Multinomial Logistic Regression. In conclusion, the proposed approach can detect previous AAC compression and can be potentially used when it is unfeasible to recover the suspicious signal completely.
期刊介绍:
Journal of Experimental & Theoretical Artificial Intelligence (JETAI) is a world leading journal dedicated to publishing high quality, rigorously reviewed, original papers in artificial intelligence (AI) research.
The journal features work in all subfields of AI research and accepts both theoretical and applied research. Topics covered include, but are not limited to, the following:
• cognitive science
• games
• learning
• knowledge representation
• memory and neural system modelling
• perception
• problem-solving