Classification of Audio Codecs with Variable Bit- Rates

2020 10th International Conference on Computer and Knowledge Engineering (ICCKE) Pub Date : 2020-10-29 DOI:10.1109/ICCKE50421.2020.9303637

Atieh Khodadadi, M. Teimouri

{"title":"Classification of Audio Codecs with Variable Bit- Rates","authors":"Atieh Khodadadi, M. Teimouri","doi":"10.1109/ICCKE50421.2020.9303637","DOIUrl":null,"url":null,"abstract":"A large portion of the Internet bandwidth is used for transmission of multimedia such as audio data. For eavesdropping or network surveillance purposes, the first step of a sniffer may be to determine the codec by which a fragment is generated. This problem is usually modeled as a multi-class classification problem. The basic methods for determining the codec type of each fragment rely on the metadata in the corresponding file header. However, in a non-cooperative context, the whole file is not available. So, generally, statistical features extracted from the fragments combined with machine learning algorithms are used for solving this multi-class classification problem. To date, almost all frameworks implicitly assume fixed and known bit-rates for the employed codecs. However, in practical situations, various rates of a specific codec may be used in a network. In this situation, as it is shown in this paper, the classifiers trained by codecs with fixed bit-rates perform poorly when the test data is generated by various rates of the codecs. In this paper, the classification of audio codec fragments with variable bit-rates is considered, simulated, and analyzed. According to the simulation results, for 1 Kbyte fragments, the accuracy of the proposed random forest classifier is about 89%.","PeriodicalId":402043,"journal":{"name":"2020 10th International Conference on Computer and Knowledge Engineering (ICCKE)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 10th International Conference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE50421.2020.9303637","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

A large portion of the Internet bandwidth is used for transmission of multimedia such as audio data. For eavesdropping or network surveillance purposes, the first step of a sniffer may be to determine the codec by which a fragment is generated. This problem is usually modeled as a multi-class classification problem. The basic methods for determining the codec type of each fragment rely on the metadata in the corresponding file header. However, in a non-cooperative context, the whole file is not available. So, generally, statistical features extracted from the fragments combined with machine learning algorithms are used for solving this multi-class classification problem. To date, almost all frameworks implicitly assume fixed and known bit-rates for the employed codecs. However, in practical situations, various rates of a specific codec may be used in a network. In this situation, as it is shown in this paper, the classifiers trained by codecs with fixed bit-rates perform poorly when the test data is generated by various rates of the codecs. In this paper, the classification of audio codec fragments with variable bit-rates is considered, simulated, and analyzed. According to the simulation results, for 1 Kbyte fragments, the accuracy of the proposed random forest classifier is about 89%.

查看原文本刊更多论文

可变比特率音频编解码器的分类

互联网带宽的很大一部分用于传输多媒体，如音频数据。为了窃听或网络监视目的，嗅探器的第一步可能是确定生成片段的编解码器。这个问题通常被建模为一个多类分类问题。确定每个片段的编解码器类型的基本方法依赖于相应文件头中的元数据。然而，在非合作的上下文中，整个文件是不可用的。因此，通常采用从碎片中提取统计特征并结合机器学习算法来解决这一多类分类问题。到目前为止，几乎所有的框架都隐式地为所使用的编解码器假定固定和已知的比特率。然而，在实际情况中，可以在网络中使用特定编解码器的各种速率。在这种情况下，如本文所示，当测试数据由不同速率的编解码器生成时，由固定比特率的编解码器训练的分类器表现不佳。本文对可变比特率音频编解码器片段的分类进行了研究、仿真和分析。仿真结果表明，对于1kbyte的碎片，所提出的随机森林分类器的准确率约为89%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 10th International Conference on Computer and Knowledge Engineering (ICCKE)

自引率

0.00%

发文量