Audio Acoustic Features Based Tagging and Comparative Analysis of its Classifications

2018 Eleventh International Conference on Contemporary Computing (IC3) Pub Date : 2018-08-01 DOI:10.1109/IC3.2018.8530512

Somya Goel, Raghav Pangasa, Suma Dawn, Anuja Arora

引用次数: 4

Abstract

Musical genres can be used to distribute and manage music datasets to increase the ease in finding a music piece a person wants to listen to. This paper presents a research for creating a suitable model for genre recognition in audio files using machine learning classifiers on the IRMAS11 https://www.upf.edu/web/mtg/irmas dataset. Python language library pyAudinAnalysls22 https://github.com/tyiannak/pyAudioAnalysis for extracting features from audio files is used. Further, three base classifiers, namely Support Vector Machines (SVM), Decision Tree and Random Forest are also depicted. IRMAS [10] genre dataset provides 6705 audio files of four genres classical, country folk, jazz and pop-rock. Also explored is an ensemble classification model by creating a stack of classifiers for the genre recognition task. Genre classification using SMOTE has been characterized in the confusion matrix. Maximum accuracy of 81.56% using the ensemble classifier is achieved using the proposed methodology.

查看原文本刊更多论文

基于声学特征的标记及其分类比较分析

音乐类型可以用于分发和管理音乐数据集，以增加查找一个人想听的音乐片段的便利性。本文介绍了在IRMAS11 https://www.upf.edu/web/mtg/irmas数据集上使用机器学习分类器创建适合音频文件类型识别的模型的研究。使用Python语言库pyAudinAnalysls22 https://github.com/tyiannak/pyAudioAnalysis从音频文件中提取特征。此外，还描述了三种基本分类器，即支持向量机(SVM)、决策树和随机森林。IRMAS[10]类型数据集提供了古典、乡村民谣、爵士和流行摇滚四种类型的6705个音频文件。还探讨了通过为类型识别任务创建一堆分类器的集成分类模型。使用SMOTE的类型分类已经在混淆矩阵中得到表征。使用所提出的方法，使用集成分类器实现了81.56%的最大准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 Eleventh International Conference on Contemporary Computing (IC3)

自引率

0.00%

发文量