{"title":"基于深度神经网络的原始音频信号音乐情感识别","authors":"Richard Orjesek, R. Jarina, M. Chmulik, M. Kuba","doi":"10.1109/RADIOELEK.2019.8733572","DOIUrl":null,"url":null,"abstract":"Music emotion is an important component in the field of music information retrieval and computational musicology. Considering the complexity of the state-of-the-art methods and challenges brought by lack of information available in existing public databases, we propose novel approach to emotion recognition based on convolutional and recurrent neural networks for feature mining. The proposed method utilizes a latest findings in deep learning by stacking convolution layer with bidirectional gated recurrent unit. The method was evaluated on the MediaEval Emotion in Music dataset and has shown exceptional performance using only raw audio signals and without any need for pre-processing.","PeriodicalId":336454,"journal":{"name":"2019 29th International Conference Radioelektronika (RADIOELEKTRONIKA)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":"{\"title\":\"DNN Based Music Emotion Recognition from Raw Audio Signal\",\"authors\":\"Richard Orjesek, R. Jarina, M. Chmulik, M. Kuba\",\"doi\":\"10.1109/RADIOELEK.2019.8733572\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Music emotion is an important component in the field of music information retrieval and computational musicology. Considering the complexity of the state-of-the-art methods and challenges brought by lack of information available in existing public databases, we propose novel approach to emotion recognition based on convolutional and recurrent neural networks for feature mining. The proposed method utilizes a latest findings in deep learning by stacking convolution layer with bidirectional gated recurrent unit. The method was evaluated on the MediaEval Emotion in Music dataset and has shown exceptional performance using only raw audio signals and without any need for pre-processing.\",\"PeriodicalId\":336454,\"journal\":{\"name\":\"2019 29th International Conference Radioelektronika (RADIOELEKTRONIKA)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"21\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 29th International Conference Radioelektronika (RADIOELEKTRONIKA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RADIOELEK.2019.8733572\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 29th International Conference Radioelektronika (RADIOELEKTRONIKA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RADIOELEK.2019.8733572","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 21
摘要
音乐情感是音乐信息检索和计算音乐学领域的重要组成部分。考虑到最先进的方法的复杂性和现有公共数据库缺乏可用信息所带来的挑战,我们提出了基于卷积和递归神经网络特征挖掘的情感识别新方法。该方法利用深度学习领域的最新研究成果,将卷积层与双向门控循环单元叠加。该方法在MediaEval Emotion in Music数据集上进行了评估,仅使用原始音频信号且不需要任何预处理,就显示出了出色的性能。
DNN Based Music Emotion Recognition from Raw Audio Signal
Music emotion is an important component in the field of music information retrieval and computational musicology. Considering the complexity of the state-of-the-art methods and challenges brought by lack of information available in existing public databases, we propose novel approach to emotion recognition based on convolutional and recurrent neural networks for feature mining. The proposed method utilizes a latest findings in deep learning by stacking convolution layer with bidirectional gated recurrent unit. The method was evaluated on the MediaEval Emotion in Music dataset and has shown exceptional performance using only raw audio signals and without any need for pre-processing.