卡纳蒂克音乐中Janya Raga分类的深度学习方法

2022 6th SLAAI International Conference on Artificial Intelligence (SLAAI-ICAI) Pub Date : 2022-12-01 DOI:10.1109/SLAAI-ICAI56923.2022.10002700

P. Kavitha, J. Charles, L. S. Lekamge

{"title":"卡纳蒂克音乐中Janya Raga分类的深度学习方法","authors":"P. Kavitha, J. Charles, L. S. Lekamge","doi":"10.1109/SLAAI-ICAI56923.2022.10002700","DOIUrl":null,"url":null,"abstract":"Janya Ragas are Carnatic music ragas derived from the fundamental ragas called Melakarta Raga, by the permutation and combination of various ascending and descending swaras. Raga recognition is an important task, essential to research in Music Information Retrieval in Indian Classical Music. Even though the automatic raga recognition methods have been widely adopted in Carnatic music, most of the existing methods are restricted only to Melakarta Ragas. It is an important task to identify the Melakarta Raga of a Janya Raga, especially for improved music recommendation performance. In this study, we used deep learning-based approach to address this problem. Music files were collected for 65 Janya Ragas belonging to seven Melakarta Ragas. Classification-based supervised deep learning models: 1D CNN, LSTM and 1D CNN-LSTM were used in the study. The models were implemented to learn similarities from 1–20 mean values of Mel-Frequency Cepstral Coefficients (MFCCs) features of audio samples, by using TensorFlow and Keras APIs. The results revealed that the 1D CNN-LSTM model outperformed the other models with an accuracy of 82.0%. In the future, 1D CNN-LSTM based Siamese neural networks can be introduced to reduce the dependence on large amounts of labeled audio data.","PeriodicalId":308901,"journal":{"name":"2022 6th SLAAI International Conference on Artificial Intelligence (SLAAI-ICAI)","volume":"235 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Deep Learning Based Approach for Janya Raga Classification in Carnatic Music\",\"authors\":\"P. Kavitha, J. Charles, L. S. Lekamge\",\"doi\":\"10.1109/SLAAI-ICAI56923.2022.10002700\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Janya Ragas are Carnatic music ragas derived from the fundamental ragas called Melakarta Raga, by the permutation and combination of various ascending and descending swaras. Raga recognition is an important task, essential to research in Music Information Retrieval in Indian Classical Music. Even though the automatic raga recognition methods have been widely adopted in Carnatic music, most of the existing methods are restricted only to Melakarta Ragas. It is an important task to identify the Melakarta Raga of a Janya Raga, especially for improved music recommendation performance. In this study, we used deep learning-based approach to address this problem. Music files were collected for 65 Janya Ragas belonging to seven Melakarta Ragas. Classification-based supervised deep learning models: 1D CNN, LSTM and 1D CNN-LSTM were used in the study. The models were implemented to learn similarities from 1–20 mean values of Mel-Frequency Cepstral Coefficients (MFCCs) features of audio samples, by using TensorFlow and Keras APIs. The results revealed that the 1D CNN-LSTM model outperformed the other models with an accuracy of 82.0%. In the future, 1D CNN-LSTM based Siamese neural networks can be introduced to reduce the dependence on large amounts of labeled audio data.\",\"PeriodicalId\":308901,\"journal\":{\"name\":\"2022 6th SLAAI International Conference on Artificial Intelligence (SLAAI-ICAI)\",\"volume\":\"235 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 6th SLAAI International Conference on Artificial Intelligence (SLAAI-ICAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SLAAI-ICAI56923.2022.10002700\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 6th SLAAI International Conference on Artificial Intelligence (SLAAI-ICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SLAAI-ICAI56923.2022.10002700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

Janya Ragas是卡纳蒂克音乐Ragas，源于基本的Ragas称为Melakarta Raga，通过各种上升和下降的swaras的排列和组合。拉格识别是印度古典音乐音乐信息检索研究的重要内容和基础。尽管自动拉格识别方法在卡纳蒂克音乐中被广泛采用，但大多数现有的方法仅限于Melakarta Ragas。识别Janya Raga中的Melakarta Raga是一项重要的任务，特别是对于提高音乐推荐性能。在本研究中，我们使用基于深度学习的方法来解决这个问题。收集了7个Melakarta Ragas的65个Janya Ragas的音乐文件。本研究采用基于分类的监督深度学习模型:1D CNN、LSTM和1D CNN-LSTM。通过使用TensorFlow和Keras api，实现了从音频样本的Mel-Frequency Cepstral系数(MFCCs)特征的1-20均值中学习相似度的模型。结果表明，一维CNN-LSTM模型的准确率为82.0%，优于其他模型。在未来，可以引入基于1D CNN-LSTM的Siamese神经网络来减少对大量标记音频数据的依赖。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Deep Learning Based Approach for Janya Raga Classification in Carnatic Music

Janya Ragas are Carnatic music ragas derived from the fundamental ragas called Melakarta Raga, by the permutation and combination of various ascending and descending swaras. Raga recognition is an important task, essential to research in Music Information Retrieval in Indian Classical Music. Even though the automatic raga recognition methods have been widely adopted in Carnatic music, most of the existing methods are restricted only to Melakarta Ragas. It is an important task to identify the Melakarta Raga of a Janya Raga, especially for improved music recommendation performance. In this study, we used deep learning-based approach to address this problem. Music files were collected for 65 Janya Ragas belonging to seven Melakarta Ragas. Classification-based supervised deep learning models: 1D CNN, LSTM and 1D CNN-LSTM were used in the study. The models were implemented to learn similarities from 1–20 mean values of Mel-Frequency Cepstral Coefficients (MFCCs) features of audio samples, by using TensorFlow and Keras APIs. The results revealed that the 1D CNN-LSTM model outperformed the other models with an accuracy of 82.0%. In the future, 1D CNN-LSTM based Siamese neural networks can be introduced to reduce the dependence on large amounts of labeled audio data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 6th SLAAI International Conference on Artificial Intelligence (SLAAI-ICAI)

自引率

0.00%

发文量