卡纳蒂克音乐中Janya Raga分类的深度学习方法

P. Kavitha, J. Charles, L. S. Lekamge
{"title":"卡纳蒂克音乐中Janya Raga分类的深度学习方法","authors":"P. Kavitha, J. Charles, L. S. Lekamge","doi":"10.1109/SLAAI-ICAI56923.2022.10002700","DOIUrl":null,"url":null,"abstract":"Janya Ragas are Carnatic music ragas derived from the fundamental ragas called Melakarta Raga, by the permutation and combination of various ascending and descending swaras. Raga recognition is an important task, essential to research in Music Information Retrieval in Indian Classical Music. Even though the automatic raga recognition methods have been widely adopted in Carnatic music, most of the existing methods are restricted only to Melakarta Ragas. It is an important task to identify the Melakarta Raga of a Janya Raga, especially for improved music recommendation performance. In this study, we used deep learning-based approach to address this problem. Music files were collected for 65 Janya Ragas belonging to seven Melakarta Ragas. Classification-based supervised deep learning models: 1D CNN, LSTM and 1D CNN-LSTM were used in the study. The models were implemented to learn similarities from 1–20 mean values of Mel-Frequency Cepstral Coefficients (MFCCs) features of audio samples, by using TensorFlow and Keras APIs. The results revealed that the 1D CNN-LSTM model outperformed the other models with an accuracy of 82.0%. In the future, 1D CNN-LSTM based Siamese neural networks can be introduced to reduce the dependence on large amounts of labeled audio data.","PeriodicalId":308901,"journal":{"name":"2022 6th SLAAI International Conference on Artificial Intelligence (SLAAI-ICAI)","volume":"235 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Deep Learning Based Approach for Janya Raga Classification in Carnatic Music\",\"authors\":\"P. Kavitha, J. Charles, L. S. Lekamge\",\"doi\":\"10.1109/SLAAI-ICAI56923.2022.10002700\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Janya Ragas are Carnatic music ragas derived from the fundamental ragas called Melakarta Raga, by the permutation and combination of various ascending and descending swaras. Raga recognition is an important task, essential to research in Music Information Retrieval in Indian Classical Music. Even though the automatic raga recognition methods have been widely adopted in Carnatic music, most of the existing methods are restricted only to Melakarta Ragas. It is an important task to identify the Melakarta Raga of a Janya Raga, especially for improved music recommendation performance. In this study, we used deep learning-based approach to address this problem. Music files were collected for 65 Janya Ragas belonging to seven Melakarta Ragas. Classification-based supervised deep learning models: 1D CNN, LSTM and 1D CNN-LSTM were used in the study. The models were implemented to learn similarities from 1–20 mean values of Mel-Frequency Cepstral Coefficients (MFCCs) features of audio samples, by using TensorFlow and Keras APIs. The results revealed that the 1D CNN-LSTM model outperformed the other models with an accuracy of 82.0%. In the future, 1D CNN-LSTM based Siamese neural networks can be introduced to reduce the dependence on large amounts of labeled audio data.\",\"PeriodicalId\":308901,\"journal\":{\"name\":\"2022 6th SLAAI International Conference on Artificial Intelligence (SLAAI-ICAI)\",\"volume\":\"235 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 6th SLAAI International Conference on Artificial Intelligence (SLAAI-ICAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SLAAI-ICAI56923.2022.10002700\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 6th SLAAI International Conference on Artificial Intelligence (SLAAI-ICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SLAAI-ICAI56923.2022.10002700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

Janya Ragas是卡纳蒂克音乐Ragas,源于基本的Ragas称为Melakarta Raga,通过各种上升和下降的swaras的排列和组合。拉格识别是印度古典音乐音乐信息检索研究的重要内容和基础。尽管自动拉格识别方法在卡纳蒂克音乐中被广泛采用,但大多数现有的方法仅限于Melakarta Ragas。识别Janya Raga中的Melakarta Raga是一项重要的任务,特别是对于提高音乐推荐性能。在本研究中,我们使用基于深度学习的方法来解决这个问题。收集了7个Melakarta Ragas的65个Janya Ragas的音乐文件。本研究采用基于分类的监督深度学习模型:1D CNN、LSTM和1D CNN-LSTM。通过使用TensorFlow和Keras api,实现了从音频样本的Mel-Frequency Cepstral系数(MFCCs)特征的1-20均值中学习相似度的模型。结果表明,一维CNN-LSTM模型的准确率为82.0%,优于其他模型。在未来,可以引入基于1D CNN-LSTM的Siamese神经网络来减少对大量标记音频数据的依赖。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Deep Learning Based Approach for Janya Raga Classification in Carnatic Music
Janya Ragas are Carnatic music ragas derived from the fundamental ragas called Melakarta Raga, by the permutation and combination of various ascending and descending swaras. Raga recognition is an important task, essential to research in Music Information Retrieval in Indian Classical Music. Even though the automatic raga recognition methods have been widely adopted in Carnatic music, most of the existing methods are restricted only to Melakarta Ragas. It is an important task to identify the Melakarta Raga of a Janya Raga, especially for improved music recommendation performance. In this study, we used deep learning-based approach to address this problem. Music files were collected for 65 Janya Ragas belonging to seven Melakarta Ragas. Classification-based supervised deep learning models: 1D CNN, LSTM and 1D CNN-LSTM were used in the study. The models were implemented to learn similarities from 1–20 mean values of Mel-Frequency Cepstral Coefficients (MFCCs) features of audio samples, by using TensorFlow and Keras APIs. The results revealed that the 1D CNN-LSTM model outperformed the other models with an accuracy of 82.0%. In the future, 1D CNN-LSTM based Siamese neural networks can be introduced to reduce the dependence on large amounts of labeled audio data.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信