Akshara transcription of mrudangam strokes in Carnatic music

Jom Kuriakose, J. Kumar, Sarala Padi, H. Murthy, Umayalpuram K. Sivaraman
{"title":"Akshara transcription of mrudangam strokes in Carnatic music","authors":"Jom Kuriakose, J. Kumar, Sarala Padi, H. Murthy, Umayalpuram K. Sivaraman","doi":"10.1109/NCC.2015.7084906","DOIUrl":null,"url":null,"abstract":"Percussion instruments play a significant role in Carnatic music concerts. The percussion artist enjoys a great degree of freedom in improvising within the defined tala structure of a composition. The objective of this paper is to transcribe the improvisations, treating the percussion strokes as syllables or aksharas. Onset detection is performed to segment the waveform at each akshara. Using the transcriptions from the training data, a three-state Hidden Markov Model is built for each akshara. The language model is derived from the training data. Testing is also performed isolated style using onset detection to segment the phrase, and the language model to correct the transcription. Transcription is performed on both concert recordings and studio recordings. This technique yields upto ≈ 96% accuracy on studio recordings and ≈ 76% accuracy for concert recordings. As the mrudangam1 is an instrument that is based on tonic; tonic normalised features, namely, Cent Filterbank Cepstral coefficients are used. It is shown that tonic normalisation helps in transcription across different tonics.","PeriodicalId":302718,"journal":{"name":"2015 Twenty First National Conference on Communications (NCC)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Twenty First National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2015.7084906","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14

Abstract

Percussion instruments play a significant role in Carnatic music concerts. The percussion artist enjoys a great degree of freedom in improvising within the defined tala structure of a composition. The objective of this paper is to transcribe the improvisations, treating the percussion strokes as syllables or aksharas. Onset detection is performed to segment the waveform at each akshara. Using the transcriptions from the training data, a three-state Hidden Markov Model is built for each akshara. The language model is derived from the training data. Testing is also performed isolated style using onset detection to segment the phrase, and the language model to correct the transcription. Transcription is performed on both concert recordings and studio recordings. This technique yields upto ≈ 96% accuracy on studio recordings and ≈ 76% accuracy for concert recordings. As the mrudangam1 is an instrument that is based on tonic; tonic normalised features, namely, Cent Filterbank Cepstral coefficients are used. It is shown that tonic normalisation helps in transcription across different tonics.
卡纳蒂克音乐中mrudangam笔画的Akshara转录
打击乐器在卡纳蒂克音乐会上扮演着重要的角色。打击乐艺术家在乐曲的既定塔拉结构中享有很大程度的即兴创作自由。本文的目的是抄写即兴,把打击乐的笔划当作音节或音。执行起始检测以在每个akshara处分割波形。利用训练数据的转录,为每个akshara建立一个三状态隐马尔可夫模型。语言模型由训练数据导出。测试还执行了孤立的风格,使用开始检测来分割短语,并使用语言模型来纠正转录。转录是在音乐会录音和录音室录音中进行的。这种技术在录音室录音上的准确度高达≈96%,在音乐会录音上的准确度高达≈76%。mrudangam1是一种以主音为基础的乐器;tonic归一化特征,即分滤波器组倒谱系数被使用。研究表明,主音正常化有助于不同主音的转录。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信