基于广义最小-最大分类器的乐器识别

G. Costantini, A. Rizzi, D. Casali
{"title":"基于广义最小-最大分类器的乐器识别","authors":"G. Costantini, A. Rizzi, D. Casali","doi":"10.1109/NNSP.2003.1318055","DOIUrl":null,"url":null,"abstract":"The correct classification of single musical sources is a relevant aspect for the source separation task and the automatic transcription of polyphonic music. In this paper, we deal with a classification problem concerning the recognition of six different musical instruments: violin, clarinet, flute, oboe, saxophone and piano. A satisfactory solution of such a recognition problem depends mainly on both the preprocessing procedure (set of features extracted from row data) and the adopted classification system. As concerns feature extraction, a suitable signal preprocessing based on FFT, QFT (Q-constant frequency transform) and cepstrum coefficients are employed. We adopt min-max neurofuzzy networks as the classification model, both in their classical and generalized version. The synthesis of these classifiers is performed by the adaptive resolution training technique (ARC, PARC and GPARC algorithms), since it assures good performances and an excellent automation degree.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Recognition of musical instruments by generalized min-max classifiers\",\"authors\":\"G. Costantini, A. Rizzi, D. Casali\",\"doi\":\"10.1109/NNSP.2003.1318055\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The correct classification of single musical sources is a relevant aspect for the source separation task and the automatic transcription of polyphonic music. In this paper, we deal with a classification problem concerning the recognition of six different musical instruments: violin, clarinet, flute, oboe, saxophone and piano. A satisfactory solution of such a recognition problem depends mainly on both the preprocessing procedure (set of features extracted from row data) and the adopted classification system. As concerns feature extraction, a suitable signal preprocessing based on FFT, QFT (Q-constant frequency transform) and cepstrum coefficients are employed. We adopt min-max neurofuzzy networks as the classification model, both in their classical and generalized version. The synthesis of these classifiers is performed by the adaptive resolution training technique (ARC, PARC and GPARC algorithms), since it assures good performances and an excellent automation degree.\",\"PeriodicalId\":315958,\"journal\":{\"name\":\"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2003-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NNSP.2003.1318055\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NNSP.2003.1318055","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

摘要

单一音源的正确分类是音源分离任务和复调音乐自动转写的一个相关方面。本文研究了小提琴、单簧管、长笛、双簧管、萨克斯管和钢琴六种不同乐器的分类问题。这种识别问题的满意解决方案主要取决于预处理程序(从行数据中提取的特征集)和所采用的分类系统。在特征提取方面,采用了基于FFT、QFT (Q-constant frequency transform)和倒谱系数的信号预处理。我们采用最小-最大神经模糊网络作为分类模型,包括经典模型和广义模型。这些分类器的综合是通过自适应分辨率训练技术(ARC, PARC和GPARC算法)进行的,因为它保证了良好的性能和良好的自动化程度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Recognition of musical instruments by generalized min-max classifiers
The correct classification of single musical sources is a relevant aspect for the source separation task and the automatic transcription of polyphonic music. In this paper, we deal with a classification problem concerning the recognition of six different musical instruments: violin, clarinet, flute, oboe, saxophone and piano. A satisfactory solution of such a recognition problem depends mainly on both the preprocessing procedure (set of features extracted from row data) and the adopted classification system. As concerns feature extraction, a suitable signal preprocessing based on FFT, QFT (Q-constant frequency transform) and cepstrum coefficients are employed. We adopt min-max neurofuzzy networks as the classification model, both in their classical and generalized version. The synthesis of these classifiers is performed by the adaptive resolution training technique (ARC, PARC and GPARC algorithms), since it assures good performances and an excellent automation degree.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信