{"title":"Feature fusion based automatic chord recognition model: BTC-FDAA-FGF","authors":"Chen Li, Hao Wu, JingYi Jiang, Lihua Tian","doi":"10.1016/j.compeleceng.2025.110555","DOIUrl":null,"url":null,"abstract":"<div><div>Automatic chord recognition is a significant topic in the field of Music Information Retrieval (MIR). This paper introduces a novel feature fusion method combining Hybrid Constant-Q Transform (HCQT) and adaptive attention for chord detection, especially with a focus on improving the accuracy of chord detection of rare chord classes. Serving as one of the cornerstone features of music, the chords obtained by chord recognition algorithms are the basis of many high-level semantic tasks. At present, a severe class imbalance problem exists in the domain of automatic chord recognition. The recognition accuracy of rare chords is much lower than that of common chords, which significantly affect the overall performance of chord recognition algorithms. In this paper, a chord recognition algorithm based on feature fusion is proposed. First, in the feature extraction part, Hybrid Constant-Q Transform (HCQT) is introduced to assist with Constant-Q Transform(CQT) to obtain richer and finer musical signal features, enabling better tracking of overtones. Next, in the chord estimation part, the frequency-domain adaptive attention (FDAA) mechanism is used to enhance feature saliency, ensuring that the network can adaptively adjust the weights for different frequency components when training. Thereby frequency-domain features that contain important information can be selectively enhanced. The enhanced features are then fed into an aggregation module that integrates a bidirectional self-attention module and Fourier transform module, enabling more effective capture of fine-grained features, global context information, and periodic structures in chords. The experimental result shows that proposed algorithm outperforms existing mainstream baseline methods by 1.2% to 2.2% on the MIREX metrics, validating the effectiveness of the algorithm.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"127 ","pages":"Article 110555"},"PeriodicalIF":4.0000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790625004987","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Automatic chord recognition is a significant topic in the field of Music Information Retrieval (MIR). This paper introduces a novel feature fusion method combining Hybrid Constant-Q Transform (HCQT) and adaptive attention for chord detection, especially with a focus on improving the accuracy of chord detection of rare chord classes. Serving as one of the cornerstone features of music, the chords obtained by chord recognition algorithms are the basis of many high-level semantic tasks. At present, a severe class imbalance problem exists in the domain of automatic chord recognition. The recognition accuracy of rare chords is much lower than that of common chords, which significantly affect the overall performance of chord recognition algorithms. In this paper, a chord recognition algorithm based on feature fusion is proposed. First, in the feature extraction part, Hybrid Constant-Q Transform (HCQT) is introduced to assist with Constant-Q Transform(CQT) to obtain richer and finer musical signal features, enabling better tracking of overtones. Next, in the chord estimation part, the frequency-domain adaptive attention (FDAA) mechanism is used to enhance feature saliency, ensuring that the network can adaptively adjust the weights for different frequency components when training. Thereby frequency-domain features that contain important information can be selectively enhanced. The enhanced features are then fed into an aggregation module that integrates a bidirectional self-attention module and Fourier transform module, enabling more effective capture of fine-grained features, global context information, and periodic structures in chords. The experimental result shows that proposed algorithm outperforms existing mainstream baseline methods by 1.2% to 2.2% on the MIREX metrics, validating the effectiveness of the algorithm.
期刊介绍:
The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency.
Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.