{"title":"A trio-based feature extraction framework for bird sounds classification","authors":"Burak Celik , Ayhan Akbal","doi":"10.1016/j.apacoust.2025.111064","DOIUrl":null,"url":null,"abstract":"<div><div>Bird species identification is crucial for environmental monitoring, ecological studies, and species tracking. Automated bird sound classification systems have been developed to achieve precise species detection. While deep learning models offer high accuracy, their computational complexity poses challenges for resource-limited environments. To address this, we propose a novel lightweight and highly accurate bird sound classification model utilizing a multilevel feature generation framework named AvisPat, derived from the Latin term “Avis” (bird), emphasizing its focus on avian bioacoustics. The AvisPat model leverages a 7-level discrete wavelet transform (DWT) to decompose audio signals, extracting signum, upper ternary, and lower ternary features to capture diverse signal attributes. For feature selection, an enhanced iterative Neighborhood Component Analysis (NCA) and ReliefF methods are applied iteratively to select the most discriminative features, generating multiple feature subsets. These features are classified using k-Nearest Neighbor (k-NN) and Support Vector Machine (SVM) classifiers. In addition, the proposed model achieved 96.72% accuracy on a separate Xeno-Canto dataset containing 10 bird species from diverse geographic regions, demonstrating strong generalization capability. The ’trio’ in AvisPat is chosen because the combination of signum, ternary features extracted via 7-level discrete wavelet transform comprehensively captures the time, frequency, and amplitude aspects of bird sounds, enhancing the model’s ability to distinguish between species with high accuracy.</div></div>","PeriodicalId":55506,"journal":{"name":"Applied Acoustics","volume":"242 ","pages":"Article 111064"},"PeriodicalIF":3.4000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Acoustics","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0003682X25005365","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Bird species identification is crucial for environmental monitoring, ecological studies, and species tracking. Automated bird sound classification systems have been developed to achieve precise species detection. While deep learning models offer high accuracy, their computational complexity poses challenges for resource-limited environments. To address this, we propose a novel lightweight and highly accurate bird sound classification model utilizing a multilevel feature generation framework named AvisPat, derived from the Latin term “Avis” (bird), emphasizing its focus on avian bioacoustics. The AvisPat model leverages a 7-level discrete wavelet transform (DWT) to decompose audio signals, extracting signum, upper ternary, and lower ternary features to capture diverse signal attributes. For feature selection, an enhanced iterative Neighborhood Component Analysis (NCA) and ReliefF methods are applied iteratively to select the most discriminative features, generating multiple feature subsets. These features are classified using k-Nearest Neighbor (k-NN) and Support Vector Machine (SVM) classifiers. In addition, the proposed model achieved 96.72% accuracy on a separate Xeno-Canto dataset containing 10 bird species from diverse geographic regions, demonstrating strong generalization capability. The ’trio’ in AvisPat is chosen because the combination of signum, ternary features extracted via 7-level discrete wavelet transform comprehensively captures the time, frequency, and amplitude aspects of bird sounds, enhancing the model’s ability to distinguish between species with high accuracy.
期刊介绍:
Since its launch in 1968, Applied Acoustics has been publishing high quality research papers providing state-of-the-art coverage of research findings for engineers and scientists involved in applications of acoustics in the widest sense.
Applied Acoustics looks not only at recent developments in the understanding of acoustics but also at ways of exploiting that understanding. The Journal aims to encourage the exchange of practical experience through publication and in so doing creates a fund of technological information that can be used for solving related problems. The presentation of information in graphical or tabular form is especially encouraged. If a report of a mathematical development is a necessary part of a paper it is important to ensure that it is there only as an integral part of a practical solution to a problem and is supported by data. Applied Acoustics encourages the exchange of practical experience in the following ways: • Complete Papers • Short Technical Notes • Review Articles; and thereby provides a wealth of technological information that can be used to solve related problems.
Manuscripts that address all fields of applications of acoustics ranging from medicine and NDT to the environment and buildings are welcome.