Jean Baptiste Tary, Sergio F Poveda, Ka Lok Li, Christine Peirce, Richard W Hobbs, Carlos Alberto Vargas
{"title":"Detection and localization of Bryde's whale calls using machine learning and probabilistic back-projection.","authors":"Jean Baptiste Tary, Sergio F Poveda, Ka Lok Li, Christine Peirce, Richard W Hobbs, Carlos Alberto Vargas","doi":"10.1121/10.0039043","DOIUrl":null,"url":null,"abstract":"<p><p>Passive acoustic monitoring can inform our understanding of baleen whale behavior by recording and analyzing their vocalizations. Two crucial factors in the analysis of whale calls are their detection and localization. In this study, we first develop a machine learning method to detect Bryde's whale calls observed by ocean-bottom instruments deployed in the Panama basin, and back-project the detected events to determine their localizations. Using previously identified Bryde's whale calls, we apply data augmentation strategies to increase the size of our training dataset to ultimately obtain 890 214 training examples. Using an evaluation dataset, we determine which detection thresholds optimize false positives and negatives and apply these to continuously recorded hydrophone data. The detection resulted in 4514 potential events, of which 899 were recorded by at least three instruments. The waveforms of these events were automatically extracted, cross correlation probability envelopes were computed between hydrophones, and these were finally back-projected onto a 3D grid to obtain final event localizations. For this network, this procedure is shown to be robust to high noise levels, random time errors, and systematic bias introduced by the velocity model. This approach has further advantages, such as being computationally efficient and requiring minimal manual intervention.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"158 2","pages":"1386-1397"},"PeriodicalIF":2.3000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Acoustical Society of America","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1121/10.0039043","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Passive acoustic monitoring can inform our understanding of baleen whale behavior by recording and analyzing their vocalizations. Two crucial factors in the analysis of whale calls are their detection and localization. In this study, we first develop a machine learning method to detect Bryde's whale calls observed by ocean-bottom instruments deployed in the Panama basin, and back-project the detected events to determine their localizations. Using previously identified Bryde's whale calls, we apply data augmentation strategies to increase the size of our training dataset to ultimately obtain 890 214 training examples. Using an evaluation dataset, we determine which detection thresholds optimize false positives and negatives and apply these to continuously recorded hydrophone data. The detection resulted in 4514 potential events, of which 899 were recorded by at least three instruments. The waveforms of these events were automatically extracted, cross correlation probability envelopes were computed between hydrophones, and these were finally back-projected onto a 3D grid to obtain final event localizations. For this network, this procedure is shown to be robust to high noise levels, random time errors, and systematic bias introduced by the velocity model. This approach has further advantages, such as being computationally efficient and requiring minimal manual intervention.
期刊介绍:
Since 1929 The Journal of the Acoustical Society of America has been the leading source of theoretical and experimental research results in the broad interdisciplinary study of sound. Subject coverage includes: linear and nonlinear acoustics; aeroacoustics, underwater sound and acoustical oceanography; ultrasonics and quantum acoustics; architectural and structural acoustics and vibration; speech, music and noise; psychology and physiology of hearing; engineering acoustics, transduction; bioacoustics, animal bioacoustics.