用于算法开发和分析的高质量水声数据集。

IF 6.9 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Victor Lobo, Nuno Pessanha Santos, Ricardo Moura
{"title":"用于算法开发和分析的高质量水声数据集。","authors":"Victor Lobo, Nuno Pessanha Santos, Ricardo Moura","doi":"10.1038/s41597-025-05564-x","DOIUrl":null,"url":null,"abstract":"<p><p>As data becomes increasingly available, relying on quality datasets for algorithm analysis and development is essential. However, data gathering can be expensive and time-consuming, and this process must be optimized to allow others to reuse data with simplicity and accuracy. The Wolfset is an acoustic dataset gathered using a Bruel & Kjaer type 8104 hydrophone in an anechoic tank usually used for ships' sonar calibration. The name Wolfset is inspired by the Seawolf submarine class, renowned for its advanced sound source detection and classification capabilities. Using an anechoic tank, we can obtain a high-quality dataset representing acoustic sources without undesired external perturbations. In many operating conditions, several outboard motors and an electric motor from a basic remotely controlled ship model were used as sound sources, usually called targets. Then, external transients and noise sources were added to approximate the dataset to the sounds present in real-world conditions. This dataset uses a systematic approach to demonstrate the diversity and accuracy needed for effective algorithm development.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1323"},"PeriodicalIF":6.9000,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12311032/pdf/","citationCount":"0","resultStr":"{\"title\":\"A High-Quality Underwater Acoustic Dataset for Algorithm Development and Analysis.\",\"authors\":\"Victor Lobo, Nuno Pessanha Santos, Ricardo Moura\",\"doi\":\"10.1038/s41597-025-05564-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>As data becomes increasingly available, relying on quality datasets for algorithm analysis and development is essential. However, data gathering can be expensive and time-consuming, and this process must be optimized to allow others to reuse data with simplicity and accuracy. The Wolfset is an acoustic dataset gathered using a Bruel & Kjaer type 8104 hydrophone in an anechoic tank usually used for ships' sonar calibration. The name Wolfset is inspired by the Seawolf submarine class, renowned for its advanced sound source detection and classification capabilities. Using an anechoic tank, we can obtain a high-quality dataset representing acoustic sources without undesired external perturbations. In many operating conditions, several outboard motors and an electric motor from a basic remotely controlled ship model were used as sound sources, usually called targets. Then, external transients and noise sources were added to approximate the dataset to the sounds present in real-world conditions. This dataset uses a systematic approach to demonstrate the diversity and accuracy needed for effective algorithm development.</p>\",\"PeriodicalId\":21597,\"journal\":{\"name\":\"Scientific Data\",\"volume\":\"12 1\",\"pages\":\"1323\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2025-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12311032/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Data\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41597-025-05564-x\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Data","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41597-025-05564-x","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

随着数据变得越来越可用,依靠高质量的数据集进行算法分析和开发是必不可少的。然而,数据收集可能既昂贵又耗时,必须对该过程进行优化,以允许其他人简单而准确地重用数据。Wolfset是使用Bruel & Kjaer型8104型水听器收集的声学数据集,通常用于船舶声纳校准。沃尔夫塞特这个名字的灵感来自海狼级潜艇,以其先进的声源探测和分类能力而闻名。使用消声槽,我们可以获得一个高质量的数据集,表示没有不希望的外部扰动的声源。在许多操作条件下,几个舷外发动机和一个来自基本遥控船舶模型的电动机被用作声源,通常称为目标。然后,将外部瞬态和噪声源添加到数据集中,使其近似于现实世界中存在的声音。该数据集使用系统的方法来展示有效算法开发所需的多样性和准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A High-Quality Underwater Acoustic Dataset for Algorithm Development and Analysis.

As data becomes increasingly available, relying on quality datasets for algorithm analysis and development is essential. However, data gathering can be expensive and time-consuming, and this process must be optimized to allow others to reuse data with simplicity and accuracy. The Wolfset is an acoustic dataset gathered using a Bruel & Kjaer type 8104 hydrophone in an anechoic tank usually used for ships' sonar calibration. The name Wolfset is inspired by the Seawolf submarine class, renowned for its advanced sound source detection and classification capabilities. Using an anechoic tank, we can obtain a high-quality dataset representing acoustic sources without undesired external perturbations. In many operating conditions, several outboard motors and an electric motor from a basic remotely controlled ship model were used as sound sources, usually called targets. Then, external transients and noise sources were added to approximate the dataset to the sounds present in real-world conditions. This dataset uses a systematic approach to demonstrate the diversity and accuracy needed for effective algorithm development.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Scientific Data
Scientific Data Social Sciences-Education
CiteScore
11.20
自引率
4.10%
发文量
689
审稿时长
16 weeks
期刊介绍: Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data. The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信