Machine learning for shipwreck segmentation from side scan sonar imagery: Dataset and benchmark

The International Journal of Robotics Research Pub Date : 2024-07-21 DOI:10.1177/02783649241266853

Advaith V. Sethuraman, Anja Sheppard, Onur Bagoren, Christopher Pinnow, Jamey Anderson, Timothy C. Havens, Katherine A. Skinner

{"title":"Machine learning for shipwreck segmentation from side scan sonar imagery: Dataset and benchmark","authors":"Advaith V. Sethuraman, Anja Sheppard, Onur Bagoren, Christopher Pinnow, Jamey Anderson, Timothy C. Havens, Katherine A. Skinner","doi":"10.1177/02783649241266853","DOIUrl":null,"url":null,"abstract":"Open-source benchmark datasets have been a critical component for advancing machine learning for robot perception in terrestrial applications. Benchmark datasets enable the widespread development of state-of-the-art machine learning methods, which require large datasets for training, validation, and thorough comparison to competing approaches. Underwater environments impose several operational challenges that hinder efforts to collect large benchmark datasets for marine robot perception. Furthermore, a low abundance of targets of interest relative to the size of the search space leads to increased time and cost required to collect useful datasets for a specific task. As a result, there is limited availability of labeled benchmark datasets for underwater applications. We present the AI4Shipwrecks dataset, which consists of 28 distinct shipwrecks totaling 286 high-resolution labeled side scan sonar images to advance the state-of-the-art in autonomous sonar image understanding. We leverage the unique abundance of targets in Thunder Bay National Marine Sanctuary in Lake Huron, MI, to collect and compile a sonar imagery benchmark dataset through surveys with an autonomous underwater vehicle (AUV). We consulted with expert marine archaeologists for the labeling of robotically gathered data. We then leverage this dataset to perform benchmark experiments for comparison of state-of-the-art supervised segmentation methods, and we present insights on opportunities and open challenges for the field. The dataset and benchmarking tools will be released as an open-source benchmark dataset to spur innovation in machine learning for Great Lakes and ocean exploration. The dataset and accompanying software are available at https://umfieldrobotics.github.io/ai4shipwrecks/ .","PeriodicalId":501362,"journal":{"name":"The International Journal of Robotics Research","volume":"12 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The International Journal of Robotics Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/02783649241266853","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Open-source benchmark datasets have been a critical component for advancing machine learning for robot perception in terrestrial applications. Benchmark datasets enable the widespread development of state-of-the-art machine learning methods, which require large datasets for training, validation, and thorough comparison to competing approaches. Underwater environments impose several operational challenges that hinder efforts to collect large benchmark datasets for marine robot perception. Furthermore, a low abundance of targets of interest relative to the size of the search space leads to increased time and cost required to collect useful datasets for a specific task. As a result, there is limited availability of labeled benchmark datasets for underwater applications. We present the AI4Shipwrecks dataset, which consists of 28 distinct shipwrecks totaling 286 high-resolution labeled side scan sonar images to advance the state-of-the-art in autonomous sonar image understanding. We leverage the unique abundance of targets in Thunder Bay National Marine Sanctuary in Lake Huron, MI, to collect and compile a sonar imagery benchmark dataset through surveys with an autonomous underwater vehicle (AUV). We consulted with expert marine archaeologists for the labeling of robotically gathered data. We then leverage this dataset to perform benchmark experiments for comparison of state-of-the-art supervised segmentation methods, and we present insights on opportunities and open challenges for the field. The dataset and benchmarking tools will be released as an open-source benchmark dataset to spur innovation in machine learning for Great Lakes and ocean exploration. The dataset and accompanying software are available at https://umfieldrobotics.github.io/ai4shipwrecks/ .

查看原文本刊更多论文

利用机器学习从侧扫声纳图像中分割沉船：数据集和基准

开源基准数据集一直是推动陆地应用中机器人感知机器学习的关键组成部分。基准数据集有助于广泛开发最先进的机器学习方法，这些方法需要大型数据集进行训练、验证以及与竞争方法进行全面比较。水下环境带来了一些操作上的挑战，阻碍了为海洋机器人感知收集大型基准数据集的工作。此外，相对于搜索空间的大小而言，感兴趣的目标数量较少，这导致为特定任务收集有用数据集所需的时间和成本增加。因此，用于水下应用的标注基准数据集非常有限。我们提出了 AI4Shipwrecks 数据集，该数据集由 28 个不同的沉船组成，共包含 286 幅高分辨率标记侧扫声纳图像，从而推动了自主声纳图像理解技术的发展。我们利用密歇根州休伦湖雷霆湾国家海洋保护区独特的大量目标，通过使用自动潜航器（AUV）进行勘测，收集和编制声纳图像基准数据集。我们咨询了海洋考古专家，以对机器人收集的数据进行标注。然后，我们利用该数据集进行基准实验，对最先进的监督分割方法进行比较，并就该领域的机遇和公开挑战提出见解。该数据集和基准测试工具将作为开源基准数据集发布，以促进大湖和海洋勘探机器学习的创新。该数据集和配套软件可在 https://umfieldrobotics.github.io/ai4shipwrecks/ 上获取。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The International Journal of Robotics Research

自引率

0.00%

发文量