Deep reinforcement learning for time-critical wilderness search and rescue using drones.

IF 2.9 Q2 ROBOTICS

Frontiers in Robotics and AI Pub Date : 2025-02-03 eCollection Date: 2024-01-01 DOI:10.3389/frobt.2024.1527095

Jan-Hendrik Ewers, David Anderson, Douglas Thomson

{"title":"Deep reinforcement learning for time-critical wilderness search and rescue using drones.","authors":"Jan-Hendrik Ewers, David Anderson, Douglas Thomson","doi":"10.3389/frobt.2024.1527095","DOIUrl":null,"url":null,"abstract":"Traditional search and rescue methods in wilderness areas can be time-consuming and have limited coverage. Drones offer a faster and more flexible solution, but optimizing their search paths is crucial for effective operations. This paper proposes a novel algorithm using deep reinforcement learning to create efficient search paths for drones in wilderness environments. Our approach leverages a priori data about the search area and the missing person in the form of a probability distribution map. This allows the policy to learn optimal flight paths that maximize the probability of finding the missing person quickly. Experimental results show that our method achieves a significant improvement in search times compared to traditional coverage planning and search planning algorithms by over <math><mrow><mn>160</mn> <mi>%</mi></mrow> </math> , a difference that can mean life or death in real-world search operations Additionally, unlike previous work, our approach incorporates a continuous action space enabled by cubature, allowing for more nuanced flight patterns.","PeriodicalId":47597,"journal":{"name":"Frontiers in Robotics and AI","volume":"11 ","pages":"1527095"},"PeriodicalIF":2.9000,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11831046/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Robotics and AI","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frobt.2024.1527095","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ROBOTICS","Score":null,"Total":0}

引用次数: 0

Abstract

Traditional search and rescue methods in wilderness areas can be time-consuming and have limited coverage. Drones offer a faster and more flexible solution, but optimizing their search paths is crucial for effective operations. This paper proposes a novel algorithm using deep reinforcement learning to create efficient search paths for drones in wilderness environments. Our approach leverages a priori data about the search area and the missing person in the form of a probability distribution map. This allows the policy to learn optimal flight paths that maximize the probability of finding the missing person quickly. Experimental results show that our method achieves a significant improvement in search times compared to traditional coverage planning and search planning algorithms by over $160 %$ , a difference that can mean life or death in real-world search operations Additionally, unlike previous work, our approach incorporates a continuous action space enabled by cubature, allowing for more nuanced flight patterns.

查看原文本刊更多论文

使用无人机进行时间紧迫的野外搜索和救援的深度强化学习。

在荒野地区，传统的搜索和救援方法既耗时又覆盖范围有限。无人机提供了一个更快、更灵活的解决方案，但优化它们的搜索路径对于有效的行动至关重要。本文提出了一种新的算法，利用深度强化学习为无人机在荒野环境中创建有效的搜索路径。我们的方法以概率分布图的形式利用了关于搜索区域和失踪者的先验数据。这使得该策略能够学习最佳飞行路径，从而最大限度地快速找到失踪者。实验结果表明，与传统的覆盖规划和搜索规划算法相比，我们的方法在搜索时间上实现了超过160%的显著改进，这一差异在现实世界的搜索操作中可能意味着生命或死亡。此外，与以前的工作不同，我们的方法结合了一个由立方体支持的连续行动空间，允许更细微的飞行模式。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Frontiers in Robotics and AI ROBOTICS-

CiteScore

6.50

自引率

5.90%

发文量

355

审稿时长

14 weeks

期刊介绍： Frontiers in Robotics and AI publishes rigorously peer-reviewed research covering all theory and applications of robotics, technology, and artificial intelligence, from biomedical to space robotics.