Drivable path detection for a mobile robot with differential drive using a deep Learning based segmentation method for indoor navigation.

IF 2.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science Pub Date : 2024-11-19 eCollection Date: 2024-01-01 DOI:10.7717/peerj-cs.2514

Oğuz Mısır

{"title":"Drivable path detection for a mobile robot with differential drive using a deep Learning based segmentation method for indoor navigation.","authors":"Oğuz Mısır","doi":"10.7717/peerj-cs.2514","DOIUrl":null,"url":null,"abstract":"<p><p>The integration of artificial intelligence into the field of robotics enables robots to perform their tasks more meaningfully. In particular, deep-learning methods contribute significantly to robots becoming intelligent cybernetic systems. The effective use of deep-learning mobile cyber-physical systems has enabled mobile robots to become more intelligent. This effective use of deep learning can also help mobile robots determine a safe path. The drivable pathfinding problem involves a mobile robot finding the path to a target in a challenging environment with obstacles. In this paper, a semantic-segmentation-based drivable path detection method is presented for use in the indoor navigation of mobile robots. The proposed method uses a perspective transformation strategy based on transforming high-accuracy segmented images into real-world space. This transformation enables the motion space to be divided into grids, based on the image perceived in a real-world space. A grid-based RRT* navigation strategy was developed that uses images divided into grids to enable the mobile robot to avoid obstacles and meet the optimal path requirements. Smoothing was performed to improve the path planning of the grid-based RRT* and avoid unnecessary turning angles of the mobile robot. Thus, the mobile robot could reach the target in an optimum manner in the drivable area determined by segmentation. Deeplabv3+ and ResNet50 backbone architecture with superior segmentation ability are proposed for accurate determination of drivable path. Gaussian filter was used to reduce the noise caused by segmentation. In addition, multi-otsu thresholding was used to improve the masked images in multiple classes. The segmentation model and backbone architecture were compared in terms of their performance using different methods. DeepLabv3+ and ResNet50 backbone architectures outperformed the other compared methods by 0.21%-4.18% on many metrics. In addition, a mobile robot design is presented to test the proposed drivable path determination method. This design validates the proposed method by using different scenarios in an indoor environment.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2514"},"PeriodicalIF":2.5000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11639217/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PeerJ Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.7717/peerj-cs.2514","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The integration of artificial intelligence into the field of robotics enables robots to perform their tasks more meaningfully. In particular, deep-learning methods contribute significantly to robots becoming intelligent cybernetic systems. The effective use of deep-learning mobile cyber-physical systems has enabled mobile robots to become more intelligent. This effective use of deep learning can also help mobile robots determine a safe path. The drivable pathfinding problem involves a mobile robot finding the path to a target in a challenging environment with obstacles. In this paper, a semantic-segmentation-based drivable path detection method is presented for use in the indoor navigation of mobile robots. The proposed method uses a perspective transformation strategy based on transforming high-accuracy segmented images into real-world space. This transformation enables the motion space to be divided into grids, based on the image perceived in a real-world space. A grid-based RRT* navigation strategy was developed that uses images divided into grids to enable the mobile robot to avoid obstacles and meet the optimal path requirements. Smoothing was performed to improve the path planning of the grid-based RRT* and avoid unnecessary turning angles of the mobile robot. Thus, the mobile robot could reach the target in an optimum manner in the drivable area determined by segmentation. Deeplabv3+ and ResNet50 backbone architecture with superior segmentation ability are proposed for accurate determination of drivable path. Gaussian filter was used to reduce the noise caused by segmentation. In addition, multi-otsu thresholding was used to improve the masked images in multiple classes. The segmentation model and backbone architecture were compared in terms of their performance using different methods. DeepLabv3+ and ResNet50 backbone architectures outperformed the other compared methods by 0.21%-4.18% on many metrics. In addition, a mobile robot design is presented to test the proposed drivable path determination method. This design validates the proposed method by using different scenarios in an indoor environment.

查看原文本刊更多论文

使用基于深度学习的室内导航分割方法，为带差分驱动的移动机器人检测可驾驶路径。

将人工智能融入机器人学领域，能让机器人更有意义地执行任务。其中，深度学习方法对机器人成为智能控制论系统做出了重大贡献。深度学习移动网络物理系统的有效使用使移动机器人变得更加智能。深度学习的有效利用还能帮助移动机器人确定安全路径。可驾驶寻路问题涉及移动机器人在充满障碍的挑战环境中寻找通往目标的路径。本文提出了一种基于语义分割的可驾驶路径检测方法，用于移动机器人的室内导航。所提出的方法采用透视转换策略，将高精度的分割图像转换到真实世界空间。通过这种转换，可以根据在真实世界空间中感知到的图像，将运动空间划分为网格。我们开发了一种基于网格的 RRT* 导航策略，该策略使用划分为网格的图像，使移动机器人能够避开障碍物并满足最佳路径要求。为了改善基于网格的 RRT* 的路径规划，避免移动机器人不必要的转弯角度，对图像进行了平滑处理。这样，移动机器人就能在分割确定的可驾驶区域内以最佳方式到达目标。为精确确定可驾驶路径，提出了具有卓越分割能力的 Deeplabv3+ 和 ResNet50 骨干架构。使用高斯滤波器来降低分割产生的噪声。此外，还使用了多阈值技术来改进多类图像中的屏蔽图像。使用不同的方法对分割模型和骨干架构的性能进行了比较。DeepLabv3+和ResNet50骨干架构在许多指标上都优于其他比较方法0.21%-4.18%。此外，还介绍了一个移动机器人设计，以测试所提出的可驾驶路径确定方法。该设计通过使用室内环境中的不同场景验证了所提出的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

PeerJ Computer Science Computer Science-General Computer Science

CiteScore

6.10

自引率

5.30%

发文量

332

审稿时长

10 weeks

期刊介绍： PeerJ Computer Science is the new open access journal covering all subject areas in computer science, with the backing of a prestigious advisory board and more than 300 academic editors.