Automated AI-Based Annotation Framework for 3D Object Detection from LIDAR Data in Industrial Areas

SAE Technical Paper Series Pub Date : 2024-07-02 DOI:10.4271/2024-01-2999

Gina Abdelhalim, Kevin Simon, Robert Bensch, Sai Parimi, Bilal Ahmed Qureshi

{"title":"Automated AI-Based Annotation Framework for 3D Object Detection from LIDAR Data in Industrial Areas","authors":"Gina Abdelhalim, Kevin Simon, Robert Bensch, Sai Parimi, Bilal Ahmed Qureshi","doi":"10.4271/2024-01-2999","DOIUrl":null,"url":null,"abstract":"Autonomous Driving is used in various settings, including indoor areas such as industrial halls and warehouses. For perception in these environments, LIDAR is currently very popular due to its high accuracy compared to RADAR and its robustness to varying lighting conditions compared to cameras. However, there is a notable lack of freely available labeled LIDAR data in these settings, and most public datasets, such as KITTI and Waymo, focus on public road scenarios. As a result, specialized publicly available annotation frameworks are rare as well. This work tackles these shortcomings by developing an automated AI-based labeling tool to generate a LIDAR dataset with 3D ground truth annotations for industrial warehouse scenarios. The base pipeline for the annotation framework first upsamples the incoming 16-channel data into dense 64-channel data. The upsampled data is then manually annotated for the defined classes and this annotated 64-channel dataset is used to fine-tune the Part-A2-Net that has been pretrained on the KITTI dataset. This fine-tuned network shows promising results for the defined classes. To overcome some shortcomings with this pipeline, which mainly involves artefacts from upsampling and manual labeling, we extend the pipeline to make use of SLAM to generate the dense point cloud and use the generated poses to speed up the labeling process. The progression, therefore shows the three generations of the framework which started with manual upsampling and labeling. This then was extended to a semi-automated approach with automatic generation of dense map using SLAM and automatic annotation propagation to all the scans for all static classes and then the complete automatic pipeline that generates ground truth using the Part-A2-Net which was trained using the dataset generated from the manual and semi-automated pipelines. The dataset generated for this warehouse environment will continuously be extended and is publicly available at https://github.com/anavsgmbh/lidar-warehouse-dataset.","PeriodicalId":510086,"journal":{"name":"SAE Technical Paper Series","volume":"55 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SAE Technical Paper Series","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4271/2024-01-2999","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Autonomous Driving is used in various settings, including indoor areas such as industrial halls and warehouses. For perception in these environments, LIDAR is currently very popular due to its high accuracy compared to RADAR and its robustness to varying lighting conditions compared to cameras. However, there is a notable lack of freely available labeled LIDAR data in these settings, and most public datasets, such as KITTI and Waymo, focus on public road scenarios. As a result, specialized publicly available annotation frameworks are rare as well. This work tackles these shortcomings by developing an automated AI-based labeling tool to generate a LIDAR dataset with 3D ground truth annotations for industrial warehouse scenarios. The base pipeline for the annotation framework first upsamples the incoming 16-channel data into dense 64-channel data. The upsampled data is then manually annotated for the defined classes and this annotated 64-channel dataset is used to fine-tune the Part-A2-Net that has been pretrained on the KITTI dataset. This fine-tuned network shows promising results for the defined classes. To overcome some shortcomings with this pipeline, which mainly involves artefacts from upsampling and manual labeling, we extend the pipeline to make use of SLAM to generate the dense point cloud and use the generated poses to speed up the labeling process. The progression, therefore shows the three generations of the framework which started with manual upsampling and labeling. This then was extended to a semi-automated approach with automatic generation of dense map using SLAM and automatic annotation propagation to all the scans for all static classes and then the complete automatic pipeline that generates ground truth using the Part-A2-Net which was trained using the dataset generated from the manual and semi-automated pipelines. The dataset generated for this warehouse environment will continuously be extended and is publicly available at https://github.com/anavsgmbh/lidar-warehouse-dataset.

查看原文本刊更多论文

基于人工智能的自动注释框架，用于从工业区激光雷达数据中检测 3D 物体

自动驾驶应用于各种环境，包括工业厂房和仓库等室内区域。对于这些环境中的感知，激光雷达目前非常流行，因为与雷达相比，激光雷达精度高，与摄像头相比，激光雷达对不同光照条件的适应性强。然而，在这些环境中明显缺乏免费提供的标注激光雷达数据，大多数公共数据集（如 KITTI 和 Waymo）都集中在公共道路场景。因此，专门的公开标注框架也很少见。这项工作通过开发一种基于人工智能的自动标注工具来解决这些缺陷，该工具可生成带有工业仓库场景三维地面实况标注的激光雷达数据集。标注框架的基本管道首先将输入的 16 通道数据升采样为密集的 64 通道数据。然后，针对定义的类别对上采样数据进行人工标注，并使用这个标注过的 64 通道数据集对在 KITTI 数据集上预训练过的 Part-A2 网络进行微调。这一经过微调的网络在所定义的类别中显示出了良好的效果。为了克服这一流程的一些不足之处（主要涉及上采样和人工标注所产生的人工痕迹），我们扩展了这一流程，利用 SLAM 生成密集点云，并利用生成的姿势来加快标注过程。因此，这一进展显示了该框架的三代发展历程。然后扩展到半自动方法，使用 SLAM 自动生成密集地图，并自动将注释传播到所有静态类别的所有扫描中，然后使用 Part-A2-Net 生成完整的自动流水线，该流水线使用手动和半自动流水线生成的数据集进行训练，从而生成地面实况。为该仓库环境生成的数据集将不断扩展，并可在 https://github.com/anavsgmbh/lidar-warehouse-dataset 上公开获取。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

SAE Technical Paper Series

自引率

0.00%

发文量