{"title":"Spatial Layout Consistency for 3D Semantic Segmentation","authors":"M. Jameela, G. Sohn","doi":"10.48550/arXiv.2303.00939","DOIUrl":null,"url":null,"abstract":"Due to the aged nature of much of the utility network infrastructure, developing a robust and trustworthy computer vision system capable of inspecting it with minimal human intervention has attracted considerable research attention. The airborne laser terrain mapping (ALTM) system quickly becomes the central data collection system among the numerous available sensors. Its ability to penetrate foliage with high-powered energy provides wide coverage and achieves survey-grade ranging accuracy. However, the post-data acquisition process for classifying the ALTM's dense and irregular point clouds is a critical bottleneck that must be addressed to improve efficiency and accuracy. We introduce a novel deep convolutional neural network (DCNN) technique for achieving voxel-based semantic segmentation of the ALTM's point clouds. The suggested deep learning method, Semantic Utility Network (SUNet) is a multi-dimensional and multi-resolution network. SUNet combines two networks: one classifies point clouds at multi-resolution with object categories in three dimensions and another predicts two-dimensional regional labels distinguishing corridor regions from non-corridors. A significant innovation of the SUNet is that it imposes spatial layout consistency on the outcomes of voxel-based and regional segmentation results. The proposed multi-dimensional DCNN combines hierarchical context for spatial layout embedding with a coarse-to-fine strategy. We conducted a comprehensive ablation study to test SUNet's performance using 67 km x 67 km of utility corridor data at a density of 5pp/m2. Our experiments demonstrated that SUNet's spatial layout consistency and a multi-resolution feature aggregation could significantly improve performance, outperforming the SOTA baseline network and achieving a good F1 score for pylon 89%, ground 99%, vegetation 99% and powerline 98% classes.","PeriodicalId":391161,"journal":{"name":"ICPR Workshops","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICPR Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2303.00939","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Due to the aged nature of much of the utility network infrastructure, developing a robust and trustworthy computer vision system capable of inspecting it with minimal human intervention has attracted considerable research attention. The airborne laser terrain mapping (ALTM) system quickly becomes the central data collection system among the numerous available sensors. Its ability to penetrate foliage with high-powered energy provides wide coverage and achieves survey-grade ranging accuracy. However, the post-data acquisition process for classifying the ALTM's dense and irregular point clouds is a critical bottleneck that must be addressed to improve efficiency and accuracy. We introduce a novel deep convolutional neural network (DCNN) technique for achieving voxel-based semantic segmentation of the ALTM's point clouds. The suggested deep learning method, Semantic Utility Network (SUNet) is a multi-dimensional and multi-resolution network. SUNet combines two networks: one classifies point clouds at multi-resolution with object categories in three dimensions and another predicts two-dimensional regional labels distinguishing corridor regions from non-corridors. A significant innovation of the SUNet is that it imposes spatial layout consistency on the outcomes of voxel-based and regional segmentation results. The proposed multi-dimensional DCNN combines hierarchical context for spatial layout embedding with a coarse-to-fine strategy. We conducted a comprehensive ablation study to test SUNet's performance using 67 km x 67 km of utility corridor data at a density of 5pp/m2. Our experiments demonstrated that SUNet's spatial layout consistency and a multi-resolution feature aggregation could significantly improve performance, outperforming the SOTA baseline network and achieving a good F1 score for pylon 89%, ground 99%, vegetation 99% and powerline 98% classes.
由于许多公用事业网络基础设施的老化性质,开发一种强大而可靠的计算机视觉系统,能够以最小的人为干预对其进行检查,已经引起了相当大的研究关注。机载激光地形测绘(ALTM)系统迅速成为众多可用传感器中的中心数据采集系统。它能够以高功率能量穿透树叶,提供广泛的覆盖范围,并达到测量级的测距精度。然而,对ALTM密集和不规则点云进行分类的数据采集后过程是提高效率和准确性必须解决的关键瓶颈。我们提出了一种新的深度卷积神经网络(DCNN)技术来实现基于体素的ALTM点云语义分割。本文提出的深度学习方法语义效用网络(SUNet)是一个多维、多分辨率的网络。SUNet结合了两种网络:一种是用三维物体类别对多分辨率点云进行分类,另一种是预测区分走廊区域和非走廊区域的二维区域标签。SUNet的一个重要创新是它对基于体素和区域分割结果的结果施加了空间布局一致性。本文提出的多维DCNN结合了层次上下文的空间布局嵌入和从粗到精的策略。我们进行了一项全面的消融研究,使用密度为5pp/m2的67 km x 67 km公用事业走廊数据来测试SUNet的性能。我们的实验表明,SUNet的空间布局一致性和多分辨率特征聚合可以显著提高性能,优于SOTA基线网络,并在塔架89%,地面99%,植被99%和电力线98%的类别中获得良好的F1分数。