小而强大：利用 U-Next 框架加强三维点云语义分割

IF 8.6 Q1 REMOTE SENSING

International journal of applied earth observation and geoinformation : ITC journal Pub Date : 2025-02-01 DOI:10.1016/j.jag.2024.104309

Ziyin Zeng , Qingyong Hu , Zhong Xie , Bijun Li , Jian Zhou , Yongyang Xu

{"title":"小而强大：利用 U-Next 框架加强三维点云语义分割","authors":"Ziyin Zeng , Qingyong Hu , Zhong Xie , Bijun Li , Jian Zhou , Yongyang Xu","doi":"10.1016/j.jag.2024.104309","DOIUrl":null,"url":null,"abstract":"<div><div>We investigate the problem of 3D point clouds semantic segmentation. Recently, a large amount of research work has focused on local feature aggregation. However, the foundational framework of semantic segmentation of 3D point clouds has been neglected, where the majority of current methods default to the U-Net framework. In this study, we present U-Next, a small but mighty framework designed specifically for point cloud semantic segmentation. The key innovation of this framework is to capture multi-scale hierarchical features. Specifically, we construct the U-Next by stacking multiple U-Net <span><math><msup><mrow><mi>L</mi></mrow><mrow><mn>1</mn></mrow></msup></math></span> sub-networks in a dense arrangement to diminish the semantic gap. Concurrently, it integrates feature maps across various scales to proficiently restore intricate fine-grained details. Additionally, a multi-level deep supervision mechanism is introduced for smoothing gradient propagation and facilitating network optimization. We conduct extensive experiments on benchmarks, including the indoor S3DIS dataset, the LiDAR-based outdoor Toronto3D dataset, and the urban-scale photogrammetry-based SensatUrban dataset, demonstrate the superiority of U-Next. The U-Next framework consistently exhibits significant performance enhancements across various benchmarks and baselines, demonstrating its considerable potential as a versatile point-based framework for future endeavors. The code has been released at <span><span>https://github.com/zeng-ziyin/U-Next</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"136 ","pages":"Article 104309"},"PeriodicalIF":8.6000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Small but mighty: Enhancing 3D point clouds semantic segmentation with U-Next framework\",\"authors\":\"Ziyin Zeng , Qingyong Hu , Zhong Xie , Bijun Li , Jian Zhou , Yongyang Xu\",\"doi\":\"10.1016/j.jag.2024.104309\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>We investigate the problem of 3D point clouds semantic segmentation. Recently, a large amount of research work has focused on local feature aggregation. However, the foundational framework of semantic segmentation of 3D point clouds has been neglected, where the majority of current methods default to the U-Net framework. In this study, we present U-Next, a small but mighty framework designed specifically for point cloud semantic segmentation. The key innovation of this framework is to capture multi-scale hierarchical features. Specifically, we construct the U-Next by stacking multiple U-Net <span><math><msup><mrow><mi>L</mi></mrow><mrow><mn>1</mn></mrow></msup></math></span> sub-networks in a dense arrangement to diminish the semantic gap. Concurrently, it integrates feature maps across various scales to proficiently restore intricate fine-grained details. Additionally, a multi-level deep supervision mechanism is introduced for smoothing gradient propagation and facilitating network optimization. We conduct extensive experiments on benchmarks, including the indoor S3DIS dataset, the LiDAR-based outdoor Toronto3D dataset, and the urban-scale photogrammetry-based SensatUrban dataset, demonstrate the superiority of U-Next. The U-Next framework consistently exhibits significant performance enhancements across various benchmarks and baselines, demonstrating its considerable potential as a versatile point-based framework for future endeavors. The code has been released at <span><span>https://github.com/zeng-ziyin/U-Next</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":73423,\"journal\":{\"name\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"volume\":\"136 \",\"pages\":\"Article 104309\"},\"PeriodicalIF\":8.6000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1569843224006678\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"REMOTE SENSING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569843224006678","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}

引用次数: 0

摘要

研究了三维点云的语义分割问题。近年来，大量的研究工作集中在局部特征聚合上。然而，三维点云语义分割的基本框架一直被忽视，目前大多数方法都默认使用U-Net框架。在这项研究中，我们提出了U-Next，一个专门为点云语义分割设计的小而强大的框架。该框架的关键创新在于捕获了多尺度的层次特征。具体来说，我们通过密集地堆叠多个U-Net L1子网来构建U-Next，以减小语义缺口。同时，它集成了各种尺度的特征图，以熟练地恢复复杂的细粒度细节。此外，引入多层次深度监督机制平滑梯度传播，促进网络优化。我们在基准测试上进行了大量实验，包括室内S3DIS数据集、基于lidar的室外Toronto3D数据集和基于城市尺度摄影测量的SensatUrban数据集，证明了U-Next的优势。U-Next框架在各种基准测试和基准测试中始终表现出显著的性能增强，显示出其作为未来努力的通用点基础框架的巨大潜力。该代码已在https://github.com/zeng-ziyin/U-Next上发布。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Small but mighty: Enhancing 3D point clouds semantic segmentation with U-Next framework

We investigate the problem of 3D point clouds semantic segmentation. Recently, a large amount of research work has focused on local feature aggregation. However, the foundational framework of semantic segmentation of 3D point clouds has been neglected, where the majority of current methods default to the U-Net framework. In this study, we present U-Next, a small but mighty framework designed specifically for point cloud semantic segmentation. The key innovation of this framework is to capture multi-scale hierarchical features. Specifically, we construct the U-Next by stacking multiple U-Net

L^{1}

sub-networks in a dense arrangement to diminish the semantic gap. Concurrently, it integrates feature maps across various scales to proficiently restore intricate fine-grained details. Additionally, a multi-level deep supervision mechanism is introduced for smoothing gradient propagation and facilitating network optimization. We conduct extensive experiments on benchmarks, including the indoor S3DIS dataset, the LiDAR-based outdoor Toronto3D dataset, and the urban-scale photogrammetry-based SensatUrban dataset, demonstrate the superiority of U-Next. The U-Next framework consistently exhibits significant performance enhancements across various benchmarks and baselines, demonstrating its considerable potential as a versatile point-based framework for future endeavors. The code has been released at https://github.com/zeng-ziyin/U-Next.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International journal of applied earth observation and geoinformation : ITC journal Global and Planetary Change, Management, Monitoring, Policy and Law, Earth-Surface Processes, Computers in Earth Sciences

CiteScore

12.00

自引率

0.00%

发文量

审稿时长

77 days

期刊介绍： The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.