Weikang Yang , Xinghao Lu , Binjie Chen , Chenlu Lin , Xueye Bao , Weiquan Liu , Yu Zang , Junyu Xu , Cheng Wang
{"title":"DeLA: An extremely faster network with decoupled local aggregation for large scale point cloud learning","authors":"Weikang Yang , Xinghao Lu , Binjie Chen , Chenlu Lin , Xueye Bao , Weiquan Liu , Yu Zang , Junyu Xu , Cheng Wang","doi":"10.1016/j.jag.2024.104255","DOIUrl":null,"url":null,"abstract":"<div><div>With advances in data collection technology, the volume of recent remote sensing point cloud datasets has grown significantly, posing substantial challenges for point cloud deep learning, particularly in neighborhood aggregation operations. Unlike simple pooling, neighborhood aggregation incorporates spatial relationships between points into the feature aggregation process, requiring repeated relationship learning and resulting in substantial computational redundancy. The exponential increase in data volume exacerbates this issue. To address this, we theoretically demonstrate that if basic spatial information is encoded in point features, simple pooling operations can effectively aggregate features. This means the spatial relationships can be extracted and integrated with other features during aggregation. Based on this concept, we propose a lightweight point network called DeLA (Decoupled Local Aggregation). DeLA separates the traditional neighborhood aggregation process into distinct spatial encoding and local aggregation operations, reducing the computational complexity by a factor of K, where K is the number of neighbors in the K-Nearest Neighbor algorithm (K-NN). Experimental results on five classic benchmarks show that DeLA achieves state-of-the-art performance with reduced or equivalent latency. Specifically, DeLA exceeds 90% overall accuracy on ScanObjectNN and 74% mIoU on S3DIS Area 5. Additionally, DeLA achieves state-of-the-art results on ScanNetV2 with only 20% of the parameters of equivalent models. Our code is available at <span><span>https://github.com/Matrix-ASC/DeLA</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"135 ","pages":"Article 104255"},"PeriodicalIF":7.6000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569843224006113","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
引用次数: 0
Abstract
With advances in data collection technology, the volume of recent remote sensing point cloud datasets has grown significantly, posing substantial challenges for point cloud deep learning, particularly in neighborhood aggregation operations. Unlike simple pooling, neighborhood aggregation incorporates spatial relationships between points into the feature aggregation process, requiring repeated relationship learning and resulting in substantial computational redundancy. The exponential increase in data volume exacerbates this issue. To address this, we theoretically demonstrate that if basic spatial information is encoded in point features, simple pooling operations can effectively aggregate features. This means the spatial relationships can be extracted and integrated with other features during aggregation. Based on this concept, we propose a lightweight point network called DeLA (Decoupled Local Aggregation). DeLA separates the traditional neighborhood aggregation process into distinct spatial encoding and local aggregation operations, reducing the computational complexity by a factor of K, where K is the number of neighbors in the K-Nearest Neighbor algorithm (K-NN). Experimental results on five classic benchmarks show that DeLA achieves state-of-the-art performance with reduced or equivalent latency. Specifically, DeLA exceeds 90% overall accuracy on ScanObjectNN and 74% mIoU on S3DIS Area 5. Additionally, DeLA achieves state-of-the-art results on ScanNetV2 with only 20% of the parameters of equivalent models. Our code is available at https://github.com/Matrix-ASC/DeLA.
期刊介绍:
The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.