PointeNet:有效、高效的点云分析轻量级框架

IF 1.3 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING
Lipeng Gu , Xuefeng Yan , Liangliang Nan , Dingkun Zhu , Honghua Chen , Weiming Wang , Mingqiang Wei
{"title":"PointeNet:有效、高效的点云分析轻量级框架","authors":"Lipeng Gu ,&nbsp;Xuefeng Yan ,&nbsp;Liangliang Nan ,&nbsp;Dingkun Zhu ,&nbsp;Honghua Chen ,&nbsp;Weiming Wang ,&nbsp;Mingqiang Wei","doi":"10.1016/j.cagd.2024.102311","DOIUrl":null,"url":null,"abstract":"<div><p>The conventional wisdom in point cloud analysis predominantly explores 3D geometries. It is often achieved through the introduction of intricate learnable geometric extractors in the encoder or by deepening networks with repeated blocks. However, these methods contain a significant number of learnable parameters, resulting in substantial computational costs and imposing memory burdens on CPU/GPU. Moreover, they are primarily tailored for object-level point cloud classification and segmentation tasks, with limited extensions to crucial scene-level applications, such as autonomous driving. To this end, we introduce <strong>PointeNet</strong>, an <strong>e</strong>fficient <strong>net</strong>work designed specifically for <strong>point</strong> cloud analysis. PointeNet distinguishes itself with its lightweight architecture, low training cost, and plug-and-play capability, while also effectively capturing representative features. The network consists of a Multivariate Geometric Encoding (MGE) module and an <em>optional</em> Distance-aware Semantic Enhancement (DSE) module. MGE employs operations of sampling, grouping, pooling, and multivariate geometric aggregation to lightweightly capture and adaptively aggregate multivariate geometric features, providing a comprehensive depiction of 3D geometries. DSE, designed for real-world autonomous driving scenarios, enhances the semantic perception of point clouds, particularly for distant points. Our method demonstrates flexibility by seamlessly integrating with a classification/segmentation head or embedding into off-the-shelf 3D object detection networks, achieving notable performance improvements at a minimal cost. Extensive experiments on object-level datasets, including ModelNet40, ScanObjectNN, ShapeNetPart, and the scene-level dataset KITTI, demonstrate the superior performance of PointeNet over state-of-the-art methods in point cloud analysis. Notably, PointeNet outperforms PointMLP with significantly fewer parameters on ModelNet40, ScanObjectNN, and ShapeNetPart, and achieves a substantial improvement of over 2% in <span><math><mn>3</mn><mi>D</mi><mspace></mspace><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mi>R</mi><mn>40</mn></mrow></msub></math></span> for PointRCNN on KITTI with a minimal parameter cost of 1.4 million. Code is publicly available at <span>https://github.com/lipeng-gu/PointeNet</span><svg><path></path></svg>.</p></div>","PeriodicalId":55226,"journal":{"name":"Computer Aided Geometric Design","volume":"110 ","pages":"Article 102311"},"PeriodicalIF":1.3000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PointeNet: A lightweight framework for effective and efficient point cloud analysis\",\"authors\":\"Lipeng Gu ,&nbsp;Xuefeng Yan ,&nbsp;Liangliang Nan ,&nbsp;Dingkun Zhu ,&nbsp;Honghua Chen ,&nbsp;Weiming Wang ,&nbsp;Mingqiang Wei\",\"doi\":\"10.1016/j.cagd.2024.102311\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The conventional wisdom in point cloud analysis predominantly explores 3D geometries. It is often achieved through the introduction of intricate learnable geometric extractors in the encoder or by deepening networks with repeated blocks. However, these methods contain a significant number of learnable parameters, resulting in substantial computational costs and imposing memory burdens on CPU/GPU. Moreover, they are primarily tailored for object-level point cloud classification and segmentation tasks, with limited extensions to crucial scene-level applications, such as autonomous driving. To this end, we introduce <strong>PointeNet</strong>, an <strong>e</strong>fficient <strong>net</strong>work designed specifically for <strong>point</strong> cloud analysis. PointeNet distinguishes itself with its lightweight architecture, low training cost, and plug-and-play capability, while also effectively capturing representative features. The network consists of a Multivariate Geometric Encoding (MGE) module and an <em>optional</em> Distance-aware Semantic Enhancement (DSE) module. MGE employs operations of sampling, grouping, pooling, and multivariate geometric aggregation to lightweightly capture and adaptively aggregate multivariate geometric features, providing a comprehensive depiction of 3D geometries. DSE, designed for real-world autonomous driving scenarios, enhances the semantic perception of point clouds, particularly for distant points. Our method demonstrates flexibility by seamlessly integrating with a classification/segmentation head or embedding into off-the-shelf 3D object detection networks, achieving notable performance improvements at a minimal cost. Extensive experiments on object-level datasets, including ModelNet40, ScanObjectNN, ShapeNetPart, and the scene-level dataset KITTI, demonstrate the superior performance of PointeNet over state-of-the-art methods in point cloud analysis. Notably, PointeNet outperforms PointMLP with significantly fewer parameters on ModelNet40, ScanObjectNN, and ShapeNetPart, and achieves a substantial improvement of over 2% in <span><math><mn>3</mn><mi>D</mi><mspace></mspace><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mi>R</mi><mn>40</mn></mrow></msub></math></span> for PointRCNN on KITTI with a minimal parameter cost of 1.4 million. Code is publicly available at <span>https://github.com/lipeng-gu/PointeNet</span><svg><path></path></svg>.</p></div>\",\"PeriodicalId\":55226,\"journal\":{\"name\":\"Computer Aided Geometric Design\",\"volume\":\"110 \",\"pages\":\"Article 102311\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Aided Geometric Design\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167839624000451\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Aided Geometric Design","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167839624000451","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

点云分析的传统智慧主要是探索三维几何图形。这通常是通过在编码器中引入复杂的可学习几何提取器或通过重复块深化网络来实现的。然而,这些方法包含大量可学习参数,导致大量计算成本,并对 CPU/GPU 造成内存负担。此外,这些方法主要针对对象级点云分类和分割任务,对关键场景级应用(如自动驾驶)的扩展有限。为此,我们引入了专门为点云分析设计的高效网络 PointeNet。PointeNet 以其轻量级架构、低训练成本和即插即用功能而与众不同,同时还能有效捕捉具有代表性的特征。该网络由一个多变量几何编码(MGE)模块和一个可选的距离感知语义增强(DSE)模块组成。MGE 采用采样、分组、汇集和多变量几何聚合等操作,以轻量级捕获和自适应聚合多变量几何特征,提供全面的三维几何描述。DSE 专为真实世界的自动驾驶场景而设计,可增强点云的语义感知,尤其是对远距离点的感知。我们的方法具有灵活性,可与分类/分割头无缝集成,或嵌入现成的三维物体检测网络,以最小的成本实现显著的性能提升。在对象级数据集(包括 ModelNet40、ScanObjectNN、ShapeNetPart 和场景级数据集 KITTI)上进行的大量实验表明,PointeNet 在点云分析方面的性能优于最先进的方法。值得注意的是,在 ModelNet40、ScanObjectNN 和 ShapeNetPart 上,PointeNet 用更少的参数就超越了 PointMLP;在 KITTI 上,PointRCNN 在 3DAPR40 上取得了超过 2% 的大幅提升,而参数成本仅为 140 万。代码可在 https://github.com/lipeng-gu/PointeNet 公开获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
PointeNet: A lightweight framework for effective and efficient point cloud analysis

The conventional wisdom in point cloud analysis predominantly explores 3D geometries. It is often achieved through the introduction of intricate learnable geometric extractors in the encoder or by deepening networks with repeated blocks. However, these methods contain a significant number of learnable parameters, resulting in substantial computational costs and imposing memory burdens on CPU/GPU. Moreover, they are primarily tailored for object-level point cloud classification and segmentation tasks, with limited extensions to crucial scene-level applications, such as autonomous driving. To this end, we introduce PointeNet, an efficient network designed specifically for point cloud analysis. PointeNet distinguishes itself with its lightweight architecture, low training cost, and plug-and-play capability, while also effectively capturing representative features. The network consists of a Multivariate Geometric Encoding (MGE) module and an optional Distance-aware Semantic Enhancement (DSE) module. MGE employs operations of sampling, grouping, pooling, and multivariate geometric aggregation to lightweightly capture and adaptively aggregate multivariate geometric features, providing a comprehensive depiction of 3D geometries. DSE, designed for real-world autonomous driving scenarios, enhances the semantic perception of point clouds, particularly for distant points. Our method demonstrates flexibility by seamlessly integrating with a classification/segmentation head or embedding into off-the-shelf 3D object detection networks, achieving notable performance improvements at a minimal cost. Extensive experiments on object-level datasets, including ModelNet40, ScanObjectNN, ShapeNetPart, and the scene-level dataset KITTI, demonstrate the superior performance of PointeNet over state-of-the-art methods in point cloud analysis. Notably, PointeNet outperforms PointMLP with significantly fewer parameters on ModelNet40, ScanObjectNN, and ShapeNetPart, and achieves a substantial improvement of over 2% in 3DAPR40 for PointRCNN on KITTI with a minimal parameter cost of 1.4 million. Code is publicly available at https://github.com/lipeng-gu/PointeNet.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computer Aided Geometric Design
Computer Aided Geometric Design 工程技术-计算机:软件工程
CiteScore
3.50
自引率
13.30%
发文量
57
审稿时长
60 days
期刊介绍: The journal Computer Aided Geometric Design is for researchers, scholars, and software developers dealing with mathematical and computational methods for the description of geometric objects as they arise in areas ranging from CAD/CAM to robotics and scientific visualization. The journal publishes original research papers, survey papers and with quick editorial decisions short communications of at most 3 pages. The primary objects of interest are curves, surfaces, and volumes such as splines (NURBS), meshes, subdivision surfaces as well as algorithms to generate, analyze, and manipulate them. This journal will report on new developments in CAGD and its applications, including but not restricted to the following: -Mathematical and Geometric Foundations- Curve, Surface, and Volume generation- CAGD applications in Numerical Analysis, Computational Geometry, Computer Graphics, or Computer Vision- Industrial, medical, and scientific applications. The aim is to collect and disseminate information on computer aided design in one journal. To provide the user community with methods and algorithms for representing curves and surfaces. To illustrate computer aided geometric design by means of interesting applications. To combine curve and surface methods with computer graphics. To explain scientific phenomena by means of computer graphics. To concentrate on the interaction between theory and application. To expose unsolved problems of the practice. To develop new methods in computer aided geometry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信