基于邻域学习的高效轻量级点云识别网络

IF 2.2 4区计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IET Image Processing Pub Date : 2025-10-09 DOI:10.1049/ipr2.70226

Yanxia Bao, Zilong Liu, Yahong Chen, Yang Shen

{"title":"基于邻域学习的高效轻量级点云识别网络","authors":"Yanxia Bao, Zilong Liu, Yahong Chen, Yang Shen","doi":"10.1049/ipr2.70226","DOIUrl":null,"url":null,"abstract":"<p>Point cloud recognition has wide applications in fields such as autonomous driving and shape classification. Although significant progress has been made in point cloud processing in recent years, most of it has been achieved by designing more complex networks to attain better performance. This paper proposes a novel lightweight point cloud recognition network by introducing a new local neighborhood optimization layer (LNOL), which improves traditional sampling methods by correlation learning in local area. The LNOL is embedded within a single-layer local transformer architecture, significantly reducing computational complexity and parameters while maintaining the model's expressive power. Experimental results on the ModelNet40 benchmark dataset demonstrate that our method achieves a classification accuracy of 93.3% and an average precision of 92.0% without using a voting strategy. Compared to the mainstream local transformer model point transformer, our network requires only 9.95G FLOPs and 2.33M parameters, reducing computational cost by 94.7% and parameter count by 75.7%, with only a 0.4% drop in accuracy. This study provides an efficient solution for real-time 3D recognition applications, significantly lowering computational resource requirements while maintaining performance.</p>","PeriodicalId":56303,"journal":{"name":"IET Image Processing","volume":"19 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ipr2.70226","citationCount":"0","resultStr":"{\"title\":\"An Efficient and Lightweight Point Cloud Recognition Network Based on Neighborhood Learning\",\"authors\":\"Yanxia Bao, Zilong Liu, Yahong Chen, Yang Shen\",\"doi\":\"10.1049/ipr2.70226\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Point cloud recognition has wide applications in fields such as autonomous driving and shape classification. Although significant progress has been made in point cloud processing in recent years, most of it has been achieved by designing more complex networks to attain better performance. This paper proposes a novel lightweight point cloud recognition network by introducing a new local neighborhood optimization layer (LNOL), which improves traditional sampling methods by correlation learning in local area. The LNOL is embedded within a single-layer local transformer architecture, significantly reducing computational complexity and parameters while maintaining the model's expressive power. Experimental results on the ModelNet40 benchmark dataset demonstrate that our method achieves a classification accuracy of 93.3% and an average precision of 92.0% without using a voting strategy. Compared to the mainstream local transformer model point transformer, our network requires only 9.95G FLOPs and 2.33M parameters, reducing computational cost by 94.7% and parameter count by 75.7%, with only a 0.4% drop in accuracy. This study provides an efficient solution for real-time 3D recognition applications, significantly lowering computational resource requirements while maintaining performance.</p>\",\"PeriodicalId\":56303,\"journal\":{\"name\":\"IET Image Processing\",\"volume\":\"19 1\",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ipr2.70226\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Image Processing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/ipr2.70226\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Image Processing","FirstCategoryId":"94","ListUrlMain":"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/ipr2.70226","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

点云识别在自动驾驶、形状分类等领域有着广泛的应用。虽然近年来在点云处理方面取得了重大进展，但大多数都是通过设计更复杂的网络来获得更好的性能。通过引入局部邻域优化层（LNOL），改进了传统的局部相关学习采样方法，提出了一种新的轻量级点云识别网络。LNOL嵌入在单层局部变压器体系结构中，显著降低了计算复杂度和参数，同时保持了模型的表达能力。在ModelNet40基准数据集上的实验结果表明，在不使用投票策略的情况下，我们的方法达到了93.3%的分类准确率和92.0%的平均准确率。与主流的局部变压器模型点变压器相比，我们的网络只需要9.95G的FLOPs和2.33M的参数，计算成本降低了94.7%，参数数量减少了75.7%，精度仅下降了0.4%。本研究为实时3D识别应用提供了一种高效的解决方案，在保持性能的同时显著降低了计算资源需求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

An Efficient and Lightweight Point Cloud Recognition Network Based on Neighborhood Learning

查看原文本刊更多论文

An Efficient and Lightweight Point Cloud Recognition Network Based on Neighborhood Learning

Point cloud recognition has wide applications in fields such as autonomous driving and shape classification. Although significant progress has been made in point cloud processing in recent years, most of it has been achieved by designing more complex networks to attain better performance. This paper proposes a novel lightweight point cloud recognition network by introducing a new local neighborhood optimization layer (LNOL), which improves traditional sampling methods by correlation learning in local area. The LNOL is embedded within a single-layer local transformer architecture, significantly reducing computational complexity and parameters while maintaining the model's expressive power. Experimental results on the ModelNet40 benchmark dataset demonstrate that our method achieves a classification accuracy of 93.3% and an average precision of 92.0% without using a voting strategy. Compared to the mainstream local transformer model point transformer, our network requires only 9.95G FLOPs and 2.33M parameters, reducing computational cost by 94.7% and parameter count by 75.7%, with only a 0.4% drop in accuracy. This study provides an efficient solution for real-time 3D recognition applications, significantly lowering computational resource requirements while maintaining performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IET Image Processing 工程技术-工程：电子与电气

CiteScore

5.40

自引率

8.70%

发文量

282

审稿时长

6 months

期刊介绍： The IET Image Processing journal encompasses research areas related to the generation, processing and communication of visual information. The focus of the journal is the coverage of the latest research results in image and video processing, including image generation and display, enhancement and restoration, segmentation, colour and texture analysis, coding and communication, implementations and architectures as well as innovative applications. Principal topics include: Generation and Display - Imaging sensors and acquisition systems, illumination, sampling and scanning, quantization, colour reproduction, image rendering, display and printing systems, evaluation of image quality. Processing and Analysis - Image enhancement, restoration, segmentation, registration, multispectral, colour and texture processing, multiresolution processing and wavelets, morphological operations, stereoscopic and 3-D processing, motion detection and estimation, video and image sequence processing. Implementations and Architectures - Image and video processing hardware and software, design and construction, architectures and software, neural, adaptive, and fuzzy processing. Coding and Transmission - Image and video compression and coding, compression standards, noise modelling, visual information networks, streamed video. Retrieval and Multimedia - Storage of images and video, database design, image retrieval, video annotation and editing, mixed media incorporating visual information, multimedia systems and applications, image and video watermarking, steganography. Applications - Innovative application of image and video processing technologies to any field, including life sciences, earth sciences, astronomy, document processing and security. Current Special Issue Call for Papers: Evolutionary Computation for Image Processing - https://digital-library.theiet.org/files/IET_IPR_CFP_EC.pdf AI-Powered 3D Vision - https://digital-library.theiet.org/files/IET_IPR_CFP_AIPV.pdf Multidisciplinary advancement of Imaging Technologies: From Medical Diagnostics and Genomics to Cognitive Machine Vision, and Artificial Intelligence - https://digital-library.theiet.org/files/IET_IPR_CFP_IST.pdf Deep Learning for 3D Reconstruction - https://digital-library.theiet.org/files/IET_IPR_CFP_DLR.pdf