{"title":"ELMF-Net:基于高效局部特征学习和多尺度融合的大规模点云语义分割","authors":"Maomao Sun;Ting Rui;Dong Wang;Chengsong Yang;Nan Zheng","doi":"10.1109/JSEN.2025.3534319","DOIUrl":null,"url":null,"abstract":"The semantic segmentation of 3-D point clouds can precisely describe 3-D environmental information, serving as an important research direction for environmental perception in unmanned systems. However, existing methods face drawbacks owing to the limitations in local semantic feature representation and cross-scale information fusion capabilities. To address these issues, we propose ELMF-Net, an efficient and accurate semantic segmentation model for large-scale 3-D point clouds. First, we introduce a local feature learning method that does not rely on strict geometric relationships and establish a local feature learner (w-LFL) model to capture and aggregate locally semantic discriminative features from point clouds. Subsequently, a novel multiscale feature fusion (MSFF) module was designed to collaborate with the decoder to deeply integrate shallow encoding layer features at different resolutions and high-level semantic features from deep encoding layers, providing an efficient representation of objects with varying scales. Finally, we validate the performance of ELMF-Net on three large-scale datasets, Stanford large-scale 3D indoor spaces dataset (S3DIS), Toronto3D, and Semantic Karlsruhe Institute of Technology and Toyota Technological Institute at Chicago (KITTI), demonstrating the excellent performance of the ELMF-Net network in large-scale, multitarget scene.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 7","pages":"11392-11404"},"PeriodicalIF":4.3000,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ELMF-Net: Semantic Segmentation of Large-Scale Point Clouds via Efficient Local Feature Learner and Multiscale Fusion\",\"authors\":\"Maomao Sun;Ting Rui;Dong Wang;Chengsong Yang;Nan Zheng\",\"doi\":\"10.1109/JSEN.2025.3534319\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The semantic segmentation of 3-D point clouds can precisely describe 3-D environmental information, serving as an important research direction for environmental perception in unmanned systems. However, existing methods face drawbacks owing to the limitations in local semantic feature representation and cross-scale information fusion capabilities. To address these issues, we propose ELMF-Net, an efficient and accurate semantic segmentation model for large-scale 3-D point clouds. First, we introduce a local feature learning method that does not rely on strict geometric relationships and establish a local feature learner (w-LFL) model to capture and aggregate locally semantic discriminative features from point clouds. Subsequently, a novel multiscale feature fusion (MSFF) module was designed to collaborate with the decoder to deeply integrate shallow encoding layer features at different resolutions and high-level semantic features from deep encoding layers, providing an efficient representation of objects with varying scales. Finally, we validate the performance of ELMF-Net on three large-scale datasets, Stanford large-scale 3D indoor spaces dataset (S3DIS), Toronto3D, and Semantic Karlsruhe Institute of Technology and Toyota Technological Institute at Chicago (KITTI), demonstrating the excellent performance of the ELMF-Net network in large-scale, multitarget scene.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"25 7\",\"pages\":\"11392-11404\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-02-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10869312/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10869312/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
ELMF-Net: Semantic Segmentation of Large-Scale Point Clouds via Efficient Local Feature Learner and Multiscale Fusion
The semantic segmentation of 3-D point clouds can precisely describe 3-D environmental information, serving as an important research direction for environmental perception in unmanned systems. However, existing methods face drawbacks owing to the limitations in local semantic feature representation and cross-scale information fusion capabilities. To address these issues, we propose ELMF-Net, an efficient and accurate semantic segmentation model for large-scale 3-D point clouds. First, we introduce a local feature learning method that does not rely on strict geometric relationships and establish a local feature learner (w-LFL) model to capture and aggregate locally semantic discriminative features from point clouds. Subsequently, a novel multiscale feature fusion (MSFF) module was designed to collaborate with the decoder to deeply integrate shallow encoding layer features at different resolutions and high-level semantic features from deep encoding layers, providing an efficient representation of objects with varying scales. Finally, we validate the performance of ELMF-Net on three large-scale datasets, Stanford large-scale 3D indoor spaces dataset (S3DIS), Toronto3D, and Semantic Karlsruhe Institute of Technology and Toyota Technological Institute at Chicago (KITTI), demonstrating the excellent performance of the ELMF-Net network in large-scale, multitarget scene.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice