面向森林类型和传感器平台的森林场景语义分割的点云理解框架

IF 11.1 1区 地球科学 Q1 ENVIRONMENTAL SCIENCES
Hao Lu, Bowen Li, Gang Yang, Guangpeng Fan, Han Wang, Yong Pang, Zuyuan Wang, Yining Lian, Haifeng Xu, Huan Huang
{"title":"面向森林类型和传感器平台的森林场景语义分割的点云理解框架","authors":"Hao Lu, Bowen Li, Gang Yang, Guangpeng Fan, Han Wang, Yong Pang, Zuyuan Wang, Yining Lian, Haifeng Xu, Huan Huang","doi":"10.1016/j.rse.2024.114591","DOIUrl":null,"url":null,"abstract":"Foliage, wood (i.e., trunk, branch), ground, and lower objects (i.e., grass, shrubs), are key semantic components of forests that play different roles in the forest ecosystem. For understanding forest ecosystem structure and function, Light Detection And Ranging (LiDAR) point cloud is a valuable form of remote sensing observation. The understanding intensively relies on precisely performing semantic segmentation task to segment semantic components from forest point cloud data. However, the semantic segmentation of massive point cloud data from forest scenes remains a significant challenge. The forest environment is highly heterogeneous and complex due to tree species and terrain conditions. Different climate zones lead to varying canopy characteristics and the diversity of LiDAR platforms delivers inconsistent point cloud properties. Heuristic approaches and conventional machine learning approaches inevitably suffer from poor generalization. Additionally, most deep learning methods lack a dedicated network design to address the characteristics of forests. This paper introduces Sen-net, a point cloud understanding network specifically constructed for semantic component segmentation of forest scene point cloud. Sen-net implements three modules tailored for forest characteristics. First, a spatial context enhancement module (SCEM) is designed for providing both global and dataset-level perspectives to mine geometric information and robust features hidden in heterogeneous forest. Second, a semantic-driven detail enrichment module (SDEM) is incorporated to preserve rich geometric details and semantic information thereby enhancing the learning of complex structures in the forests. Finally, an adaptive guidance flow (AGF) is added to seamlessly fuse the semantic and detailed features. Comprehensive experiments were conducted on both self-built Lin3D dataset and public datasets. Sen-net achieved an OA of 97.6 % and 85.1 % MIoU on the Lin3D dataset, and an OA of 94.5 % and 78.2 % MIoU on the public dataset FOR-instance. Results show that Sen-net outperformed the representative forest scene point cloud semantic segmentation approaches and state-of-the-art deep learning networks, and it has the potential to generalize to point cloud data collected by LiDAR from other platforms. It is concluded that Sen-net is a powerful and robust framework with substantial potential for being widely and deeply explored in forest ecosystem studies.","PeriodicalId":417,"journal":{"name":"Remote Sensing of Environment","volume":"22 1","pages":""},"PeriodicalIF":11.1000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards a point cloud understanding framework for forest scene semantic segmentation across forest types and sensor platforms\",\"authors\":\"Hao Lu, Bowen Li, Gang Yang, Guangpeng Fan, Han Wang, Yong Pang, Zuyuan Wang, Yining Lian, Haifeng Xu, Huan Huang\",\"doi\":\"10.1016/j.rse.2024.114591\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Foliage, wood (i.e., trunk, branch), ground, and lower objects (i.e., grass, shrubs), are key semantic components of forests that play different roles in the forest ecosystem. For understanding forest ecosystem structure and function, Light Detection And Ranging (LiDAR) point cloud is a valuable form of remote sensing observation. The understanding intensively relies on precisely performing semantic segmentation task to segment semantic components from forest point cloud data. However, the semantic segmentation of massive point cloud data from forest scenes remains a significant challenge. The forest environment is highly heterogeneous and complex due to tree species and terrain conditions. Different climate zones lead to varying canopy characteristics and the diversity of LiDAR platforms delivers inconsistent point cloud properties. Heuristic approaches and conventional machine learning approaches inevitably suffer from poor generalization. Additionally, most deep learning methods lack a dedicated network design to address the characteristics of forests. This paper introduces Sen-net, a point cloud understanding network specifically constructed for semantic component segmentation of forest scene point cloud. Sen-net implements three modules tailored for forest characteristics. First, a spatial context enhancement module (SCEM) is designed for providing both global and dataset-level perspectives to mine geometric information and robust features hidden in heterogeneous forest. Second, a semantic-driven detail enrichment module (SDEM) is incorporated to preserve rich geometric details and semantic information thereby enhancing the learning of complex structures in the forests. Finally, an adaptive guidance flow (AGF) is added to seamlessly fuse the semantic and detailed features. Comprehensive experiments were conducted on both self-built Lin3D dataset and public datasets. Sen-net achieved an OA of 97.6 % and 85.1 % MIoU on the Lin3D dataset, and an OA of 94.5 % and 78.2 % MIoU on the public dataset FOR-instance. Results show that Sen-net outperformed the representative forest scene point cloud semantic segmentation approaches and state-of-the-art deep learning networks, and it has the potential to generalize to point cloud data collected by LiDAR from other platforms. It is concluded that Sen-net is a powerful and robust framework with substantial potential for being widely and deeply explored in forest ecosystem studies.\",\"PeriodicalId\":417,\"journal\":{\"name\":\"Remote Sensing of Environment\",\"volume\":\"22 1\",\"pages\":\"\"},\"PeriodicalIF\":11.1000,\"publicationDate\":\"2025-01-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Remote Sensing of Environment\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1016/j.rse.2024.114591\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing of Environment","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.rse.2024.114591","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

树叶、木材(如树干、树枝)、地面和下部物体(如草、灌木)是森林的关键语义组成部分,在森林生态系统中发挥着不同的作用。为了了解森林生态系统的结构和功能,光探测和测距(LiDAR)点云是一种宝贵的遥感观测形式。要了解森林生态系统的结构和功能,主要依赖于精确执行语义分割任务,从森林点云数据中分割出语义成分。然而,对来自森林场景的海量点云数据进行语义分割仍然是一项重大挑战。由于树种和地形条件的不同,森林环境具有高度的异质性和复杂性。不同的气候带会导致不同的树冠特征,而 LiDAR 平台的多样性会带来不一致的点云属性。启发式方法和传统的机器学习方法不可避免地存在泛化能力差的问题。此外,大多数深度学习方法缺乏针对森林特征的专用网络设计。本文介绍的 Sen-net 是一种点云理解网络,专门用于森林场景点云的语义组件分割。Sen-net 实现了三个针对森林特征的模块。首先,空间上下文增强模块(SCEM)旨在提供全局和数据集级视角,以挖掘隐藏在异构森林中的几何信息和稳健特征。其次,语义驱动的细节丰富模块(SDEM)用于保留丰富的几何细节和语义信息,从而加强对森林复杂结构的学习。最后,添加了自适应引导流(AGF),以无缝融合语义和细节特征。在自建的 Lin3D 数据集和公共数据集上进行了综合实验。在 Lin3D 数据集上,Sen-net 实现了 97.6 % 的 OA 和 85.1 % 的 MIoU,在公共数据集 FOR-instance 上,实现了 94.5 % 的 OA 和 78.2 % 的 MIoU。结果表明,Sen-net 的性能优于具有代表性的森林场景点云语义分割方法和最先进的深度学习网络,而且有可能推广到其他平台的激光雷达采集的点云数据。结论是,Sen-net 是一个强大而稳健的框架,具有在森林生态系统研究中进行广泛而深入探索的巨大潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Towards a point cloud understanding framework for forest scene semantic segmentation across forest types and sensor platforms
Foliage, wood (i.e., trunk, branch), ground, and lower objects (i.e., grass, shrubs), are key semantic components of forests that play different roles in the forest ecosystem. For understanding forest ecosystem structure and function, Light Detection And Ranging (LiDAR) point cloud is a valuable form of remote sensing observation. The understanding intensively relies on precisely performing semantic segmentation task to segment semantic components from forest point cloud data. However, the semantic segmentation of massive point cloud data from forest scenes remains a significant challenge. The forest environment is highly heterogeneous and complex due to tree species and terrain conditions. Different climate zones lead to varying canopy characteristics and the diversity of LiDAR platforms delivers inconsistent point cloud properties. Heuristic approaches and conventional machine learning approaches inevitably suffer from poor generalization. Additionally, most deep learning methods lack a dedicated network design to address the characteristics of forests. This paper introduces Sen-net, a point cloud understanding network specifically constructed for semantic component segmentation of forest scene point cloud. Sen-net implements three modules tailored for forest characteristics. First, a spatial context enhancement module (SCEM) is designed for providing both global and dataset-level perspectives to mine geometric information and robust features hidden in heterogeneous forest. Second, a semantic-driven detail enrichment module (SDEM) is incorporated to preserve rich geometric details and semantic information thereby enhancing the learning of complex structures in the forests. Finally, an adaptive guidance flow (AGF) is added to seamlessly fuse the semantic and detailed features. Comprehensive experiments were conducted on both self-built Lin3D dataset and public datasets. Sen-net achieved an OA of 97.6 % and 85.1 % MIoU on the Lin3D dataset, and an OA of 94.5 % and 78.2 % MIoU on the public dataset FOR-instance. Results show that Sen-net outperformed the representative forest scene point cloud semantic segmentation approaches and state-of-the-art deep learning networks, and it has the potential to generalize to point cloud data collected by LiDAR from other platforms. It is concluded that Sen-net is a powerful and robust framework with substantial potential for being widely and deeply explored in forest ecosystem studies.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Remote Sensing of Environment
Remote Sensing of Environment 环境科学-成像科学与照相技术
CiteScore
25.10
自引率
8.90%
发文量
455
审稿时长
53 days
期刊介绍: Remote Sensing of Environment (RSE) serves the Earth observation community by disseminating results on the theory, science, applications, and technology that contribute to advancing the field of remote sensing. With a thoroughly interdisciplinary approach, RSE encompasses terrestrial, oceanic, and atmospheric sensing. The journal emphasizes biophysical and quantitative approaches to remote sensing at local to global scales, covering a diverse range of applications and techniques. RSE serves as a vital platform for the exchange of knowledge and advancements in the dynamic field of remote sensing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信