从 MLS 点云绘制大规模车道图的基准方法和数据集

IF 7.6 Q1 REMOTE SENSING

International journal of applied earth observation and geoinformation : ITC journal Pub Date : 2024-09-01 DOI:10.1016/j.jag.2024.104139

Xiaoxin Mi , Zhen Dong , Zhipeng Cao , Bisheng Yang , Zhen Cao , Chao Zheng , Jantien Stoter , Liangliang Nan

{"title":"从 MLS 点云绘制大规模车道图的基准方法和数据集","authors":"Xiaoxin Mi , Zhen Dong , Zhipeng Cao , Bisheng Yang , Zhen Cao , Chao Zheng , Jantien Stoter , Liangliang Nan","doi":"10.1016/j.jag.2024.104139","DOIUrl":null,"url":null,"abstract":"<div><p>Accurate lane maps with semantics are crucial for various applications, such as high-definition maps (HD Maps), intelligent transportation systems (ITS), and digital twins. Manual annotation of lanes is labor-intensive and costly, prompting researchers to explore automatic lane extraction methods. This paper presents an end-to-end large-scale lane mapping method that considers both lane geometry and semantics. This study represents lane markings as polylines with uniformly sampled points and associated semantics, allowing for adaptation to varying lane shapes. Additionally, we propose an end-to-end network to extract lane polylines from mobile laser scanning (MLS) data, enabling the inference of vectorized lane instances without complex post-processing. The network consists of three components: a feature encoder, a column proposal generator, and a lane information decoder. The feature encoder encodes textual and structural information of lane markings to enhance the method’s robustness to data imperfections, such as varying lane intensity, uneven point density, and occlusion-induced incomplete data. The column proposal generator generates regions of interest for the subsequent decoder. Leveraging the embedded multi-scale features from the feature encoder, the lane decoder effectively predicts lane polylines and their associated semantics without requiring step-by-step conditional inference. Comprehensive experiments conducted on three lane datasets have demonstrated the performance of the proposed method, even in the presence of incomplete data and complex lane topology. Furthermore, the datasets used in this work, including source ground points, generated bird’s eye view (BEV) images, and annotations, will be publicly available with the publication of the paper. The code and dataset will be accessible through <span><span>here</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"133 ","pages":"Article 104139"},"PeriodicalIF":7.6000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S156984322400493X/pdfft?md5=d68dcec273c41a425a3f022365adcf23&pid=1-s2.0-S156984322400493X-main.pdf","citationCount":"0","resultStr":"{\"title\":\"A benchmark approach and dataset for large-scale lane mapping from MLS point clouds\",\"authors\":\"Xiaoxin Mi , Zhen Dong , Zhipeng Cao , Bisheng Yang , Zhen Cao , Chao Zheng , Jantien Stoter , Liangliang Nan\",\"doi\":\"10.1016/j.jag.2024.104139\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Accurate lane maps with semantics are crucial for various applications, such as high-definition maps (HD Maps), intelligent transportation systems (ITS), and digital twins. Manual annotation of lanes is labor-intensive and costly, prompting researchers to explore automatic lane extraction methods. This paper presents an end-to-end large-scale lane mapping method that considers both lane geometry and semantics. This study represents lane markings as polylines with uniformly sampled points and associated semantics, allowing for adaptation to varying lane shapes. Additionally, we propose an end-to-end network to extract lane polylines from mobile laser scanning (MLS) data, enabling the inference of vectorized lane instances without complex post-processing. The network consists of three components: a feature encoder, a column proposal generator, and a lane information decoder. The feature encoder encodes textual and structural information of lane markings to enhance the method’s robustness to data imperfections, such as varying lane intensity, uneven point density, and occlusion-induced incomplete data. The column proposal generator generates regions of interest for the subsequent decoder. Leveraging the embedded multi-scale features from the feature encoder, the lane decoder effectively predicts lane polylines and their associated semantics without requiring step-by-step conditional inference. Comprehensive experiments conducted on three lane datasets have demonstrated the performance of the proposed method, even in the presence of incomplete data and complex lane topology. Furthermore, the datasets used in this work, including source ground points, generated bird’s eye view (BEV) images, and annotations, will be publicly available with the publication of the paper. The code and dataset will be accessible through <span><span>here</span><svg><path></path></svg></span>.</p></div>\",\"PeriodicalId\":73423,\"journal\":{\"name\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"volume\":\"133 \",\"pages\":\"Article 104139\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2024-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S156984322400493X/pdfft?md5=d68dcec273c41a425a3f022365adcf23&pid=1-s2.0-S156984322400493X-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S156984322400493X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"REMOTE SENSING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S156984322400493X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}

引用次数: 0

摘要

具有语义的精确车道地图对于高清地图（HD Maps）、智能交通系统（ITS）和数字孪生等各种应用至关重要。人工标注车道耗费大量人力和财力，促使研究人员探索自动提取车道的方法。本文介绍了一种端到端的大规模车道映射方法，该方法同时考虑了车道的几何形状和语义。这项研究将车道标记表示为具有均匀采样点和相关语义的折线，从而可以适应不同的车道形状。此外，我们还提出了一种端到端网络，用于从移动激光扫描 (MLS) 数据中提取车道折线，从而无需复杂的后处理即可推断出矢量化的车道实例。该网络由三个部分组成：特征编码器、列建议生成器和车道信息解码器。特征编码器对车道标记的文本和结构信息进行编码，以增强该方法对数据缺陷的鲁棒性，例如不同的车道强度、不均匀的点密度以及遮挡引起的不完整数据。列建议生成器为后续解码器生成感兴趣区域。利用来自特征编码器的嵌入式多尺度特征，车道解码器可有效预测车道折线及其相关语义，而无需逐步进行条件推理。在三个车道数据集上进行的综合实验证明了所提出方法的性能，即使在数据不完整和车道拓扑结构复杂的情况下也是如此。此外，这项工作中使用的数据集，包括源地面点、生成的鸟瞰（BEV）图像和注释，将在论文发表时公开。代码和数据集可通过此处访问。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A benchmark approach and dataset for large-scale lane mapping from MLS point clouds

Accurate lane maps with semantics are crucial for various applications, such as high-definition maps (HD Maps), intelligent transportation systems (ITS), and digital twins. Manual annotation of lanes is labor-intensive and costly, prompting researchers to explore automatic lane extraction methods. This paper presents an end-to-end large-scale lane mapping method that considers both lane geometry and semantics. This study represents lane markings as polylines with uniformly sampled points and associated semantics, allowing for adaptation to varying lane shapes. Additionally, we propose an end-to-end network to extract lane polylines from mobile laser scanning (MLS) data, enabling the inference of vectorized lane instances without complex post-processing. The network consists of three components: a feature encoder, a column proposal generator, and a lane information decoder. The feature encoder encodes textual and structural information of lane markings to enhance the method’s robustness to data imperfections, such as varying lane intensity, uneven point density, and occlusion-induced incomplete data. The column proposal generator generates regions of interest for the subsequent decoder. Leveraging the embedded multi-scale features from the feature encoder, the lane decoder effectively predicts lane polylines and their associated semantics without requiring step-by-step conditional inference. Comprehensive experiments conducted on three lane datasets have demonstrated the performance of the proposed method, even in the presence of incomplete data and complex lane topology. Furthermore, the datasets used in this work, including source ground points, generated bird’s eye view (BEV) images, and annotations, will be publicly available with the publication of the paper. The code and dataset will be accessible through here.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International journal of applied earth observation and geoinformation : ITC journal Global and Planetary Change, Management, Monitoring, Policy and Law, Earth-Surface Processes, Computers in Earth Sciences

CiteScore

12.00

自引率

0.00%

发文量

审稿时长

77 days

期刊介绍： The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.