PBMA：通过点对盒运动增强增强3D点云跟踪

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications Pub Date : 2025-04-15 DOI:10.1016/j.eswa.2025.127548

Kaijie Zhao , Haitao Zhao , Zhongze Wang , Lujian Yao , Jingchao Peng , Zhengwei Hu

{"title":"PBMA：通过点对盒运动增强增强3D点云跟踪","authors":"Kaijie Zhao , Haitao Zhao , Zhongze Wang , Lujian Yao , Jingchao Peng , Zhengwei Hu","doi":"10.1016/j.eswa.2025.127548","DOIUrl":null,"url":null,"abstract":"<div><div>3D single object tracking (SOT) holds significance in various applications, especially within the domain of autonomous driving. The 3D SOT task involves the processing of point cloud sequence data that comprises rich inter-frame spatio-temporal information and target motion cues. However, contemporary methods often involve scene cropping and coordinate system transformations, leading to under-utilize of the spatio-temporal context and disrupt the intrinsic motion coherence between successive frames. To tackle this problem, we propose <strong>P</strong>oint-to-<strong>B</strong>ox <strong>M</strong>otion <strong>A</strong>ugmentation (PBMA), an intuitive method that incorporates coordinate shifts and feature displacement across consecutive frames to augment the modeling of target motion. In particular, our approach estimates inter-frame scene flow and feature flow, which are jointly referred to low-level point motion. This enables a comprehensive understanding of the dynamic environment and the capture of motion cues from the spatio-temporal context, all without the need for cropping or coordinate system transformations. Ultimately, PBMA translates low-level point motions and geometry features for augmenting high-level target motion modeling. Extensive experiments on large-scale datasets show that PBMA achieves state-of-the-art tracking performance while running at 25 FPS.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"281 ","pages":"Article 127548"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PBMA: Enhancing 3D point cloud tracking with Point-to-Box Motion Augmentation\",\"authors\":\"Kaijie Zhao , Haitao Zhao , Zhongze Wang , Lujian Yao , Jingchao Peng , Zhengwei Hu\",\"doi\":\"10.1016/j.eswa.2025.127548\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>3D single object tracking (SOT) holds significance in various applications, especially within the domain of autonomous driving. The 3D SOT task involves the processing of point cloud sequence data that comprises rich inter-frame spatio-temporal information and target motion cues. However, contemporary methods often involve scene cropping and coordinate system transformations, leading to under-utilize of the spatio-temporal context and disrupt the intrinsic motion coherence between successive frames. To tackle this problem, we propose <strong>P</strong>oint-to-<strong>B</strong>ox <strong>M</strong>otion <strong>A</strong>ugmentation (PBMA), an intuitive method that incorporates coordinate shifts and feature displacement across consecutive frames to augment the modeling of target motion. In particular, our approach estimates inter-frame scene flow and feature flow, which are jointly referred to low-level point motion. This enables a comprehensive understanding of the dynamic environment and the capture of motion cues from the spatio-temporal context, all without the need for cropping or coordinate system transformations. Ultimately, PBMA translates low-level point motions and geometry features for augmenting high-level target motion modeling. Extensive experiments on large-scale datasets show that PBMA achieves state-of-the-art tracking performance while running at 25 FPS.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"281 \",\"pages\":\"Article 127548\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425011704\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425011704","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

3D单目标跟踪（SOT）在各种应用中具有重要意义，特别是在自动驾驶领域。三维SOT任务涉及处理包含丰富帧间时空信息和目标运动线索的点云序列数据。然而，当前的方法往往涉及场景裁剪和坐标系变换，导致对时空背景的利用不足，破坏了连续帧之间固有的运动一致性。为了解决这个问题，我们提出了点到盒运动增强（PBMA），这是一种直观的方法，它结合了连续帧之间的坐标位移和特征位移来增强目标运动的建模。特别是，我们的方法估计帧间场景流和特征流，它们共同被称为低级点运动。这使得对动态环境的全面理解和从时空背景中捕获运动线索成为可能，而不需要裁剪或坐标系统转换。最终，PBMA转换低级点运动和几何特征，以增强高级目标运动建模。在大规模数据集上的大量实验表明，PBMA在以25 FPS运行时达到了最先进的跟踪性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

PBMA: Enhancing 3D point cloud tracking with Point-to-Box Motion Augmentation

3D single object tracking (SOT) holds significance in various applications, especially within the domain of autonomous driving. The 3D SOT task involves the processing of point cloud sequence data that comprises rich inter-frame spatio-temporal information and target motion cues. However, contemporary methods often involve scene cropping and coordinate system transformations, leading to under-utilize of the spatio-temporal context and disrupt the intrinsic motion coherence between successive frames. To tackle this problem, we propose Point-to-Box Motion Augmentation (PBMA), an intuitive method that incorporates coordinate shifts and feature displacement across consecutive frames to augment the modeling of target motion. In particular, our approach estimates inter-frame scene flow and feature flow, which are jointly referred to low-level point motion. This enables a comprehensive understanding of the dynamic environment and the capture of motion cues from the spatio-temporal context, all without the need for cropping or coordinate system transformations. Ultimately, PBMA translates low-level point motions and geometry features for augmenting high-level target motion modeling. Extensive experiments on large-scale datasets show that PBMA achieves state-of-the-art tracking performance while running at 25 FPS.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Expert Systems with Applications 工程技术-工程：电子与电气

CiteScore

13.80

自引率

10.60%

发文量

2045

审稿时长

8.7 months

期刊介绍： Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.