Kaijie Zhao , Haitao Zhao , Zhongze Wang , Lujian Yao , Jingchao Peng , Zhengwei Hu
{"title":"PBMA:通过点对盒运动增强增强3D点云跟踪","authors":"Kaijie Zhao , Haitao Zhao , Zhongze Wang , Lujian Yao , Jingchao Peng , Zhengwei Hu","doi":"10.1016/j.eswa.2025.127548","DOIUrl":null,"url":null,"abstract":"<div><div>3D single object tracking (SOT) holds significance in various applications, especially within the domain of autonomous driving. The 3D SOT task involves the processing of point cloud sequence data that comprises rich inter-frame spatio-temporal information and target motion cues. However, contemporary methods often involve scene cropping and coordinate system transformations, leading to under-utilize of the spatio-temporal context and disrupt the intrinsic motion coherence between successive frames. To tackle this problem, we propose <strong>P</strong>oint-to-<strong>B</strong>ox <strong>M</strong>otion <strong>A</strong>ugmentation (PBMA), an intuitive method that incorporates coordinate shifts and feature displacement across consecutive frames to augment the modeling of target motion. In particular, our approach estimates inter-frame scene flow and feature flow, which are jointly referred to low-level point motion. This enables a comprehensive understanding of the dynamic environment and the capture of motion cues from the spatio-temporal context, all without the need for cropping or coordinate system transformations. Ultimately, PBMA translates low-level point motions and geometry features for augmenting high-level target motion modeling. Extensive experiments on large-scale datasets show that PBMA achieves state-of-the-art tracking performance while running at 25 FPS.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"281 ","pages":"Article 127548"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PBMA: Enhancing 3D point cloud tracking with Point-to-Box Motion Augmentation\",\"authors\":\"Kaijie Zhao , Haitao Zhao , Zhongze Wang , Lujian Yao , Jingchao Peng , Zhengwei Hu\",\"doi\":\"10.1016/j.eswa.2025.127548\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>3D single object tracking (SOT) holds significance in various applications, especially within the domain of autonomous driving. The 3D SOT task involves the processing of point cloud sequence data that comprises rich inter-frame spatio-temporal information and target motion cues. However, contemporary methods often involve scene cropping and coordinate system transformations, leading to under-utilize of the spatio-temporal context and disrupt the intrinsic motion coherence between successive frames. To tackle this problem, we propose <strong>P</strong>oint-to-<strong>B</strong>ox <strong>M</strong>otion <strong>A</strong>ugmentation (PBMA), an intuitive method that incorporates coordinate shifts and feature displacement across consecutive frames to augment the modeling of target motion. In particular, our approach estimates inter-frame scene flow and feature flow, which are jointly referred to low-level point motion. This enables a comprehensive understanding of the dynamic environment and the capture of motion cues from the spatio-temporal context, all without the need for cropping or coordinate system transformations. Ultimately, PBMA translates low-level point motions and geometry features for augmenting high-level target motion modeling. Extensive experiments on large-scale datasets show that PBMA achieves state-of-the-art tracking performance while running at 25 FPS.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"281 \",\"pages\":\"Article 127548\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425011704\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425011704","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
PBMA: Enhancing 3D point cloud tracking with Point-to-Box Motion Augmentation
3D single object tracking (SOT) holds significance in various applications, especially within the domain of autonomous driving. The 3D SOT task involves the processing of point cloud sequence data that comprises rich inter-frame spatio-temporal information and target motion cues. However, contemporary methods often involve scene cropping and coordinate system transformations, leading to under-utilize of the spatio-temporal context and disrupt the intrinsic motion coherence between successive frames. To tackle this problem, we propose Point-to-Box Motion Augmentation (PBMA), an intuitive method that incorporates coordinate shifts and feature displacement across consecutive frames to augment the modeling of target motion. In particular, our approach estimates inter-frame scene flow and feature flow, which are jointly referred to low-level point motion. This enables a comprehensive understanding of the dynamic environment and the capture of motion cues from the spatio-temporal context, all without the need for cropping or coordinate system transformations. Ultimately, PBMA translates low-level point motions and geometry features for augmenting high-level target motion modeling. Extensive experiments on large-scale datasets show that PBMA achieves state-of-the-art tracking performance while running at 25 FPS.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.