PBMA:通过点对盒运动增强增强3D点云跟踪

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Kaijie Zhao , Haitao Zhao , Zhongze Wang , Lujian Yao , Jingchao Peng , Zhengwei Hu
{"title":"PBMA:通过点对盒运动增强增强3D点云跟踪","authors":"Kaijie Zhao ,&nbsp;Haitao Zhao ,&nbsp;Zhongze Wang ,&nbsp;Lujian Yao ,&nbsp;Jingchao Peng ,&nbsp;Zhengwei Hu","doi":"10.1016/j.eswa.2025.127548","DOIUrl":null,"url":null,"abstract":"<div><div>3D single object tracking (SOT) holds significance in various applications, especially within the domain of autonomous driving. The 3D SOT task involves the processing of point cloud sequence data that comprises rich inter-frame spatio-temporal information and target motion cues. However, contemporary methods often involve scene cropping and coordinate system transformations, leading to under-utilize of the spatio-temporal context and disrupt the intrinsic motion coherence between successive frames. To tackle this problem, we propose <strong>P</strong>oint-to-<strong>B</strong>ox <strong>M</strong>otion <strong>A</strong>ugmentation (PBMA), an intuitive method that incorporates coordinate shifts and feature displacement across consecutive frames to augment the modeling of target motion. In particular, our approach estimates inter-frame scene flow and feature flow, which are jointly referred to low-level point motion. This enables a comprehensive understanding of the dynamic environment and the capture of motion cues from the spatio-temporal context, all without the need for cropping or coordinate system transformations. Ultimately, PBMA translates low-level point motions and geometry features for augmenting high-level target motion modeling. Extensive experiments on large-scale datasets show that PBMA achieves state-of-the-art tracking performance while running at 25 FPS.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"281 ","pages":"Article 127548"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PBMA: Enhancing 3D point cloud tracking with Point-to-Box Motion Augmentation\",\"authors\":\"Kaijie Zhao ,&nbsp;Haitao Zhao ,&nbsp;Zhongze Wang ,&nbsp;Lujian Yao ,&nbsp;Jingchao Peng ,&nbsp;Zhengwei Hu\",\"doi\":\"10.1016/j.eswa.2025.127548\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>3D single object tracking (SOT) holds significance in various applications, especially within the domain of autonomous driving. The 3D SOT task involves the processing of point cloud sequence data that comprises rich inter-frame spatio-temporal information and target motion cues. However, contemporary methods often involve scene cropping and coordinate system transformations, leading to under-utilize of the spatio-temporal context and disrupt the intrinsic motion coherence between successive frames. To tackle this problem, we propose <strong>P</strong>oint-to-<strong>B</strong>ox <strong>M</strong>otion <strong>A</strong>ugmentation (PBMA), an intuitive method that incorporates coordinate shifts and feature displacement across consecutive frames to augment the modeling of target motion. In particular, our approach estimates inter-frame scene flow and feature flow, which are jointly referred to low-level point motion. This enables a comprehensive understanding of the dynamic environment and the capture of motion cues from the spatio-temporal context, all without the need for cropping or coordinate system transformations. Ultimately, PBMA translates low-level point motions and geometry features for augmenting high-level target motion modeling. Extensive experiments on large-scale datasets show that PBMA achieves state-of-the-art tracking performance while running at 25 FPS.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"281 \",\"pages\":\"Article 127548\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425011704\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425011704","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

3D单目标跟踪(SOT)在各种应用中具有重要意义,特别是在自动驾驶领域。三维SOT任务涉及处理包含丰富帧间时空信息和目标运动线索的点云序列数据。然而,当前的方法往往涉及场景裁剪和坐标系变换,导致对时空背景的利用不足,破坏了连续帧之间固有的运动一致性。为了解决这个问题,我们提出了点到盒运动增强(PBMA),这是一种直观的方法,它结合了连续帧之间的坐标位移和特征位移来增强目标运动的建模。特别是,我们的方法估计帧间场景流和特征流,它们共同被称为低级点运动。这使得对动态环境的全面理解和从时空背景中捕获运动线索成为可能,而不需要裁剪或坐标系统转换。最终,PBMA转换低级点运动和几何特征,以增强高级目标运动建模。在大规模数据集上的大量实验表明,PBMA在以25 FPS运行时达到了最先进的跟踪性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
PBMA: Enhancing 3D point cloud tracking with Point-to-Box Motion Augmentation
3D single object tracking (SOT) holds significance in various applications, especially within the domain of autonomous driving. The 3D SOT task involves the processing of point cloud sequence data that comprises rich inter-frame spatio-temporal information and target motion cues. However, contemporary methods often involve scene cropping and coordinate system transformations, leading to under-utilize of the spatio-temporal context and disrupt the intrinsic motion coherence between successive frames. To tackle this problem, we propose Point-to-Box Motion Augmentation (PBMA), an intuitive method that incorporates coordinate shifts and feature displacement across consecutive frames to augment the modeling of target motion. In particular, our approach estimates inter-frame scene flow and feature flow, which are jointly referred to low-level point motion. This enables a comprehensive understanding of the dynamic environment and the capture of motion cues from the spatio-temporal context, all without the need for cropping or coordinate system transformations. Ultimately, PBMA translates low-level point motions and geometry features for augmenting high-level target motion modeling. Extensive experiments on large-scale datasets show that PBMA achieves state-of-the-art tracking performance while running at 25 FPS.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Expert Systems with Applications
Expert Systems with Applications 工程技术-工程:电子与电气
CiteScore
13.80
自引率
10.60%
发文量
2045
审稿时长
8.7 months
期刊介绍: Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信