Jinjun Rao , Kai Yang , QinFei Zhao , Zhenwei Li , Jinbo Chen , Jingtao Lei , Mei Liu , Wojciech Giernacki
{"title":"f - resppoint:一种基于截锥体剔除的多模态融合目标检测算法","authors":"Jinjun Rao , Kai Yang , QinFei Zhao , Zhenwei Li , Jinbo Chen , Jingtao Lei , Mei Liu , Wojciech Giernacki","doi":"10.1016/j.displa.2025.103135","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate and lightweight 3D object detection is crucial for the perception ability of mobile intelligent robots. Aiming at the problem that current 3D object detection is difficult to balance lightweight and accuracy, this paper presents a multimodal fusion object detection method, F-ResPoint, based on point cloud culling in vision frustum. F-ResPoint is a lightweight two-stage object detection method, in which the frustum point cloud generation algorithm based on the improved YOLOv5 is used first to obtain the point cloud containing the object information in the view frustum from the point cloud in the original field. Then, the point cloud detection network based on the Residual-SA module is utilized for detecting the three-dimensional (3D) object from point cloud in the view frustum and obtaining its position and actual size of the object in 3D space. Experimental results demonstrate that this method achieves high detection accuracy and maintains good real-time performance while remaining lightweight.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"90 ","pages":"Article 103135"},"PeriodicalIF":3.7000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"F-ResPoint: a multi-modal fusion object detection algorithm based on frustum culling\",\"authors\":\"Jinjun Rao , Kai Yang , QinFei Zhao , Zhenwei Li , Jinbo Chen , Jingtao Lei , Mei Liu , Wojciech Giernacki\",\"doi\":\"10.1016/j.displa.2025.103135\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate and lightweight 3D object detection is crucial for the perception ability of mobile intelligent robots. Aiming at the problem that current 3D object detection is difficult to balance lightweight and accuracy, this paper presents a multimodal fusion object detection method, F-ResPoint, based on point cloud culling in vision frustum. F-ResPoint is a lightweight two-stage object detection method, in which the frustum point cloud generation algorithm based on the improved YOLOv5 is used first to obtain the point cloud containing the object information in the view frustum from the point cloud in the original field. Then, the point cloud detection network based on the Residual-SA module is utilized for detecting the three-dimensional (3D) object from point cloud in the view frustum and obtaining its position and actual size of the object in 3D space. Experimental results demonstrate that this method achieves high detection accuracy and maintains good real-time performance while remaining lightweight.</div></div>\",\"PeriodicalId\":50570,\"journal\":{\"name\":\"Displays\",\"volume\":\"90 \",\"pages\":\"Article 103135\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Displays\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0141938225001726\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225001726","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
F-ResPoint: a multi-modal fusion object detection algorithm based on frustum culling
Accurate and lightweight 3D object detection is crucial for the perception ability of mobile intelligent robots. Aiming at the problem that current 3D object detection is difficult to balance lightweight and accuracy, this paper presents a multimodal fusion object detection method, F-ResPoint, based on point cloud culling in vision frustum. F-ResPoint is a lightweight two-stage object detection method, in which the frustum point cloud generation algorithm based on the improved YOLOv5 is used first to obtain the point cloud containing the object information in the view frustum from the point cloud in the original field. Then, the point cloud detection network based on the Residual-SA module is utilized for detecting the three-dimensional (3D) object from point cloud in the view frustum and obtaining its position and actual size of the object in 3D space. Experimental results demonstrate that this method achieves high detection accuracy and maintains good real-time performance while remaining lightweight.
期刊介绍:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface.
Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.