{"title":"Enhanced lightweight detection of small and tiny objects in high-resolution images using object tracking-based region of interest proposal","authors":"Aleksandra Kos , Karol Majek , Dominik Belter","doi":"10.1016/j.engappai.2025.110852","DOIUrl":null,"url":null,"abstract":"<div><div>Detecting small objects in high-resolution images is challenging in practical applications. Downsampling the image makes small objects difficult or impossible to detect, while processing multiple low-resolution detection windows using a sliding-window approach is both time-consuming and impractical. To deal with this engineering problem, we present a novel window-based object detection system designed for detecting tiny and multi-scale objects in high-resolution images. Our approach introduces two Region of Interest (ROI) Modules that select full-resolution areas for the detector to focus on. We use a low-resolution current frame to estimate ROIs with a segmentation-based branch combined with past detection metadata to predict object locations with a tracking-based branch. By fusing outputs from these modules, we effectively recover regions with tiny objects overlooked by the estimation branch, and we significantly reduce the number of object detection runs compared to the sliding-window approach, maintaining the method’s speed. Moreover, we employ a fusion of downscaling and sliding-window techniques within large ROIs, complemented by our novel Overlapping Box Suppression (OBS) algorithm to reduce partial false-positive detections. We analyze our system and all its components on two challenging datasets — SeaDronesSee and DroneCrowd to show superior performance compared to state-of-the-art object detectors. Our approach enhances both the Artificial Intelligence (AI) and engineering domains by improving the quality and efficiency of tiny object detection, facilitating its integration into demanding real-time robotics applications. The inference code is available at <span><span>https://github.com/deepdrivepl/TinyROIFusion</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"153 ","pages":"Article 110852"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625008528","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Detecting small objects in high-resolution images is challenging in practical applications. Downsampling the image makes small objects difficult or impossible to detect, while processing multiple low-resolution detection windows using a sliding-window approach is both time-consuming and impractical. To deal with this engineering problem, we present a novel window-based object detection system designed for detecting tiny and multi-scale objects in high-resolution images. Our approach introduces two Region of Interest (ROI) Modules that select full-resolution areas for the detector to focus on. We use a low-resolution current frame to estimate ROIs with a segmentation-based branch combined with past detection metadata to predict object locations with a tracking-based branch. By fusing outputs from these modules, we effectively recover regions with tiny objects overlooked by the estimation branch, and we significantly reduce the number of object detection runs compared to the sliding-window approach, maintaining the method’s speed. Moreover, we employ a fusion of downscaling and sliding-window techniques within large ROIs, complemented by our novel Overlapping Box Suppression (OBS) algorithm to reduce partial false-positive detections. We analyze our system and all its components on two challenging datasets — SeaDronesSee and DroneCrowd to show superior performance compared to state-of-the-art object detectors. Our approach enhances both the Artificial Intelligence (AI) and engineering domains by improving the quality and efficiency of tiny object detection, facilitating its integration into demanding real-time robotics applications. The inference code is available at https://github.com/deepdrivepl/TinyROIFusion.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.