UAVs-based Small Object Detection and Tracking in Various Complex Scenarios

2021 International Symposium on Electrical, Electronics and Information Engineering Pub Date : 2021-02-19 DOI:10.1145/3459104.3459141

Shicheng Zu, Kai Yang, Xiulai Wang, Zhongzheng Yu, Yawen Hu, Jia Long

{"title":"UAVs-based Small Object Detection and Tracking in Various Complex Scenarios","authors":"Shicheng Zu, Kai Yang, Xiulai Wang, Zhongzheng Yu, Yawen Hu, Jia Long","doi":"10.1145/3459104.3459141","DOIUrl":null,"url":null,"abstract":"We have witnessed drastic progress in object detection in recent years due to the development of neural networks. Most mainstream object detectors are inclined to detect objects of regular scale because their detection depends on deep convolutional feature maps. Our study focused on UAVs-based small object detection at a high altitude, i.e., 100 meters. We constructed a pipeline by integrating the foreground segmentation algorithm, the image classification algorithm, the boosted cascaded classifier, and the tracker together that can detect and track the small object progressively in a cascaded manner. We performed the qualitative and quantitative evaluation of our pipeline's performance under various complex conditions. The comparison study confirmed its superiority in small object detection and strong robustness against various influential nuisances. Based on our constructed pipeline, we developed a real-time UAVs-based small object detection and tracking system. The system architecture and the general steps taken by the UAVs to realize small object detection were also presented. Finally, we qualitatively and quantitatively evaluated 8 popular trackers based on relevant image attributes. The most suitable tracker can be determined in response to a given condition. Our study testified that by taking advantage of each algorithm germane to a given task, the implementation performance can be improved. We also performed a quantitative evaluation of the 8 trackers on each pertinent image attribute. The results are shown in table 2. For each attribute, we highlighted the most suitable tracker in bold. In term of IV, the trackers utilizing feature assembly, i.e., the CSR-DCF and AdaBoost or the trackers using the texture features, i.e., LBP and HoG, usually perform better because the texture features are not sensitive to the IV [21]. The MIL tracker with the Haar-like features, however, is sensitive to the IV because the Haar-like features reflect the pixel intensity variations by subtracting pixel intensities between adjacent rectangular regions [21]. As far as OCC is concerned, the AdaBoost has superior performance because it allows online switching of multiple features for every frame [19]. The KCF shows diminished performance because the FFT requires the filter and the search region size to be equal limiting the detection range [17]. The reduced performance is also observed in the GOTURN since it estimates the object's location with one forward pass [20]. For MB, the MOSSE has improved performance because the correlation between the filter and the image becomes an element-wise multiplication in Fourier domain [16]. The MEDIANFLOW tracker does not perform well in MB because the rapid unpredictable motion causes a large discrepancy between the forward and backward tracking trajectories [22]. The OV resembles the occlusion in some respects. The MOSSE has improved performance in OV because it can detect occlusion via Peak-To-Sidelobe Ratio (PSR) and reinitiate tracking if the object reappears [16]. The TLD tracker also has enhanced performance because of its failure-safe detector to detect the object upon tracking failure [23]. The KCF's performance is degraded due to the lack of a failure recovery mechanism [17]. In term of BC, the CSR-DCF is good at coping with BC because of the spatial reliability map [18]. For LR, the KCF has poor performance because of the inadaptation of its initial circulant matrices to resolution variations [17]. In a nutshell, for the conditions which are more challenging, the CSR-DCF is a preferred choice while for conditions that are less complicated, the AdaBoost usually performs better.","PeriodicalId":142284,"journal":{"name":"2021 International Symposium on Electrical, Electronics and Information Engineering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Symposium on Electrical, Electronics and Information Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3459104.3459141","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

We have witnessed drastic progress in object detection in recent years due to the development of neural networks. Most mainstream object detectors are inclined to detect objects of regular scale because their detection depends on deep convolutional feature maps. Our study focused on UAVs-based small object detection at a high altitude, i.e., 100 meters. We constructed a pipeline by integrating the foreground segmentation algorithm, the image classification algorithm, the boosted cascaded classifier, and the tracker together that can detect and track the small object progressively in a cascaded manner. We performed the qualitative and quantitative evaluation of our pipeline's performance under various complex conditions. The comparison study confirmed its superiority in small object detection and strong robustness against various influential nuisances. Based on our constructed pipeline, we developed a real-time UAVs-based small object detection and tracking system. The system architecture and the general steps taken by the UAVs to realize small object detection were also presented. Finally, we qualitatively and quantitatively evaluated 8 popular trackers based on relevant image attributes. The most suitable tracker can be determined in response to a given condition. Our study testified that by taking advantage of each algorithm germane to a given task, the implementation performance can be improved. We also performed a quantitative evaluation of the 8 trackers on each pertinent image attribute. The results are shown in table 2. For each attribute, we highlighted the most suitable tracker in bold. In term of IV, the trackers utilizing feature assembly, i.e., the CSR-DCF and AdaBoost or the trackers using the texture features, i.e., LBP and HoG, usually perform better because the texture features are not sensitive to the IV [21]. The MIL tracker with the Haar-like features, however, is sensitive to the IV because the Haar-like features reflect the pixel intensity variations by subtracting pixel intensities between adjacent rectangular regions [21]. As far as OCC is concerned, the AdaBoost has superior performance because it allows online switching of multiple features for every frame [19]. The KCF shows diminished performance because the FFT requires the filter and the search region size to be equal limiting the detection range [17]. The reduced performance is also observed in the GOTURN since it estimates the object's location with one forward pass [20]. For MB, the MOSSE has improved performance because the correlation between the filter and the image becomes an element-wise multiplication in Fourier domain [16]. The MEDIANFLOW tracker does not perform well in MB because the rapid unpredictable motion causes a large discrepancy between the forward and backward tracking trajectories [22]. The OV resembles the occlusion in some respects. The MOSSE has improved performance in OV because it can detect occlusion via Peak-To-Sidelobe Ratio (PSR) and reinitiate tracking if the object reappears [16]. The TLD tracker also has enhanced performance because of its failure-safe detector to detect the object upon tracking failure [23]. The KCF's performance is degraded due to the lack of a failure recovery mechanism [17]. In term of BC, the CSR-DCF is good at coping with BC because of the spatial reliability map [18]. For LR, the KCF has poor performance because of the inadaptation of its initial circulant matrices to resolution variations [17]. In a nutshell, for the conditions which are more challenging, the CSR-DCF is a preferred choice while for conditions that are less complicated, the AdaBoost usually performs better.

查看原文本刊更多论文

基于无人机的复杂场景小目标检测与跟踪

近年来，由于神经网络的发展，我们在目标检测方面取得了巨大的进步。大多数主流的目标检测器都倾向于检测规则尺度的目标，因为它们的检测依赖于深度卷积特征映射。我们的研究重点是基于无人机的高空小目标探测，即100米。我们将前景分割算法、图像分类算法、增强级联分类器和跟踪器集成在一起，构建了一个流水线，以级联的方式逐步检测和跟踪小目标。我们对管道在各种复杂条件下的性能进行了定性和定量评估。对比研究证实了该方法在小目标检测方面的优越性和对各种干扰的鲁棒性。基于我们构建的管道，我们开发了一个基于无人机的实时小目标检测与跟踪系统。介绍了系统结构和无人机实现小目标检测的一般步骤。最后，基于相关图像属性，对8款热门追踪器进行定性和定量评价。可以根据给定条件确定最合适的跟踪器。我们的研究证明，通过利用与给定任务相关的每种算法，可以提高实现性能。我们还对每个相关图像属性的8个跟踪器进行了定量评估。结果如表2所示。对于每个属性，我们用粗体突出显示最合适的跟踪器。在IV方面，利用特征集合的跟踪器，即CSR-DCF和AdaBoost，或者使用纹理特征的跟踪器，即LBP和HoG，通常表现更好，因为纹理特征对IV不敏感[21]。然而，具有haar样特征的MIL跟踪器对IV敏感，因为haar样特征通过减去相邻矩形区域之间的像素强度来反映像素强度的变化[21]。就OCC而言，AdaBoost具有更优越的性能，因为它允许每帧多个特征在线切换[19]。KCF表现出性能下降，因为FFT要求滤波器和搜索区域大小相等，限制了检测范围[17]。在GOTURN中也观察到性能下降，因为它通过一次向前传递来估计对象的位置[20]。对于MB, MOSSE提高了性能，因为滤波器和图像之间的相关性在傅里叶域中变成了逐元乘法[16]。MEDIANFLOW跟踪器在MB中表现不佳，因为快速的不可预测运动导致向前和向后跟踪轨迹之间存在很大差异[22]。OV在某些方面类似于闭塞。MOSSE改进了OV的性能，因为它可以通过峰旁比(Peak-To-Sidelobe Ratio, PSR)检测遮挡，并在物体重新出现时重新启动跟踪[16]。TLD跟踪器还具有增强的性能，因为它的故障安全检测器可以在跟踪失败时检测对象[23]。由于缺乏故障恢复机制，KCF的性能下降[17]。在BC方面，由于空间可靠性图的存在，CSR-DCF具有较好的应对BC的能力[18]。对于LR，由于初始循环矩阵对分辨率变化的不适应，KCF的性能较差[17]。简而言之，对于更具挑战性的条件，CSR-DCF是首选，而对于不太复杂的条件，AdaBoost通常表现更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 International Symposium on Electrical, Electronics and Information Engineering

自引率

0.00%

发文量