Na Lin;Lei Zhang;Tianxiong Wu;Ammar Hawbani;Huiyu Zhou;Liang Zhao
{"title":"Surface Multiple Object Tracking: An Accurate HAT-YOLOv8-ADT Tracking Model","authors":"Na Lin;Lei Zhang;Tianxiong Wu;Ammar Hawbani;Huiyu Zhou;Liang Zhao","doi":"10.1109/JIOT.2025.3539852","DOIUrl":null,"url":null,"abstract":"With the development of artificial intelligence technology, Autonomous aerial vehicles (AAV) have the ability to sense the environment. multiple object tracking (MOT) in AAV video is a very important vision task with a wide variety of applications. However, there are still many challenges in MOT in AAV video. First, the movement of the onboard camera in the three-dimensional (3-D) direction during the tracking process, as well as the unpredictable measurement noise characteristics of AAVs flying at high speeds, can lead to significant deviations in the prediction of the object’s position. Second, the applicability of the traditional detection algorithm decreases when the object is small and dense in the AAV viewpoint during detection. Finally, the traditional intersection over union (IoU) matching approach does not take into account the effects of the height and width of the box, and the matching results are inaccurate for the prediction and detection box. In order to address these challenges, we recommend the adaptive DeepSort (ADT) algorithm to reduce the prediction bias due to camera movement and difficulty in predetermining measurement noise characteristics, the hybrid attention transformer-YOLOv8 (HAT-YOLOv8) algorithm to enhance the detection capability of tiny objects, and the IoU of height and width (HWIoU) matching algorithm, which improves the matching accuracy and thus the tracking accuracy. Experimental results show that our proposed solution outperforms the baseline solution. It outperforms the current mainstream StrongSort in MOTA, HOTA and IDF1 by 2.86%, 0.9%, and 9.36%. Code repository link: <uri>https://github.com/networkcommunication/</uri>.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 11","pages":"18266-18278"},"PeriodicalIF":8.9000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10877939/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
With the development of artificial intelligence technology, Autonomous aerial vehicles (AAV) have the ability to sense the environment. multiple object tracking (MOT) in AAV video is a very important vision task with a wide variety of applications. However, there are still many challenges in MOT in AAV video. First, the movement of the onboard camera in the three-dimensional (3-D) direction during the tracking process, as well as the unpredictable measurement noise characteristics of AAVs flying at high speeds, can lead to significant deviations in the prediction of the object’s position. Second, the applicability of the traditional detection algorithm decreases when the object is small and dense in the AAV viewpoint during detection. Finally, the traditional intersection over union (IoU) matching approach does not take into account the effects of the height and width of the box, and the matching results are inaccurate for the prediction and detection box. In order to address these challenges, we recommend the adaptive DeepSort (ADT) algorithm to reduce the prediction bias due to camera movement and difficulty in predetermining measurement noise characteristics, the hybrid attention transformer-YOLOv8 (HAT-YOLOv8) algorithm to enhance the detection capability of tiny objects, and the IoU of height and width (HWIoU) matching algorithm, which improves the matching accuracy and thus the tracking accuracy. Experimental results show that our proposed solution outperforms the baseline solution. It outperforms the current mainstream StrongSort in MOTA, HOTA and IDF1 by 2.86%, 0.9%, and 9.36%. Code repository link: https://github.com/networkcommunication/.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.