Shu-Chang Wang, Kun Qian, Jinzheng You, Shang Xinghao
{"title":"SiamDA:一个细节关注的Siamese网络,具有红外光学显著性,用于像素级无人机跟踪。","authors":"Shu-Chang Wang, Kun Qian, Jinzheng You, Shang Xinghao","doi":"10.1364/AO.569136","DOIUrl":null,"url":null,"abstract":"<p><p>With the rapid development of unmanned aerial vehicle (UAV) technology, establishing effective management systems for unmanned aerial vehicles has become increasingly important. Tracking small UAVs in complex environments using infrared imagery is a crucial yet challenging task, owing to limited target visibility and significant background clutter. Further, existing feature extraction methods struggle to effectively capture pixel-level infrared UAV signatures. Therefore, this paper introduces SiamDA, a detail-attentive anchor-free Siamese tracker designed to capture more infrared spectral details to enhance the representation of weak UAV targets. First, a detail-attentive network that employs deformable convolutions to capture fine-grained features, along with a Taylor-difference-inspired edge enhancement module to sharpen boundaries and reinforce geometric shapes of small UAVs. Then, a normalized Wasserstein distance loss and a dynamic template update scheme are integrated to improve tracking robustness. Evaluations on public near-infrared UAV datasets indicate that SiamDA attains an average precision (<i>P</i><sub>5</sub>) of more than 80%, surpassing state-of-the-art trackers trained on the same dataset.</p>","PeriodicalId":101299,"journal":{"name":"Applied optics","volume":"64 25","pages":"7586-7593"},"PeriodicalIF":0.0000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SiamDA: a detail-attentive Siamese network with infrared optical saliency for pixel-level UAV tracking.\",\"authors\":\"Shu-Chang Wang, Kun Qian, Jinzheng You, Shang Xinghao\",\"doi\":\"10.1364/AO.569136\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>With the rapid development of unmanned aerial vehicle (UAV) technology, establishing effective management systems for unmanned aerial vehicles has become increasingly important. Tracking small UAVs in complex environments using infrared imagery is a crucial yet challenging task, owing to limited target visibility and significant background clutter. Further, existing feature extraction methods struggle to effectively capture pixel-level infrared UAV signatures. Therefore, this paper introduces SiamDA, a detail-attentive anchor-free Siamese tracker designed to capture more infrared spectral details to enhance the representation of weak UAV targets. First, a detail-attentive network that employs deformable convolutions to capture fine-grained features, along with a Taylor-difference-inspired edge enhancement module to sharpen boundaries and reinforce geometric shapes of small UAVs. Then, a normalized Wasserstein distance loss and a dynamic template update scheme are integrated to improve tracking robustness. Evaluations on public near-infrared UAV datasets indicate that SiamDA attains an average precision (<i>P</i><sub>5</sub>) of more than 80%, surpassing state-of-the-art trackers trained on the same dataset.</p>\",\"PeriodicalId\":101299,\"journal\":{\"name\":\"Applied optics\",\"volume\":\"64 25\",\"pages\":\"7586-7593\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied optics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1364/AO.569136\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied optics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1364/AO.569136","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
SiamDA: a detail-attentive Siamese network with infrared optical saliency for pixel-level UAV tracking.
With the rapid development of unmanned aerial vehicle (UAV) technology, establishing effective management systems for unmanned aerial vehicles has become increasingly important. Tracking small UAVs in complex environments using infrared imagery is a crucial yet challenging task, owing to limited target visibility and significant background clutter. Further, existing feature extraction methods struggle to effectively capture pixel-level infrared UAV signatures. Therefore, this paper introduces SiamDA, a detail-attentive anchor-free Siamese tracker designed to capture more infrared spectral details to enhance the representation of weak UAV targets. First, a detail-attentive network that employs deformable convolutions to capture fine-grained features, along with a Taylor-difference-inspired edge enhancement module to sharpen boundaries and reinforce geometric shapes of small UAVs. Then, a normalized Wasserstein distance loss and a dynamic template update scheme are integrated to improve tracking robustness. Evaluations on public near-infrared UAV datasets indicate that SiamDA attains an average precision (P5) of more than 80%, surpassing state-of-the-art trackers trained on the same dataset.