{"title":"Infrared target tracking based on transformer","authors":"Zhou Xi, Xiaohong Li","doi":"10.1117/12.2682473","DOIUrl":null,"url":null,"abstract":"Infrared target images have low signal-to-noise ratio, blurred edges and missing textures, which make it a great challenge to identify the target and achieve stable tracking in the tracking process. However, ordinary target trackers use feature fusion as a convolutional operation, which is a local matching process that easily leads to the absence of high-level semantic information of the image, and is further limited on infrared images. Inspired by transformer, its attention mechanism can capture global features, as well as contextual relationships between features, and can well establish the association between remote features , long-range dependency and other advantages, we designed transofmer-based infrared target tracker, which is a network that performs feature enhancement and fusion on infrared images by tranformer, and classifies and regresses targets by classification head, and has proved the effectiveness of the method by conducting extensive experiments on challenging benchmarks.","PeriodicalId":440430,"journal":{"name":"International Conference on Electronic Technology and Information Science","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Electronic Technology and Information Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2682473","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Infrared target images have low signal-to-noise ratio, blurred edges and missing textures, which make it a great challenge to identify the target and achieve stable tracking in the tracking process. However, ordinary target trackers use feature fusion as a convolutional operation, which is a local matching process that easily leads to the absence of high-level semantic information of the image, and is further limited on infrared images. Inspired by transformer, its attention mechanism can capture global features, as well as contextual relationships between features, and can well establish the association between remote features , long-range dependency and other advantages, we designed transofmer-based infrared target tracker, which is a network that performs feature enhancement and fusion on infrared images by tranformer, and classifies and regresses targets by classification head, and has proved the effectiveness of the method by conducting extensive experiments on challenging benchmarks.