Qingyang Lu, Hong Zhu, Guangling Yuan, Congli Li, Xiaoyan Qin
{"title":"CATrack:结合卷积和注意方法的视觉目标跟踪","authors":"Qingyang Lu, Hong Zhu, Guangling Yuan, Congli Li, Xiaoyan Qin","doi":"10.1109/AINIT59027.2023.10212501","DOIUrl":null,"url":null,"abstract":"The current popular tracking frameworks prioritize the modeling of global relationships while neglecting research on local feature extraction. This paper introduces CATrack, a novel approach for visual object tracking that integrates convolution and attention into a unified framework. In contrast to prior research, it constructs a tracking module using a unified framework that incorporates convolution and attention as its core components. Our method effectively bridges the gap between the two calculation methods. It improves the ability to extract fundamental features, integrates past experience in the tracking field more effectively, while balancing local and global contextual information. The proposed tracker achieves competitive performance on 5 challenging short-term and long-term benchmarks and can run at real-time speed.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CATrack: Combining Convolutional and Attentional Methods for Visual Object Tracking\",\"authors\":\"Qingyang Lu, Hong Zhu, Guangling Yuan, Congli Li, Xiaoyan Qin\",\"doi\":\"10.1109/AINIT59027.2023.10212501\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The current popular tracking frameworks prioritize the modeling of global relationships while neglecting research on local feature extraction. This paper introduces CATrack, a novel approach for visual object tracking that integrates convolution and attention into a unified framework. In contrast to prior research, it constructs a tracking module using a unified framework that incorporates convolution and attention as its core components. Our method effectively bridges the gap between the two calculation methods. It improves the ability to extract fundamental features, integrates past experience in the tracking field more effectively, while balancing local and global contextual information. The proposed tracker achieves competitive performance on 5 challenging short-term and long-term benchmarks and can run at real-time speed.\",\"PeriodicalId\":276778,\"journal\":{\"name\":\"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AINIT59027.2023.10212501\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AINIT59027.2023.10212501","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
CATrack: Combining Convolutional and Attentional Methods for Visual Object Tracking
The current popular tracking frameworks prioritize the modeling of global relationships while neglecting research on local feature extraction. This paper introduces CATrack, a novel approach for visual object tracking that integrates convolution and attention into a unified framework. In contrast to prior research, it constructs a tracking module using a unified framework that incorporates convolution and attention as its core components. Our method effectively bridges the gap between the two calculation methods. It improves the ability to extract fundamental features, integrates past experience in the tracking field more effectively, while balancing local and global contextual information. The proposed tracker achieves competitive performance on 5 challenging short-term and long-term benchmarks and can run at real-time speed.