多目标跟踪中基于全局和局部上下文的解耦特征

Conference on Optics and Communication Technology Pub Date : 2022-11-18 DOI:10.1117/12.2653540

Yixing Su, Hongbing Ma, Shengjin Wang

{"title":"多目标跟踪中基于全局和局部上下文的解耦特征","authors":"Yixing Su, Hongbing Ma, Shengjin Wang","doi":"10.1117/12.2653540","DOIUrl":null,"url":null,"abstract":"Multi-object tracking (MOT) system usually consists of two tasks, object detection and re-identification (ReID). Current MOT methods tend to join detection and ReID in a single network to enhance inference speed. Such one-shot models allow joint optimization of detection and Re-ID via a shared backbone, reducing computation cost. However, the different demands of features between the two tasks in one-shot systems lead to competition in the optimization procedure. The detection task needs the features of the instances with the same class to be similar, while the ReID task needs the features of different instances to be distinguishable. Existing methods address the contradiction by disentangling the features into detection-specific and ReID-specific features. But these methods neglect the discussion of semantic interpretation of disentangling modules. In this paper, we propose a feature decoupling module, Global and Local Context-based Decoupling Module (GLCD), to disentangle features extracted by the backbone into two task-specific features. By extracting global and local contexts, the two tasks can choose different contexts by learnable parameters to enforce each self. We conduct our decoupling module into SOTA one-shot MOT method and experiments show performance improvement.","PeriodicalId":253792,"journal":{"name":"Conference on Optics and Communication Technology","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Decoupling features via global and local contexts in multi-object tracking\",\"authors\":\"Yixing Su, Hongbing Ma, Shengjin Wang\",\"doi\":\"10.1117/12.2653540\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multi-object tracking (MOT) system usually consists of two tasks, object detection and re-identification (ReID). Current MOT methods tend to join detection and ReID in a single network to enhance inference speed. Such one-shot models allow joint optimization of detection and Re-ID via a shared backbone, reducing computation cost. However, the different demands of features between the two tasks in one-shot systems lead to competition in the optimization procedure. The detection task needs the features of the instances with the same class to be similar, while the ReID task needs the features of different instances to be distinguishable. Existing methods address the contradiction by disentangling the features into detection-specific and ReID-specific features. But these methods neglect the discussion of semantic interpretation of disentangling modules. In this paper, we propose a feature decoupling module, Global and Local Context-based Decoupling Module (GLCD), to disentangle features extracted by the backbone into two task-specific features. By extracting global and local contexts, the two tasks can choose different contexts by learnable parameters to enforce each self. We conduct our decoupling module into SOTA one-shot MOT method and experiments show performance improvement.\",\"PeriodicalId\":253792,\"journal\":{\"name\":\"Conference on Optics and Communication Technology\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference on Optics and Communication Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2653540\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Optics and Communication Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2653540","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

多目标跟踪(MOT)系统通常包括两个任务:目标检测和再识别(ReID)。目前的MOT方法倾向于将检测和ReID结合在一个网络中，以提高推理速度。这种一次性模型允许通过共享骨干网联合优化检测和Re-ID，从而降低计算成本。然而，在一次性系统中，这两个任务对特征的不同需求导致了优化过程中的竞争。检测任务要求同一类实例的特征相似，ReID任务要求不同类实例的特征可区分。现有的方法通过将特征分解为特定于检测的特征和特定于reid的特征来解决矛盾。但是这些方法忽略了对解缠模块的语义解释的讨论。本文提出了一种特征解耦模块——基于全局和局部上下文的解耦模块(Global and Local Context-based decoupling module, GLCD)，将主干提取的特征解耦为两个特定于任务的特征。通过提取全局和局部上下文，两个任务可以通过可学习的参数选择不同的上下文来执行各自的自我。我们将解耦模块应用到SOTA一次性MOT方法中，实验结果表明，解耦模块的性能得到了改善。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Decoupling features via global and local contexts in multi-object tracking

Multi-object tracking (MOT) system usually consists of two tasks, object detection and re-identification (ReID). Current MOT methods tend to join detection and ReID in a single network to enhance inference speed. Such one-shot models allow joint optimization of detection and Re-ID via a shared backbone, reducing computation cost. However, the different demands of features between the two tasks in one-shot systems lead to competition in the optimization procedure. The detection task needs the features of the instances with the same class to be similar, while the ReID task needs the features of different instances to be distinguishable. Existing methods address the contradiction by disentangling the features into detection-specific and ReID-specific features. But these methods neglect the discussion of semantic interpretation of disentangling modules. In this paper, we propose a feature decoupling module, Global and Local Context-based Decoupling Module (GLCD), to disentangle features extracted by the backbone into two task-specific features. By extracting global and local contexts, the two tasks can choose different contexts by learnable parameters to enforce each self. We conduct our decoupling module into SOTA one-shot MOT method and experiments show performance improvement.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Conference on Optics and Communication Technology

自引率

0.00%

发文量