Context-Aware Network for Person Search

2022 4th International Conference on Robotics and Computer Vision (ICRCV) Pub Date: 2022-09-25 DOI: 10.1109/ICRCV55858.2022.9953260

Yu Gu, Tao Lu


Citations: 0

Abstract


Context-Aware Network for Person Search

Effective person search requires localizing pedestrians and obtaining discriminative embedding representations for person re-identification (ReID) from numerous surveillance scene images. Existing one-step anchor-free methods achieve a trade-off between speed and accuracy, but they cannot fully exploit the contextual feature information of the search scene, which results in poor localization. To alleviate this issue, we propose a Context-Aware Network for Person Search (CANPS) that exploits high-level contextual information. In CANPS, we first propose a context encoder that bridges the gap between feature maps by distributing rich contextual information to the prediction head layers. Second, we design a malleable center sampling strategy that reasonably exposes the sample region and focuses on centroid feature representations. Moreover, we design these components in a trainable bag-of-freebies manner, so that real-time person search gains substantially in accuracy without any extra inference cost. Extensive experiments show that the proposed approach outperforms current state-of-the-art methods on the public CUHK-SYSU dataset.
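The abstract does not say how the malleable center sampling strategy works, so the following is only a reference point, not the authors' method. It is a minimal sketch of ordinary fixed-radius center sampling, the kind commonly used in anchor-free detectors such as FCOS: a feature-map location counts as a positive sample only if it lies in a small square around the ground-truth box center. The function names, the `radius` parameter and its default value are assumptions made for illustration.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in image pixels
Point = Tuple[float, float]


def grid_points(width: int, height: int, stride: int) -> List[Point]:
    """Image-space centers of the cells of a feature map with the given stride."""
    return [
        (x * stride + stride / 2.0, y * stride + stride / 2.0)
        for y in range(height // stride)
        for x in range(width // stride)
    ]


def center_sample_mask(
    points: List[Point], box: Box, stride: int, radius: float = 1.5
) -> List[bool]:
    """Mark the points that fall inside the center region of a box.

    The center region is a square of half-width radius * stride around the
    box center, clipped so that it never extends beyond the box itself.
    """
    x1, y1, x2, y2 = box
    cx, cy = (x1 + x2) / 2.0, (y1 + y2) / 2.0
    half = radius * stride
    # Clip the sampling square to the ground-truth box.
    sx1, sy1 = max(cx - half, x1), max(cy - half, y1)
    sx2, sy2 = min(cx + half, x2), min(cy + half, y2)
    return [sx1 < px < sx2 and sy1 < py < sy2 for px, py in points]


# Hypothetical example: a 64x64 image, a stride-8 feature map, one pedestrian box.
points = grid_points(64, 64, stride=8)
mask = center_sample_mask(points, box=(8, 8, 56, 56), stride=8)
positives = [p for p, keep in zip(points, mask) if keep]
```

In this fixed-radius form, a larger `radius` exposes more of the box as positive samples and a smaller one concentrates training on locations near the centroid. A "malleable" strategy presumably adjusts that region rather than fixing it, but the abstract gives no details on how.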

