{"title":"上下文感知网络的人员搜索","authors":"Yu Gu, Tao Lu","doi":"10.1109/ICRCV55858.2022.9953260","DOIUrl":null,"url":null,"abstract":"The key to effective person search is aiming to localize the pedestrians and obtain the discriminative embeddings representation for person ReID from numerous surveillance scene images. And the existing one-step anchor-free methods can achieve a trade-off between speed and accuracy, but it can not fully exploit the contextual feature information of search context, resulting in undesirable localization. To alleviate this issue, we propose a Context-Aware Network for Person Search (CANPS) to delve into the high-level contextual information. In CANPS, firstly, context encoder is proposed to bridge the gap between the feature maps, achieved by distributing rich contextual information to prediction head layers. Second, we design the malleable center sampling strategy to reasonably expose sample region and focus on the centroid feature representations. What’s more, we design above components in a trainable bag-of-freebies manner, so that real-time person search can greatly improve the accuracy without increasing extra inference cost. Extensive experiments show that the approach we proposed can outperform current state-of-the-art methods in public CUHK-SYSU datasets.","PeriodicalId":399667,"journal":{"name":"2022 4th International Conference on Robotics and Computer Vision (ICRCV)","volume":"106 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Context-Aware Network for Person Search\",\"authors\":\"Yu Gu, Tao Lu\",\"doi\":\"10.1109/ICRCV55858.2022.9953260\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The key to effective person search is aiming to localize the pedestrians and obtain the discriminative embeddings representation for person ReID from numerous surveillance scene images. And the existing one-step anchor-free methods can achieve a trade-off between speed and accuracy, but it can not fully exploit the contextual feature information of search context, resulting in undesirable localization. To alleviate this issue, we propose a Context-Aware Network for Person Search (CANPS) to delve into the high-level contextual information. In CANPS, firstly, context encoder is proposed to bridge the gap between the feature maps, achieved by distributing rich contextual information to prediction head layers. Second, we design the malleable center sampling strategy to reasonably expose sample region and focus on the centroid feature representations. What’s more, we design above components in a trainable bag-of-freebies manner, so that real-time person search can greatly improve the accuracy without increasing extra inference cost. Extensive experiments show that the approach we proposed can outperform current state-of-the-art methods in public CUHK-SYSU datasets.\",\"PeriodicalId\":399667,\"journal\":{\"name\":\"2022 4th International Conference on Robotics and Computer Vision (ICRCV)\",\"volume\":\"106 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 4th International Conference on Robotics and Computer Vision (ICRCV)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICRCV55858.2022.9953260\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th International Conference on Robotics and Computer Vision (ICRCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRCV55858.2022.9953260","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The key to effective person search is aiming to localize the pedestrians and obtain the discriminative embeddings representation for person ReID from numerous surveillance scene images. And the existing one-step anchor-free methods can achieve a trade-off between speed and accuracy, but it can not fully exploit the contextual feature information of search context, resulting in undesirable localization. To alleviate this issue, we propose a Context-Aware Network for Person Search (CANPS) to delve into the high-level contextual information. In CANPS, firstly, context encoder is proposed to bridge the gap between the feature maps, achieved by distributing rich contextual information to prediction head layers. Second, we design the malleable center sampling strategy to reasonably expose sample region and focus on the centroid feature representations. What’s more, we design above components in a trainable bag-of-freebies manner, so that real-time person search can greatly improve the accuracy without increasing extra inference cost. Extensive experiments show that the approach we proposed can outperform current state-of-the-art methods in public CUHK-SYSU datasets.