{"title":"微调更快- rcnn量身定制的特征重加权为少数镜头的目标检测","authors":"Dekang Zhu, Hongbo Guo, Tong Li, Zhipeng Meng","doi":"10.1145/3561613.3561621","DOIUrl":null,"url":null,"abstract":"Few-shot object detection has drawn more attention in computer vision now. One acknowledged task setting is that train the network to detect both of the base classes with abundant images and novel classes with only a few. Under this scenario, two classic pipelines of few-shot object detection are developed. One is fine-tuning, which trains the detection network on images of base classes and fine-tunes the last layer on images of both base and novel classes. The other one is meta learning, in which one pioneering model utilizes a meta-learner to transform supporting images into reweighting vectors, which are used to reweight features of the query images obtained through the feature extractor. A typical meta learning method splits the training process into two phases: meta-training and meta-testing. Firstly in the meta-training phase, the model is trained on base classes, then on both base and novel classes. In this paper, we synthesize these two pipelines together. For the network structure, we tailor Faster-RCNN to the reweighting module; for training, we follow the meta-training procedure and fine-tune the reweighting module and only the last layer of Faster-RCNN during meta-testing. Experiments on NWPU VHR-10 images show that our method improves the mAP by about 10 ∼ 20 percentages than both of the reweighting and fine-tuning methods.","PeriodicalId":348024,"journal":{"name":"Proceedings of the 5th International Conference on Control and Computer Vision","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Fine-tuning Faster-RCNN tailored to Feature Reweighting for Few-shot Object Detection\",\"authors\":\"Dekang Zhu, Hongbo Guo, Tong Li, Zhipeng Meng\",\"doi\":\"10.1145/3561613.3561621\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Few-shot object detection has drawn more attention in computer vision now. One acknowledged task setting is that train the network to detect both of the base classes with abundant images and novel classes with only a few. Under this scenario, two classic pipelines of few-shot object detection are developed. One is fine-tuning, which trains the detection network on images of base classes and fine-tunes the last layer on images of both base and novel classes. The other one is meta learning, in which one pioneering model utilizes a meta-learner to transform supporting images into reweighting vectors, which are used to reweight features of the query images obtained through the feature extractor. A typical meta learning method splits the training process into two phases: meta-training and meta-testing. Firstly in the meta-training phase, the model is trained on base classes, then on both base and novel classes. In this paper, we synthesize these two pipelines together. For the network structure, we tailor Faster-RCNN to the reweighting module; for training, we follow the meta-training procedure and fine-tune the reweighting module and only the last layer of Faster-RCNN during meta-testing. Experiments on NWPU VHR-10 images show that our method improves the mAP by about 10 ∼ 20 percentages than both of the reweighting and fine-tuning methods.\",\"PeriodicalId\":348024,\"journal\":{\"name\":\"Proceedings of the 5th International Conference on Control and Computer Vision\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th International Conference on Control and Computer Vision\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3561613.3561621\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Control and Computer Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3561613.3561621","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Fine-tuning Faster-RCNN tailored to Feature Reweighting for Few-shot Object Detection
Few-shot object detection has drawn more attention in computer vision now. One acknowledged task setting is that train the network to detect both of the base classes with abundant images and novel classes with only a few. Under this scenario, two classic pipelines of few-shot object detection are developed. One is fine-tuning, which trains the detection network on images of base classes and fine-tunes the last layer on images of both base and novel classes. The other one is meta learning, in which one pioneering model utilizes a meta-learner to transform supporting images into reweighting vectors, which are used to reweight features of the query images obtained through the feature extractor. A typical meta learning method splits the training process into two phases: meta-training and meta-testing. Firstly in the meta-training phase, the model is trained on base classes, then on both base and novel classes. In this paper, we synthesize these two pipelines together. For the network structure, we tailor Faster-RCNN to the reweighting module; for training, we follow the meta-training procedure and fine-tune the reweighting module and only the last layer of Faster-RCNN during meta-testing. Experiments on NWPU VHR-10 images show that our method improves the mAP by about 10 ∼ 20 percentages than both of the reweighting and fine-tuning methods.