微调更快- rcnn量身定制的特征重加权为少数镜头的目标检测

Proceedings of the 5th International Conference on Control and Computer Vision Pub Date : 2022-08-19 DOI:10.1145/3561613.3561621

Dekang Zhu, Hongbo Guo, Tong Li, Zhipeng Meng

{"title":"微调更快- rcnn量身定制的特征重加权为少数镜头的目标检测","authors":"Dekang Zhu, Hongbo Guo, Tong Li, Zhipeng Meng","doi":"10.1145/3561613.3561621","DOIUrl":null,"url":null,"abstract":"Few-shot object detection has drawn more attention in computer vision now. One acknowledged task setting is that train the network to detect both of the base classes with abundant images and novel classes with only a few. Under this scenario, two classic pipelines of few-shot object detection are developed. One is fine-tuning, which trains the detection network on images of base classes and fine-tunes the last layer on images of both base and novel classes. The other one is meta learning, in which one pioneering model utilizes a meta-learner to transform supporting images into reweighting vectors, which are used to reweight features of the query images obtained through the feature extractor. A typical meta learning method splits the training process into two phases: meta-training and meta-testing. Firstly in the meta-training phase, the model is trained on base classes, then on both base and novel classes. In this paper, we synthesize these two pipelines together. For the network structure, we tailor Faster-RCNN to the reweighting module; for training, we follow the meta-training procedure and fine-tune the reweighting module and only the last layer of Faster-RCNN during meta-testing. Experiments on NWPU VHR-10 images show that our method improves the mAP by about 10 ∼ 20 percentages than both of the reweighting and fine-tuning methods.","PeriodicalId":348024,"journal":{"name":"Proceedings of the 5th International Conference on Control and Computer Vision","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Fine-tuning Faster-RCNN tailored to Feature Reweighting for Few-shot Object Detection\",\"authors\":\"Dekang Zhu, Hongbo Guo, Tong Li, Zhipeng Meng\",\"doi\":\"10.1145/3561613.3561621\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Few-shot object detection has drawn more attention in computer vision now. One acknowledged task setting is that train the network to detect both of the base classes with abundant images and novel classes with only a few. Under this scenario, two classic pipelines of few-shot object detection are developed. One is fine-tuning, which trains the detection network on images of base classes and fine-tunes the last layer on images of both base and novel classes. The other one is meta learning, in which one pioneering model utilizes a meta-learner to transform supporting images into reweighting vectors, which are used to reweight features of the query images obtained through the feature extractor. A typical meta learning method splits the training process into two phases: meta-training and meta-testing. Firstly in the meta-training phase, the model is trained on base classes, then on both base and novel classes. In this paper, we synthesize these two pipelines together. For the network structure, we tailor Faster-RCNN to the reweighting module; for training, we follow the meta-training procedure and fine-tune the reweighting module and only the last layer of Faster-RCNN during meta-testing. Experiments on NWPU VHR-10 images show that our method improves the mAP by about 10 ∼ 20 percentages than both of the reweighting and fine-tuning methods.\",\"PeriodicalId\":348024,\"journal\":{\"name\":\"Proceedings of the 5th International Conference on Control and Computer Vision\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th International Conference on Control and Computer Vision\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3561613.3561621\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Control and Computer Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3561613.3561621","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

目前，在计算机视觉中，少镜头目标检测是一个备受关注的问题。一个公认的任务设置是训练网络检测具有丰富图像的基本类和只有少量图像的新类。在这种情况下，开发了两种经典的少镜头目标检测方法。一种是微调，在基类图像上训练检测网络，并在基类和新类图像上对最后一层进行微调。另一种是元学习，其中一个开创性的模型利用元学习器将支持图像转换为重加权向量，用于对通过特征提取器获得的查询图像的特征进行重加权。典型的元学习方法将训练过程分为两个阶段:元训练和元测试。在元训练阶段，首先对基本类进行训练，然后对基本类和新类进行训练。在本文中，我们将这两种管道综合起来。对于网络结构，我们为重权模块定制了Faster-RCNN;对于训练，我们遵循元训练过程，并在元测试期间微调重权重模块和fast - rcnn的最后一层。在NWPU VHR-10图像上的实验表明，我们的方法比重加权和微调方法提高了约10 ~ 20个百分点的mAP。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Fine-tuning Faster-RCNN tailored to Feature Reweighting for Few-shot Object Detection

Few-shot object detection has drawn more attention in computer vision now. One acknowledged task setting is that train the network to detect both of the base classes with abundant images and novel classes with only a few. Under this scenario, two classic pipelines of few-shot object detection are developed. One is fine-tuning, which trains the detection network on images of base classes and fine-tunes the last layer on images of both base and novel classes. The other one is meta learning, in which one pioneering model utilizes a meta-learner to transform supporting images into reweighting vectors, which are used to reweight features of the query images obtained through the feature extractor. A typical meta learning method splits the training process into two phases: meta-training and meta-testing. Firstly in the meta-training phase, the model is trained on base classes, then on both base and novel classes. In this paper, we synthesize these two pipelines together. For the network structure, we tailor Faster-RCNN to the reweighting module; for training, we follow the meta-training procedure and fine-tune the reweighting module and only the last layer of Faster-RCNN during meta-testing. Experiments on NWPU VHR-10 images show that our method improves the mAP by about 10 ∼ 20 percentages than both of the reweighting and fine-tuning methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 5th International Conference on Control and Computer Vision

自引率

0.00%

发文量