Dianlong You, P. Wang, Y. Zhang, Ling Wang, Shunfu Jin
{"title":"基于反向传播和动态学习的少镜头目标检测","authors":"Dianlong You, P. Wang, Y. Zhang, Ling Wang, Shunfu Jin","doi":"10.1109/ICME55011.2023.00493","DOIUrl":null,"url":null,"abstract":"Utilizing traditional object detectors to build a few-shot object detection (FSOD) model ignores the differences between classification and regression tasks and causes task conflict and class confusion, resulting in a decline in classification performance. In contrast, this paper focuses on the above shortcomings and utilizes the strategies of Back Propagation and Dynamic Learning to construct a model for addressing FSOD, named BPDL. Our BPDL has a two-fold main idea: a) it uses the optimized localization boxes to alleviate the task conflict and refine classification features by a correction loss, and b) it develops a dynamic learning strategy to filter the confusing features and mine more realistic prototype representations of the categories to calibrate classification. Extensive experiments on multiple benchmarks show that our BPDL model outperforms existing methods and advances the FSOD task’s state-of-the-art.","PeriodicalId":321830,"journal":{"name":"2023 IEEE International Conference on Multimedia and Expo (ICME)","volume":"31 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Few-Shot Object Detection via Back Propagation and Dynamic Learning\",\"authors\":\"Dianlong You, P. Wang, Y. Zhang, Ling Wang, Shunfu Jin\",\"doi\":\"10.1109/ICME55011.2023.00493\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Utilizing traditional object detectors to build a few-shot object detection (FSOD) model ignores the differences between classification and regression tasks and causes task conflict and class confusion, resulting in a decline in classification performance. In contrast, this paper focuses on the above shortcomings and utilizes the strategies of Back Propagation and Dynamic Learning to construct a model for addressing FSOD, named BPDL. Our BPDL has a two-fold main idea: a) it uses the optimized localization boxes to alleviate the task conflict and refine classification features by a correction loss, and b) it develops a dynamic learning strategy to filter the confusing features and mine more realistic prototype representations of the categories to calibrate classification. Extensive experiments on multiple benchmarks show that our BPDL model outperforms existing methods and advances the FSOD task’s state-of-the-art.\",\"PeriodicalId\":321830,\"journal\":{\"name\":\"2023 IEEE International Conference on Multimedia and Expo (ICME)\",\"volume\":\"31 3\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Multimedia and Expo (ICME)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICME55011.2023.00493\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Multimedia and Expo (ICME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICME55011.2023.00493","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Few-Shot Object Detection via Back Propagation and Dynamic Learning
Utilizing traditional object detectors to build a few-shot object detection (FSOD) model ignores the differences between classification and regression tasks and causes task conflict and class confusion, resulting in a decline in classification performance. In contrast, this paper focuses on the above shortcomings and utilizes the strategies of Back Propagation and Dynamic Learning to construct a model for addressing FSOD, named BPDL. Our BPDL has a two-fold main idea: a) it uses the optimized localization boxes to alleviate the task conflict and refine classification features by a correction loss, and b) it develops a dynamic learning strategy to filter the confusing features and mine more realistic prototype representations of the categories to calibrate classification. Extensive experiments on multiple benchmarks show that our BPDL model outperforms existing methods and advances the FSOD task’s state-of-the-art.