少射目标检测:综述

ACM Computing Surveys (CSUR) Pub Date : 2022-02-24 DOI:10.1145/3519022

Simone Antonelli, D. Avola, L. Cinque, Donato Crisostomi, G. Foresti, Fabio Galasso, Marco Raoul Marini, Alessio Mecca, D. Pannone

{"title":"少射目标检测:综述","authors":"Simone Antonelli, D. Avola, L. Cinque, Donato Crisostomi, G. Foresti, Fabio Galasso, Marco Raoul Marini, Alessio Mecca, D. Pannone","doi":"10.1145/3519022","DOIUrl":null,"url":null,"abstract":"Deep learning approaches have recently raised the bar in many fields, from Natural Language Processing to Computer Vision, by leveraging large amounts of data. However, they could fail when the retrieved information is not enough to fit the vast number of parameters, frequently resulting in overfitting and therefore in poor generalizability. Few-Shot Learning aims at designing models that can effectively operate in a scarce data regime, yielding learning strategies that only need few supervised examples to be trained. These procedures are of both practical and theoretical importance, as they are crucial for many real-life scenarios in which data is either costly or even impossible to retrieve. Moreover, they bridge the distance between current data-hungry models and human-like generalization capability. Computer vision offers various tasks that can be few-shot inherent, such as person re-identification. This survey, which to the best of our knowledge is the first tackling this problem, is focused on Few-Shot Object Detection, which has received far less attention compared to Few-Shot Classification due to the intrinsic challenge level. In this regard, this review presents an extensive description of the approaches that have been tested in the current literature, discussing their pros and cons, and classifying them according to a rigorous taxonomy.","PeriodicalId":7000,"journal":{"name":"ACM Computing Surveys (CSUR)","volume":"232 ","pages":"1 - 37"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":"{\"title\":\"Few-Shot Object Detection: A Survey\",\"authors\":\"Simone Antonelli, D. Avola, L. Cinque, Donato Crisostomi, G. Foresti, Fabio Galasso, Marco Raoul Marini, Alessio Mecca, D. Pannone\",\"doi\":\"10.1145/3519022\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning approaches have recently raised the bar in many fields, from Natural Language Processing to Computer Vision, by leveraging large amounts of data. However, they could fail when the retrieved information is not enough to fit the vast number of parameters, frequently resulting in overfitting and therefore in poor generalizability. Few-Shot Learning aims at designing models that can effectively operate in a scarce data regime, yielding learning strategies that only need few supervised examples to be trained. These procedures are of both practical and theoretical importance, as they are crucial for many real-life scenarios in which data is either costly or even impossible to retrieve. Moreover, they bridge the distance between current data-hungry models and human-like generalization capability. Computer vision offers various tasks that can be few-shot inherent, such as person re-identification. This survey, which to the best of our knowledge is the first tackling this problem, is focused on Few-Shot Object Detection, which has received far less attention compared to Few-Shot Classification due to the intrinsic challenge level. In this regard, this review presents an extensive description of the approaches that have been tested in the current literature, discussing their pros and cons, and classifying them according to a rigorous taxonomy.\",\"PeriodicalId\":7000,\"journal\":{\"name\":\"ACM Computing Surveys (CSUR)\",\"volume\":\"232 \",\"pages\":\"1 - 37\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"17\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Computing Surveys (CSUR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3519022\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Computing Surveys (CSUR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3519022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 17

摘要

通过利用大量数据，深度学习方法最近在许多领域提高了标准，从自然语言处理到计算机视觉。然而，当检索到的信息不足以拟合大量参数时，它们可能会失败，经常导致过拟合，从而导致较差的泛化性。few - shot Learning旨在设计能够在稀缺数据体系中有效运行的模型，产生只需要少量监督示例进行训练的学习策略。这些程序在实践和理论上都很重要，因为它们对于许多现实生活中的场景至关重要，在这些场景中，数据要么代价高昂，要么甚至无法检索。此外，它们弥合了当前数据饥渴模型与类似人类的泛化能力之间的距离。计算机视觉提供了各种各样的任务，这些任务可能很少是固有的，比如人的重新识别。据我们所知，这项调查是第一次解决这个问题，主要集中在Few-Shot目标检测上，由于其固有的挑战水平，与Few-Shot分类相比，它受到的关注要少得多。在这方面，这篇综述提出了一个广泛的描述，已经在当前的文献中测试的方法，讨论他们的优点和缺点，并根据严格的分类法进行分类。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Few-Shot Object Detection: A Survey

Deep learning approaches have recently raised the bar in many fields, from Natural Language Processing to Computer Vision, by leveraging large amounts of data. However, they could fail when the retrieved information is not enough to fit the vast number of parameters, frequently resulting in overfitting and therefore in poor generalizability. Few-Shot Learning aims at designing models that can effectively operate in a scarce data regime, yielding learning strategies that only need few supervised examples to be trained. These procedures are of both practical and theoretical importance, as they are crucial for many real-life scenarios in which data is either costly or even impossible to retrieve. Moreover, they bridge the distance between current data-hungry models and human-like generalization capability. Computer vision offers various tasks that can be few-shot inherent, such as person re-identification. This survey, which to the best of our knowledge is the first tackling this problem, is focused on Few-Shot Object Detection, which has received far less attention compared to Few-Shot Classification due to the intrinsic challenge level. In this regard, this review presents an extensive description of the approaches that have been tested in the current literature, discussing their pros and cons, and classifying them according to a rigorous taxonomy.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ACM Computing Surveys (CSUR)

自引率

0.00%

发文量