{"title":"复杂环境中检测和识别物体的深度学习技术概览","authors":"Ashish Kumar Dogra , Vipal Sharma , Harsh Sohal","doi":"10.1016/j.cosrev.2024.100686","DOIUrl":null,"url":null,"abstract":"<div><div>Object detection has been used extensively in daily life, and in computer vision, this sub-field is highly significant and challenging. The field of object detection has been transformed by deep learning. Deep learning-based methods have shown to be remarkably effective at identifying and localizing objects in images and video streams when it comes to object detection. Deep learning algorithms can precisely locate and localize objects inside photos and videos because of their capacity to learn complex and nonlinear patterns in data. Deep learning models may also be trained on big datasets with minimal human intervention, allowing them to rapidly improve their performance. This makes deep learning models useful for applications such as self-driving cars, recognizing faces, and healthcare diagnosis. The purpose of this study was to gain an in-depth understanding of the primary state of development for the object detection pipeline in complex environments. Initially, this study describes the benchmark datasets and analyzes the typical detection model, and then, the paper systematic approach covers both one-stage and two-stage detectors, giving a thorough overview of object detection techniques in complex environments. We also discuss the new and traditional applications of object detection. In the end, the study reviews how well various topologies perform over a range of parameters. The study has covered a total of 119 articles, of which 27% are related to one-stage detectors, 26% to two-stage detectors, 24% to supporting data related to deep learning, 14% to survey articles, 8% to the datasets covered in the study, and the remaining 1% to the book chapters.</div></div>","PeriodicalId":48633,"journal":{"name":"Computer Science Review","volume":null,"pages":null},"PeriodicalIF":13.3000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A survey of deep learning techniques for detecting and recognizing objects in complex environments\",\"authors\":\"Ashish Kumar Dogra , Vipal Sharma , Harsh Sohal\",\"doi\":\"10.1016/j.cosrev.2024.100686\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Object detection has been used extensively in daily life, and in computer vision, this sub-field is highly significant and challenging. The field of object detection has been transformed by deep learning. Deep learning-based methods have shown to be remarkably effective at identifying and localizing objects in images and video streams when it comes to object detection. Deep learning algorithms can precisely locate and localize objects inside photos and videos because of their capacity to learn complex and nonlinear patterns in data. Deep learning models may also be trained on big datasets with minimal human intervention, allowing them to rapidly improve their performance. This makes deep learning models useful for applications such as self-driving cars, recognizing faces, and healthcare diagnosis. The purpose of this study was to gain an in-depth understanding of the primary state of development for the object detection pipeline in complex environments. Initially, this study describes the benchmark datasets and analyzes the typical detection model, and then, the paper systematic approach covers both one-stage and two-stage detectors, giving a thorough overview of object detection techniques in complex environments. We also discuss the new and traditional applications of object detection. In the end, the study reviews how well various topologies perform over a range of parameters. The study has covered a total of 119 articles, of which 27% are related to one-stage detectors, 26% to two-stage detectors, 24% to supporting data related to deep learning, 14% to survey articles, 8% to the datasets covered in the study, and the remaining 1% to the book chapters.</div></div>\",\"PeriodicalId\":48633,\"journal\":{\"name\":\"Computer Science Review\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":13.3000,\"publicationDate\":\"2024-10-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Science Review\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1574013724000704\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Science Review","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574013724000704","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
A survey of deep learning techniques for detecting and recognizing objects in complex environments
Object detection has been used extensively in daily life, and in computer vision, this sub-field is highly significant and challenging. The field of object detection has been transformed by deep learning. Deep learning-based methods have shown to be remarkably effective at identifying and localizing objects in images and video streams when it comes to object detection. Deep learning algorithms can precisely locate and localize objects inside photos and videos because of their capacity to learn complex and nonlinear patterns in data. Deep learning models may also be trained on big datasets with minimal human intervention, allowing them to rapidly improve their performance. This makes deep learning models useful for applications such as self-driving cars, recognizing faces, and healthcare diagnosis. The purpose of this study was to gain an in-depth understanding of the primary state of development for the object detection pipeline in complex environments. Initially, this study describes the benchmark datasets and analyzes the typical detection model, and then, the paper systematic approach covers both one-stage and two-stage detectors, giving a thorough overview of object detection techniques in complex environments. We also discuss the new and traditional applications of object detection. In the end, the study reviews how well various topologies perform over a range of parameters. The study has covered a total of 119 articles, of which 27% are related to one-stage detectors, 26% to two-stage detectors, 24% to supporting data related to deep learning, 14% to survey articles, 8% to the datasets covered in the study, and the remaining 1% to the book chapters.
期刊介绍:
Computer Science Review, a publication dedicated to research surveys and expository overviews of open problems in computer science, targets a broad audience within the field seeking comprehensive insights into the latest developments. The journal welcomes articles from various fields as long as their content impacts the advancement of computer science. In particular, articles that review the application of well-known Computer Science methods to other areas are in scope only if these articles advance the fundamental understanding of those methods.