{"title":"A comprehensive review on YOLO versions for object detection","authors":"Ayşe Aybilge Murat , Mustafa Servet Kiran","doi":"10.1016/j.jestch.2025.102161","DOIUrl":null,"url":null,"abstract":"<div><div>The need for methods used for object detection has gained increasing momentum in recent years. Starting with traditional image processing techniques, this process has been facilitated by the addition of deep learning. Object detection is currently used in areas such as autonomous vehicles, disease diagnosis, robotic vision and industry. The types of systems that are predicted to be needed more and more in the age of developing technology are also increasing. In particular, YOLO (You Only Look Once), which is mostly preferred in real-time object detection, is preferred because it achieves high accuracy in a short time. This paper analyses the main versions of the YOLO algorithm since its first release. The paper systematically analyses the architectural differences between the versions of the YOLO algorithm, the strengths and weaknesses of the models and their contribution to performance. At the same time, in most of the previous studies on YOLO, a comprehensive comparison of the YOLOv9-v11 models is not presented and new architectural features are not evaluated. This review provides an in-depth analysis of the main versions from YOLOv1 to YOLOv11, including recent innovations such as NMS-free, Oriented Bounding Boxes (OBB), GELAN and PGI. This work is intended to be a useful guide for researchers and developers interested in the field.</div></div>","PeriodicalId":48609,"journal":{"name":"Engineering Science and Technology-An International Journal-Jestech","volume":"70 ","pages":"Article 102161"},"PeriodicalIF":5.4000,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Science and Technology-An International Journal-Jestech","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2215098625002162","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The need for methods used for object detection has gained increasing momentum in recent years. Starting with traditional image processing techniques, this process has been facilitated by the addition of deep learning. Object detection is currently used in areas such as autonomous vehicles, disease diagnosis, robotic vision and industry. The types of systems that are predicted to be needed more and more in the age of developing technology are also increasing. In particular, YOLO (You Only Look Once), which is mostly preferred in real-time object detection, is preferred because it achieves high accuracy in a short time. This paper analyses the main versions of the YOLO algorithm since its first release. The paper systematically analyses the architectural differences between the versions of the YOLO algorithm, the strengths and weaknesses of the models and their contribution to performance. At the same time, in most of the previous studies on YOLO, a comprehensive comparison of the YOLOv9-v11 models is not presented and new architectural features are not evaluated. This review provides an in-depth analysis of the main versions from YOLOv1 to YOLOv11, including recent innovations such as NMS-free, Oriented Bounding Boxes (OBB), GELAN and PGI. This work is intended to be a useful guide for researchers and developers interested in the field.
期刊介绍:
Engineering Science and Technology, an International Journal (JESTECH) (formerly Technology), a peer-reviewed quarterly engineering journal, publishes both theoretical and experimental high quality papers of permanent interest, not previously published in journals, in the field of engineering and applied science which aims to promote the theory and practice of technology and engineering. In addition to peer-reviewed original research papers, the Editorial Board welcomes original research reports, state-of-the-art reviews and communications in the broadly defined field of engineering science and technology.
The scope of JESTECH includes a wide spectrum of subjects including:
-Electrical/Electronics and Computer Engineering (Biomedical Engineering and Instrumentation; Coding, Cryptography, and Information Protection; Communications, Networks, Mobile Computing and Distributed Systems; Compilers and Operating Systems; Computer Architecture, Parallel Processing, and Dependability; Computer Vision and Robotics; Control Theory; Electromagnetic Waves, Microwave Techniques and Antennas; Embedded Systems; Integrated Circuits, VLSI Design, Testing, and CAD; Microelectromechanical Systems; Microelectronics, and Electronic Devices and Circuits; Power, Energy and Energy Conversion Systems; Signal, Image, and Speech Processing)
-Mechanical and Civil Engineering (Automotive Technologies; Biomechanics; Construction Materials; Design and Manufacturing; Dynamics and Control; Energy Generation, Utilization, Conversion, and Storage; Fluid Mechanics and Hydraulics; Heat and Mass Transfer; Micro-Nano Sciences; Renewable and Sustainable Energy Technologies; Robotics and Mechatronics; Solid Mechanics and Structure; Thermal Sciences)
-Metallurgical and Materials Engineering (Advanced Materials Science; Biomaterials; Ceramic and Inorgnanic Materials; Electronic-Magnetic Materials; Energy and Environment; Materials Characterizastion; Metallurgy; Polymers and Nanocomposites)