{"title":"Efficient Detection of Small and Complex Objects for Autonomous Driving Using Deep Learning","authors":"Ansh Sharma, Rashmi Gupta","doi":"10.1109/CSCITA55725.2023.10104969","DOIUrl":null,"url":null,"abstract":"The YOLOv2 is one of the most prominent model used for object detection, it works on the concept of anchor boxes. However, this model is prone to some problems like double anchor boxes, missing small objects, and high time complexity. In this paper, we aim to solve the problem of double anchor boxes and undetected small objects by tuning the parameters like intersection over union (IoU) and customizing non-max suppression thresholds. Also, to reduce the time complexity of the model, we have proposed the use of depth wise convolution (DW-Conv2D) instead of fundamental convolution (Conv2D) in this paper. Once we applied the proposed model to datasets like PASCAL VOC07 and VOC12, we observed significant improvements like reduced floating-point operations per second by 9.5% and better accuracy than the existing state-of-the-art models.","PeriodicalId":224479,"journal":{"name":"2023 International Conference on Communication System, Computing and IT Applications (CSCITA)","volume":"208 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Communication System, Computing and IT Applications (CSCITA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSCITA55725.2023.10104969","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The YOLOv2 is one of the most prominent model used for object detection, it works on the concept of anchor boxes. However, this model is prone to some problems like double anchor boxes, missing small objects, and high time complexity. In this paper, we aim to solve the problem of double anchor boxes and undetected small objects by tuning the parameters like intersection over union (IoU) and customizing non-max suppression thresholds. Also, to reduce the time complexity of the model, we have proposed the use of depth wise convolution (DW-Conv2D) instead of fundamental convolution (Conv2D) in this paper. Once we applied the proposed model to datasets like PASCAL VOC07 and VOC12, we observed significant improvements like reduced floating-point operations per second by 9.5% and better accuracy than the existing state-of-the-art models.