{"title":"FYCFNet: Vehicle and Pedestrian Detection Network based on Multi-model Fusion","authors":"Pnegyu Dai","doi":"10.1109/cvidliccea56201.2022.9825072","DOIUrl":null,"url":null,"abstract":"Vision-based solutions for target detection in autonomous driving are very much about the accuracy of detection. A correct or incorrect detection may cause or avoid a traffic accident. Therefore, in this paper, to further improve the detection accuracy of vision schemes, we propose a multi-model fusion network: Fusion Network with YoloV5 and CBNEet Faster-RCNN (FYCFNet) that fuses a one-stage target detection model and a two-stage model, which consists of three parts: the first part is a single-stage YOLOV5 [1] detection model, the second part is a Faster-RCNN [2] with CBNet-V2 [3] as the backbone, and the third part is the post-fusion head of weighted boxes fusion. We tested the performance of this network and compared it with other mainstream networks, and verified that the network achieves a very impressive accuracy improvement.","PeriodicalId":23649,"journal":{"name":"Vision","volume":"31 1","pages":"230-236"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/cvidliccea56201.2022.9825072","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Vision-based solutions for target detection in autonomous driving are very much about the accuracy of detection. A correct or incorrect detection may cause or avoid a traffic accident. Therefore, in this paper, to further improve the detection accuracy of vision schemes, we propose a multi-model fusion network: Fusion Network with YoloV5 and CBNEet Faster-RCNN (FYCFNet) that fuses a one-stage target detection model and a two-stage model, which consists of three parts: the first part is a single-stage YOLOV5 [1] detection model, the second part is a Faster-RCNN [2] with CBNet-V2 [3] as the backbone, and the third part is the post-fusion head of weighted boxes fusion. We tested the performance of this network and compared it with other mainstream networks, and verified that the network achieves a very impressive accuracy improvement.