Zicheng Zhang, Quan Liang, Zhihui Feng, W. Ji, Hansong Wang, Jinjing Hu
{"title":"改进型YOLOV4在智能驾驶场景中的应用","authors":"Zicheng Zhang, Quan Liang, Zhihui Feng, W. Ji, Hansong Wang, Jinjing Hu","doi":"10.1109/PIC53636.2021.9687039","DOIUrl":null,"url":null,"abstract":"With the development of unmanned technology, the technical innovation of invehicle vision detection system is also getting faster and faster, while the improvement of algorithm accuracy often brings an increase in the number of parameters and poor real-time performance. In his paper, the optimization of the algorithm structure of YoLoV4 target detection is achieved by using MobileNet-v3 instead of the CspDarkNet53 master Network, which has the inverse residual structure of linear bottleneck, while the lightweight attention mechanism is added to the feature extraction process, and the learning degree of feature channels is enhanced; due to the long computation time of sigmoid, it also uses ReLU6(x+3)/6 is used to approximate the original activation function due to the long computation time of sigmoid; the system parameters are reduced by constructing a depth-separable convolution instead of the normal convolution in PaNet. Meanwhile, this paper improves the original upsampling method by using dual cubic interpolation, which makes the image more smooth, less image loss and more accurate feature extraction during he upsampling method. The map% is improved from 79.1% to 81.2% on the voc dataset, reaching 58.14 FPS.","PeriodicalId":297239,"journal":{"name":"2021 IEEE International Conference on Progress in Informatics and Computing (PIC)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of Improved YOLOV4 in Intelligent Driving Scenarios\",\"authors\":\"Zicheng Zhang, Quan Liang, Zhihui Feng, W. Ji, Hansong Wang, Jinjing Hu\",\"doi\":\"10.1109/PIC53636.2021.9687039\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of unmanned technology, the technical innovation of invehicle vision detection system is also getting faster and faster, while the improvement of algorithm accuracy often brings an increase in the number of parameters and poor real-time performance. In his paper, the optimization of the algorithm structure of YoLoV4 target detection is achieved by using MobileNet-v3 instead of the CspDarkNet53 master Network, which has the inverse residual structure of linear bottleneck, while the lightweight attention mechanism is added to the feature extraction process, and the learning degree of feature channels is enhanced; due to the long computation time of sigmoid, it also uses ReLU6(x+3)/6 is used to approximate the original activation function due to the long computation time of sigmoid; the system parameters are reduced by constructing a depth-separable convolution instead of the normal convolution in PaNet. Meanwhile, this paper improves the original upsampling method by using dual cubic interpolation, which makes the image more smooth, less image loss and more accurate feature extraction during he upsampling method. The map% is improved from 79.1% to 81.2% on the voc dataset, reaching 58.14 FPS.\",\"PeriodicalId\":297239,\"journal\":{\"name\":\"2021 IEEE International Conference on Progress in Informatics and Computing (PIC)\",\"volume\":\"74 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Progress in Informatics and Computing (PIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PIC53636.2021.9687039\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Progress in Informatics and Computing (PIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PIC53636.2021.9687039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Application of Improved YOLOV4 in Intelligent Driving Scenarios
With the development of unmanned technology, the technical innovation of invehicle vision detection system is also getting faster and faster, while the improvement of algorithm accuracy often brings an increase in the number of parameters and poor real-time performance. In his paper, the optimization of the algorithm structure of YoLoV4 target detection is achieved by using MobileNet-v3 instead of the CspDarkNet53 master Network, which has the inverse residual structure of linear bottleneck, while the lightweight attention mechanism is added to the feature extraction process, and the learning degree of feature channels is enhanced; due to the long computation time of sigmoid, it also uses ReLU6(x+3)/6 is used to approximate the original activation function due to the long computation time of sigmoid; the system parameters are reduced by constructing a depth-separable convolution instead of the normal convolution in PaNet. Meanwhile, this paper improves the original upsampling method by using dual cubic interpolation, which makes the image more smooth, less image loss and more accurate feature extraction during he upsampling method. The map% is improved from 79.1% to 81.2% on the voc dataset, reaching 58.14 FPS.