{"title":"基于FPGA的全景驾驶感知模型及推理加速","authors":"Yang Yang;Hui Xu;Abdullah Aman Khan;Jie Shao","doi":"10.1109/TIV.2024.3462431","DOIUrl":null,"url":null,"abstract":"Panoptic perception systems are critical for autonomous driving, as they process multiple visual tasks simultaneously, enhancing functionality. Compared with designing multiple independent networks to address various tasks, these systems exhibit reduced overall inference latency by integrating various tasks into a single network. Existing panoptic perception networks often rely on pre-trained classification models as their backbone, which are not tailored for specific tasks, thereby compromising accuracy. To address this, we propose a dual-branch backbone and a wide perception segmentation head, enhancing the effectiveness of the network for autonomous driving applications. This enhanced network can simultaneously perform vehicle object detection, drivable area segmentation, and lane segmentation. Furthermore, to meet the stringent latency requirements of autonomous driving, we implement this network using an FPGA acceleration card. In our experiments using the challenging BDD100K dataset, the model significantly surpasses the baseline in accuracy for all tasks. To satisfy the increased real-time demands, the VCK5000 FPGA is used, which achieves inference speeds approximately 35.2 times faster than GPU-based deployments and about 41.5 times energy efficiency, providing significant advantages in resource-constrained scenarios.","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"10 6","pages":"3697-3704"},"PeriodicalIF":14.3000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Panoptic Driving Perception Model and Inference Acceleration Based on FPGA\",\"authors\":\"Yang Yang;Hui Xu;Abdullah Aman Khan;Jie Shao\",\"doi\":\"10.1109/TIV.2024.3462431\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Panoptic perception systems are critical for autonomous driving, as they process multiple visual tasks simultaneously, enhancing functionality. Compared with designing multiple independent networks to address various tasks, these systems exhibit reduced overall inference latency by integrating various tasks into a single network. Existing panoptic perception networks often rely on pre-trained classification models as their backbone, which are not tailored for specific tasks, thereby compromising accuracy. To address this, we propose a dual-branch backbone and a wide perception segmentation head, enhancing the effectiveness of the network for autonomous driving applications. This enhanced network can simultaneously perform vehicle object detection, drivable area segmentation, and lane segmentation. Furthermore, to meet the stringent latency requirements of autonomous driving, we implement this network using an FPGA acceleration card. In our experiments using the challenging BDD100K dataset, the model significantly surpasses the baseline in accuracy for all tasks. To satisfy the increased real-time demands, the VCK5000 FPGA is used, which achieves inference speeds approximately 35.2 times faster than GPU-based deployments and about 41.5 times energy efficiency, providing significant advantages in resource-constrained scenarios.\",\"PeriodicalId\":36532,\"journal\":{\"name\":\"IEEE Transactions on Intelligent Vehicles\",\"volume\":\"10 6\",\"pages\":\"3697-3704\"},\"PeriodicalIF\":14.3000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Intelligent Vehicles\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10681579/\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Vehicles","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10681579/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Panoptic Driving Perception Model and Inference Acceleration Based on FPGA
Panoptic perception systems are critical for autonomous driving, as they process multiple visual tasks simultaneously, enhancing functionality. Compared with designing multiple independent networks to address various tasks, these systems exhibit reduced overall inference latency by integrating various tasks into a single network. Existing panoptic perception networks often rely on pre-trained classification models as their backbone, which are not tailored for specific tasks, thereby compromising accuracy. To address this, we propose a dual-branch backbone and a wide perception segmentation head, enhancing the effectiveness of the network for autonomous driving applications. This enhanced network can simultaneously perform vehicle object detection, drivable area segmentation, and lane segmentation. Furthermore, to meet the stringent latency requirements of autonomous driving, we implement this network using an FPGA acceleration card. In our experiments using the challenging BDD100K dataset, the model significantly surpasses the baseline in accuracy for all tasks. To satisfy the increased real-time demands, the VCK5000 FPGA is used, which achieves inference speeds approximately 35.2 times faster than GPU-based deployments and about 41.5 times energy efficiency, providing significant advantages in resource-constrained scenarios.
期刊介绍:
The IEEE Transactions on Intelligent Vehicles (T-IV) is a premier platform for publishing peer-reviewed articles that present innovative research concepts, application results, significant theoretical findings, and application case studies in the field of intelligent vehicles. With a particular emphasis on automated vehicles within roadway environments, T-IV aims to raise awareness of pressing research and application challenges.
Our focus is on providing critical information to the intelligent vehicle community, serving as a dissemination vehicle for IEEE ITS Society members and others interested in learning about the state-of-the-art developments and progress in research and applications related to intelligent vehicles. Join us in advancing knowledge and innovation in this dynamic field.