Abdul Basit, Muhammad Usama Ejaz, Qirat Ayaz, F. Malik
{"title":"自动驾驶汽车的实时目标检测和3D场景感知","authors":"Abdul Basit, Muhammad Usama Ejaz, Qirat Ayaz, F. Malik","doi":"10.1109/ICAI58407.2023.10136623","DOIUrl":null,"url":null,"abstract":"Reliable autonomous urban driving hinges upon the vehicle's ability to perceive and navigate the environment. This research paper emphasizes designing and implementing a vision-based perception system for NUSTAG self-driving car. The primary task is the implementation of 3D bounding box estimation and depth perception using a stereo camera feed to estimate the positions of cars, bikes, and pedestrians. Moreover, road signs and traffic lights are detected using 2D object detection and classification. The major challenge to implement all these deep learning algorithms in parallel in the NVIDIA Jetson Xavier development kit is achieved by optimizing the models to perform inference in real-time. This is accomplished using the TensorRT framework employing ROS interface. The models have been trained for our requirements to yield efficient results within our operational design domain.","PeriodicalId":161809,"journal":{"name":"2023 3rd International Conference on Artificial Intelligence (ICAI)","volume":"209 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Real-time object detection and 3D scene perception in self-driving cars\",\"authors\":\"Abdul Basit, Muhammad Usama Ejaz, Qirat Ayaz, F. Malik\",\"doi\":\"10.1109/ICAI58407.2023.10136623\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Reliable autonomous urban driving hinges upon the vehicle's ability to perceive and navigate the environment. This research paper emphasizes designing and implementing a vision-based perception system for NUSTAG self-driving car. The primary task is the implementation of 3D bounding box estimation and depth perception using a stereo camera feed to estimate the positions of cars, bikes, and pedestrians. Moreover, road signs and traffic lights are detected using 2D object detection and classification. The major challenge to implement all these deep learning algorithms in parallel in the NVIDIA Jetson Xavier development kit is achieved by optimizing the models to perform inference in real-time. This is accomplished using the TensorRT framework employing ROS interface. The models have been trained for our requirements to yield efficient results within our operational design domain.\",\"PeriodicalId\":161809,\"journal\":{\"name\":\"2023 3rd International Conference on Artificial Intelligence (ICAI)\",\"volume\":\"209 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 3rd International Conference on Artificial Intelligence (ICAI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAI58407.2023.10136623\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 3rd International Conference on Artificial Intelligence (ICAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAI58407.2023.10136623","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Real-time object detection and 3D scene perception in self-driving cars
Reliable autonomous urban driving hinges upon the vehicle's ability to perceive and navigate the environment. This research paper emphasizes designing and implementing a vision-based perception system for NUSTAG self-driving car. The primary task is the implementation of 3D bounding box estimation and depth perception using a stereo camera feed to estimate the positions of cars, bikes, and pedestrians. Moreover, road signs and traffic lights are detected using 2D object detection and classification. The major challenge to implement all these deep learning algorithms in parallel in the NVIDIA Jetson Xavier development kit is achieved by optimizing the models to perform inference in real-time. This is accomplished using the TensorRT framework employing ROS interface. The models have been trained for our requirements to yield efficient results within our operational design domain.