{"title":"Vision-Based Driving Decision Making Using Multi-Action Deep Q Network","authors":"Sheng Yuan;Yaochen Li;Kai Zhao;Li Zhu;Jiaxin Guo;Xinnan Ma;Yuncheng Xu","doi":"10.1109/TITS.2025.3552993","DOIUrl":null,"url":null,"abstract":"The performance improvement of perception algorithms and integrated validation of driving decision making remains challenging in the fields of computer vision and intelligent transportation systems. In this paper, we propose a novel vision-based framework for driving decision making, which is composed of three stages: object perception, lane line perception and driving decision making. For the object perception stage, an improved object perception model named CenterNet-ARA is developed, composing of a new adversarial training method, a receptive field enhancement module and an adaptive sample allocation equalization strategy to fuse multi-scale feature maps. For the lane line perception stage, a lane line perception method named Lite-MobileTR is proposed, which contains an improved Lite-MobileNetV3 encoder and an improved lite-transformer decoder. Moreover, a noise removal task is incorporated to alleviate the problem of slow convergence speed caused by Hungarian loss function. For the driving decision making stage, a new Multi-Action DQN is proposed utilizing a vehicle curriculum learning strategy and a curiosity exploration strategy to alleviate the problem of random exploration in the learning process. The proposed framework is evaluated on the Tusimple, CULane, TSD-max, and KITTI datasets. Finally, an integration verification is performed in Carla simulator to validate the driving decision making process. The experimental results well demonstrate the effectiveness of the proposed framework.","PeriodicalId":13416,"journal":{"name":"IEEE Transactions on Intelligent Transportation Systems","volume":"26 5","pages":"5816-5831"},"PeriodicalIF":7.9000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Intelligent Transportation Systems","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10947619/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
The performance improvement of perception algorithms and integrated validation of driving decision making remains challenging in the fields of computer vision and intelligent transportation systems. In this paper, we propose a novel vision-based framework for driving decision making, which is composed of three stages: object perception, lane line perception and driving decision making. For the object perception stage, an improved object perception model named CenterNet-ARA is developed, composing of a new adversarial training method, a receptive field enhancement module and an adaptive sample allocation equalization strategy to fuse multi-scale feature maps. For the lane line perception stage, a lane line perception method named Lite-MobileTR is proposed, which contains an improved Lite-MobileNetV3 encoder and an improved lite-transformer decoder. Moreover, a noise removal task is incorporated to alleviate the problem of slow convergence speed caused by Hungarian loss function. For the driving decision making stage, a new Multi-Action DQN is proposed utilizing a vehicle curriculum learning strategy and a curiosity exploration strategy to alleviate the problem of random exploration in the learning process. The proposed framework is evaluated on the Tusimple, CULane, TSD-max, and KITTI datasets. Finally, an integration verification is performed in Carla simulator to validate the driving decision making process. The experimental results well demonstrate the effectiveness of the proposed framework.
期刊介绍:
The theoretical, experimental and operational aspects of electrical and electronics engineering and information technologies as applied to Intelligent Transportation Systems (ITS). Intelligent Transportation Systems are defined as those systems utilizing synergistic technologies and systems engineering concepts to develop and improve transportation systems of all kinds. The scope of this interdisciplinary activity includes the promotion, consolidation and coordination of ITS technical activities among IEEE entities, and providing a focus for cooperative activities, both internally and externally.