RAFT: Regularized Adversarial Fine-Tuning to Enhance Deep Reinforcement Learning for Self-Parking

IF 2.2 | Q3 | Engineering, Electrical & Electronic
Alessandro Pighetti, Francesco Bellotti, Riccardo Berta, Andrea Cavallaro, Luca Lazzaroni, Changjae Oh
{"title":"RAFT: Regularized Adversarial Fine-Tuning to Enhance Deep Reinforcement Learning for Self-Parking","authors":"Alessandro Pighetti;Francesco Bellotti;Riccardo Berta;Andrea Cavallaro;Luca Lazzaroni;Changjae Oh","doi":"10.1109/LSENS.2025.3600982","DOIUrl":null,"url":null,"abstract":"Deep reinforcement learning (DRL) is a powerful method for local motion planning in automated driving. However, training of DRL agents is difficult and subject to instability. We propose regularized adversarial fine-tuning (RAFT), an adversarial DRL training framework, and test it in an automated parking (AP) scenario in the car learning to act (CARLA) simulator. Results show that RAFT enhances the performance of a state-of-the-art agent in its original operational design domain (ODD) (static parking, without adversary), by improving its robustness, as evidenced by an increase in all measured metrics. The success rate rises, the mean alignment error shrinks, and the gear reversal rate drops. Notably, we achieved this result not by designing an ad-hoc reward function, but simply by adding a general regularization term to the baseline adversary reward. The results open up new research perspectives for extending the ODD of DRL-based AP to dynamic scenes.","PeriodicalId":13014,"journal":{"name":"IEEE Sensors Letters","volume":"9 9","pages":"1-4"},"PeriodicalIF":2.2000,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Letters","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11130917/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Citations: 0

Abstract

Deep reinforcement learning (DRL) is a powerful method for local motion planning in automated driving. However, training of DRL agents is difficult and subject to instability. We propose regularized adversarial fine-tuning (RAFT), an adversarial DRL training framework, and test it in an automated parking (AP) scenario in the car learning to act (CARLA) simulator. Results show that RAFT enhances the performance of a state-of-the-art agent in its original operational design domain (ODD) (static parking, without adversary) by improving its robustness, as evidenced by an improvement in all measured metrics: the success rate rises, the mean alignment error shrinks, and the gear reversal rate drops. Notably, we achieved this result not by designing an ad hoc reward function, but simply by adding a general regularization term to the baseline adversary reward. The results open up new research perspectives for extending the ODD of DRL-based AP to dynamic scenes.
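The core idea stated in the abstract, keeping the baseline adversary reward and adding a general regularization term during fine-tuning, can be sketched as follows. This is a hypothetical illustration, not the authors' implementation: the reward components, the choice of an L2 action penalty as the regularizer, and the weight lambda_reg are all assumptions made for the example.

```python
# Hypothetical sketch of a RAFT-style adversary reward: the baseline adversary
# reward is kept as-is, and a generic regularization term is subtracted from it.
# All names and the specific regularizer are illustrative assumptions.
import numpy as np

def baseline_adversary_reward(ego_failure: float, collision_penalty: float) -> float:
    """Placeholder for whatever reward the baseline adversary already optimizes
    (e.g., rewarding interference with the parking agent, penalizing collisions)."""
    return ego_failure - collision_penalty

def regularization(adversary_action: np.ndarray) -> float:
    """A generic regularizer; here an L2 penalty on the adversary's action
    magnitude, discouraging degenerate, overly aggressive behavior."""
    return float(np.sum(adversary_action ** 2))

def raft_adversary_reward(ego_failure: float,
                          collision_penalty: float,
                          adversary_action: np.ndarray,
                          lambda_reg: float = 0.1) -> float:
    """Regularized adversary reward: baseline reward minus a weighted
    regularization term (lambda_reg is an assumed hyperparameter)."""
    return (baseline_adversary_reward(ego_failure, collision_penalty)
            - lambda_reg * regularization(adversary_action))

# Example: evaluate the regularized reward for one adversary action (steer, throttle).
action = np.array([0.3, 0.8])
print(raft_adversary_reward(ego_failure=1.0, collision_penalty=0.2, adversary_action=action))
```

Any standard regularizer (action magnitude, smoothness, entropy) could play this role; the point made in the abstract is that no task-specific adversary reward engineering was needed, only a general regularization term added to the existing reward.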
Source journal
IEEE Sensors Letters (Engineering, Electrical and Electronic Engineering)
CiteScore: 3.50
Self-citation rate: 7.10%
Articles per year: 194