Learning-based real-time model predictive tracking control for autonomous vehicles with path-pattern adaptability

IF 5.4 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Wentong Shi , Shiying Dong , Chao Zhang , Dele Meng , Jinlong Hong , Hongqing Chu , Bingzhao Gao
{"title":"Learning-based real-time model predictive tracking control for autonomous vehicles with path-pattern adaptability","authors":"Wentong Shi ,&nbsp;Shiying Dong ,&nbsp;Chao Zhang ,&nbsp;Dele Meng ,&nbsp;Jinlong Hong ,&nbsp;Hongqing Chu ,&nbsp;Bingzhao Gao","doi":"10.1016/j.conengprac.2025.106480","DOIUrl":null,"url":null,"abstract":"<div><div>Real-time implementation of nonlinear model predictive control (NMPC) for path tracking in autonomous vehicles (AVs) remains challenging due to the computational complexity of solving high-dimensional, nonlinear optimization problems under dynamic constraints. To address this issue, this paper proposes a novel end-to-end training framework, called the neural network optimizer (NN Optimizer), which significantly reduces the computational burden of NMPC, enabling real-time implementation. Specifically, the NN Optimizer integrates low-fidelity, interpretable physical models with neural networks (NNs), utilizing automatic differentiation to backpropagate the NMPC loss function through a differentiable dynamics model to obtain policy gradients. As opposed to traditional imitation learning (IL)-based approaches, usually reliant on expert knowledge, the proposed method does not rely on extensive labeled data, offering greater interpretability and path-pattern adaptability. During online implementation, it just involves simple function evaluation, avoiding the computation of gradient information and multiple iterations. In two validation scenarios during co-simulation, the NN Optimizer achieves a solving speed over 70 times faster than interior point optimizer (<span>IPOPT</span>), while improving closed-loop control performance by more than 10% compared to IL. In real-vehicle tests, the NN optimizer outperforms the linear quadratic regulator (LQR), achieving improvements in control performance of over 30% in the weave scenario and over 60% in the double lane change (DLC) scenario.</div></div>","PeriodicalId":50615,"journal":{"name":"Control Engineering Practice","volume":"164 ","pages":"Article 106480"},"PeriodicalIF":5.4000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Control Engineering Practice","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0967066125002424","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Real-time implementation of nonlinear model predictive control (NMPC) for path tracking in autonomous vehicles (AVs) remains challenging due to the computational complexity of solving high-dimensional, nonlinear optimization problems under dynamic constraints. To address this issue, this paper proposes a novel end-to-end training framework, called the neural network optimizer (NN Optimizer), which significantly reduces the computational burden of NMPC, enabling real-time implementation. Specifically, the NN Optimizer integrates low-fidelity, interpretable physical models with neural networks (NNs), utilizing automatic differentiation to backpropagate the NMPC loss function through a differentiable dynamics model to obtain policy gradients. As opposed to traditional imitation learning (IL)-based approaches, usually reliant on expert knowledge, the proposed method does not rely on extensive labeled data, offering greater interpretability and path-pattern adaptability. During online implementation, it just involves simple function evaluation, avoiding the computation of gradient information and multiple iterations. In two validation scenarios during co-simulation, the NN Optimizer achieves a solving speed over 70 times faster than interior point optimizer (IPOPT), while improving closed-loop control performance by more than 10% compared to IL. In real-vehicle tests, the NN optimizer outperforms the linear quadratic regulator (LQR), achieving improvements in control performance of over 30% in the weave scenario and over 60% in the double lane change (DLC) scenario.
基于学习的路径模式自适应自动驾驶汽车实时模型预测跟踪控制
由于在动态约束下求解高维非线性优化问题的计算复杂性,自动驾驶汽车(AVs)路径跟踪的非线性模型预测控制(NMPC)的实时实现仍然具有挑战性。为了解决这个问题,本文提出了一种新的端到端训练框架,称为神经网络优化器(NN optimizer),它显著降低了NMPC的计算负担,实现了实时实现。具体来说,NN优化器将低保真、可解释的物理模型与神经网络(NN)集成在一起,利用自动微分通过可微动力学模型反向传播NMPC损失函数,以获得策略梯度。与传统的基于模仿学习(IL)的方法(通常依赖于专家知识)相反,该方法不依赖于广泛的标记数据,提供了更好的可解释性和路径模式适应性。在线实现时,只涉及简单的函数求值,避免了梯度信息的计算和多次迭代。在联合仿真的两个验证场景中,NN优化器的求解速度比内点优化器(IPOPT)快70倍以上,同时闭环控制性能比IL提高10%以上。在实车测试中,NN优化器优于线性二次型调节器(LQR),在编织场景中实现了超过30%的控制性能提高,在双车道变(DLC)场景中实现了超过60%的控制性能提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Control Engineering Practice
Control Engineering Practice 工程技术-工程:电子与电气
CiteScore
9.20
自引率
12.20%
发文量
183
审稿时长
44 days
期刊介绍: Control Engineering Practice strives to meet the needs of industrial practitioners and industrially related academics and researchers. It publishes papers which illustrate the direct application of control theory and its supporting tools in all possible areas of automation. As a result, the journal only contains papers which can be considered to have made significant contributions to the application of advanced control techniques. It is normally expected that practical results should be included, but where simulation only studies are available, it is necessary to demonstrate that the simulation model is representative of a genuine application. Strictly theoretical papers will find a more appropriate home in Control Engineering Practice''s sister publication, Automatica. It is also expected that papers are innovative with respect to the state of the art and are sufficiently detailed for a reader to be able to duplicate the main results of the paper (supplementary material, including datasets, tables, code and any relevant interactive material can be made available and downloaded from the website). The benefits of the presented methods must be made very clear and the new techniques must be compared and contrasted with results obtained using existing methods. Moreover, a thorough analysis of failures that may happen in the design process and implementation can also be part of the paper. The scope of Control Engineering Practice matches the activities of IFAC. Papers demonstrating the contribution of automation and control in improving the performance, quality, productivity, sustainability, resource and energy efficiency, and the manageability of systems and processes for the benefit of mankind and are relevant to industrial practitioners are most welcome.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信