Title: Toward Improving the Distributional Robustness of Risk-Aware Controllers in Learning-Enabled Environments
Authors: A. Hakobyan, Insoon Yang
Venue: 2021 60th IEEE Conference on Decision and Control (CDC)
Published: 2021-12-14
DOI: 10.1109/CDC45484.2021.9682981 (https://doi.org/10.1109/CDC45484.2021.9682981)
Citations: 3
Abstract
This paper is concerned with designing a risk-aware controller in an unknown and dynamic environment. In our method, the evolution of the environment state is learned from observational data via Gaussian process regression (GPR). Unfortunately, these learning results provide imperfect distribution information about the environment. To address such distribution errors, we propose a risk-constrained model predictive control (MPC) method that exploits techniques from modern distributionally robust optimization (DRO). To resolve the infinite dimensionality issue inherent in DRO, we derive a tractable semidefinite programming (SDP) problem that upper-bounds the original MPC problem. Furthermore, the SDP problem reduces to a quadratic program when the constraint function has a decomposable form. The performance and utility of our method are demonstrated through an autonomous driving problem, and the results show that our controller preserves safety despite errors in learning the behaviors of surrounding vehicles.
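To make the pipeline in the abstract concrete, the following is a minimal, illustrative sketch (not the paper's actual formulation) of its first two ingredients: a GPR posterior over an obstacle's future position, and a risk constraint tightened by an inflation factor to hedge against errors in the learned distribution. The toy data, the RBF hyperparameters, and the tightening factor `kappa` are assumptions for illustration only; the paper's DRO/SDP machinery is far more general.

```python
import numpy as np

def rbf_kernel(A, B, length_scale=0.2, variance=1.0):
    """Squared-exponential kernel between the row vectors of A and B."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return variance * np.exp(-0.5 * d2 / length_scale**2)

def gpr_predict(X_train, y_train, X_test, noise=1e-2, length_scale=0.2):
    """Standard GP regression posterior mean and variance at X_test."""
    K = rbf_kernel(X_train, X_train, length_scale) + noise * np.eye(len(X_train))
    Ks = rbf_kernel(X_test, X_train, length_scale)
    Kss = rbf_kernel(X_test, X_test, length_scale)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    mean = Ks @ alpha
    v = np.linalg.solve(L, Ks.T)
    var = np.diag(Kss) - (v**2).sum(axis=0)
    return mean, np.maximum(var, 0.0)  # clip tiny negative values

# Toy observations: a surrounding vehicle's 1-D position over time.
rng = np.random.default_rng(0)
t = np.linspace(0.0, 1.0, 20)[:, None]
pos = np.sin(2 * np.pi * t[:, 0]) + 0.05 * rng.standard_normal(20)

# Predict the position one step past the data.
t_next = np.array([[1.05]])
mu, var = gpr_predict(t, pos, t_next)
std = np.sqrt(var[0])

# Risk-constraint tightening (illustrative): require the ego vehicle to keep
# at least `margin + kappa * std` clearance from the predicted mean position.
# kappa > 0 inflates the margin to hedge against distribution errors; it is
# an assumed parameter, not one derived in the paper.
margin, kappa = 0.5, 2.0
safe_gap = margin + kappa * std
```

A distributionally robust treatment would replace the single inflation factor with a worst-case expectation over an ambiguity set of distributions around the GPR posterior, which is what leads to the infinite-dimensional problem and its SDP upper bound in the paper.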