Evaluative feedback as the basis for behavior optimization in the of autonomous vehicle steering

Proceedings. 2005 IEEE Intelligent Transportation Systems, 2005. Pub Date : 2005-10-24 DOI:10.1109/ITSC.2005.1520128

K. Kuhnert, Michael Krödel

引用次数: 1

Abstract

Steering an autonomous vehicle requires the permanent adaptation of behavior in relationship to the various situations the vehicle is in. This paper describes a research which implements such adaptation and optimization based on reinforcement learning (RL) which in detail purely learns from evaluative feedback in contrast to instructive feedback. In this way it self-explores and self-optimises actions for situations in a defined environment. The target of this research is to determine to what extent RL-based systems serve as an enhancement or even an alternative to classical concepts of autonomous intelligent vehicles such as modelling or neural nets.

查看原文本刊更多论文

评价反馈作为自动驾驶车辆转向行为优化的基础

驾驶自动驾驶汽车需要根据车辆所处的各种情况不断调整行为。本文描述了一项基于强化学习(RL)实现这种适应和优化的研究，该学习完全从评估反馈中学习，而不是从指导性反馈中学习。通过这种方式，它可以在特定的环境中自我探索和自我优化行动。本研究的目标是确定基于强化学习的系统在多大程度上可以作为自动智能车辆的经典概念(如建模或神经网络)的增强甚至替代方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings. 2005 IEEE Intelligent Transportation Systems, 2005.

自引率

0.00%

发文量