{"title":"Intelligent Navigation in Urban Environments Based on an H-infinity Filter and Reinforcement Learning Algorithms","authors":"Ivan Smolyakov, R. Langley","doi":"10.1109/PLANS46316.2020.9109948","DOIUrl":null,"url":null,"abstract":"In urban areas, robustness of a positioning solution suffers from relatively unpredictable reception of attenuated, non-line-of-sight and multipath-contaminated signals. To reflect a GNSS signal propagation environment, parameters of a state estimation filter need to be adjusted on-the-fly. A mixed H2/ H∞ filter has been considered here to address the vulnerability of a minimum error variance estimator to measurement outliers. An emphasis between the H2filter and the H∞ filter (minimizing the worst-case error) is continuously adjusted by a reinforcement learning (RL) model. Specifically, a continuous action actor-critic RL model with eligibility traces is implemented. The Cramér-Rao lower bound is considered for the filter performance evaluation allowing for the RL reward computation. The algorithm has been tested on a real-world dataset collected with mass-market hardware applying tightly-coupled IMU/GPS sensor integration. A positive RL model learning trend has been identified in two segments of the trajectory with the highest obstruction environment, suggesting the applicability potential of the technique.","PeriodicalId":273568,"journal":{"name":"2020 IEEE/ION Position, Location and Navigation Symposium (PLANS)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE/ION Position, Location and Navigation Symposium (PLANS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PLANS46316.2020.9109948","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 1
Abstract
In urban areas, the robustness of a positioning solution suffers from the relatively unpredictable reception of attenuated, non-line-of-sight, and multipath-contaminated signals. To reflect the GNSS signal propagation environment, the parameters of a state estimation filter need to be adjusted on the fly. A mixed H2/H∞ filter is considered here to address the vulnerability of a minimum-error-variance estimator to measurement outliers. The emphasis between the H2 filter and the H∞ filter (which minimizes the worst-case error) is continuously adjusted by a reinforcement learning (RL) model. Specifically, a continuous-action actor-critic RL model with eligibility traces is implemented. The Cramér-Rao lower bound is used to evaluate filter performance and thereby compute the RL reward. The algorithm has been tested on a real-world dataset collected with mass-market hardware using tightly coupled IMU/GPS sensor integration. A positive RL learning trend was identified in the two trajectory segments with the most obstructed environments, suggesting the technique's potential applicability.
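
To make the scheme described in the abstract concrete, below is a minimal sketch (not the authors' implementation) of a continuous-action actor-critic with eligibility traces that tunes a scalar H2/H∞ emphasis parameter. The feature construction, the reward, and the filter interface are hypothetical placeholders; in the paper the reward is derived from a Cramér-Rao-lower-bound-based performance measure of the tightly coupled IMU/GPS filter.

```python
# Hedged sketch: continuous-action actor-critic with eligibility traces
# adjusting the emphasis parameter theta in [0, 1] of a mixed H2/H-infinity
# filter. All names and the feature/reward construction are hypothetical.
import numpy as np

rng = np.random.default_rng(0)

N_FEATURES = 8            # size of the (hypothetical) environment feature vector
GAMMA, LAMBDA = 0.99, 0.9 # discount factor and trace decay
ALPHA_V, ALPHA_PI = 1e-2, 1e-3

# Critic: linear value function with an accumulating eligibility trace.
w_v = np.zeros(N_FEATURES)
z_v = np.zeros(N_FEATURES)

# Actor: Gaussian policy over theta; the mean is linear in the features,
# the standard deviation is fixed for simplicity.
w_mu = np.zeros(N_FEATURES)
z_mu = np.zeros(N_FEATURES)
SIGMA = 0.1

def features(env_state):
    """Hypothetical mapping from GNSS environment descriptors
    (e.g. visible-satellite count, residual statistics) to features."""
    return np.asarray(env_state, dtype=float)

def select_emphasis(x):
    """Sample the H2/H-infinity emphasis from the Gaussian policy,
    clipped to the valid mixing range [0, 1]."""
    mu = float(w_mu @ x)
    theta = rng.normal(mu, SIGMA)
    return float(np.clip(theta, 0.0, 1.0)), mu

def update(x, theta, mu, reward, x_next, done):
    """One actor-critic TD(lambda) update with eligibility traces."""
    global w_v, z_v, w_mu, z_mu
    v = w_v @ x
    v_next = 0.0 if done else w_v @ x_next
    delta = reward + GAMMA * v_next - v          # TD error
    # Critic trace and weight update.
    z_v = GAMMA * LAMBDA * z_v + x
    w_v += ALPHA_V * delta * z_v
    # Actor trace uses the log-likelihood gradient of the Gaussian mean.
    grad_log_pi = (theta - mu) / SIGMA**2 * x
    z_mu = GAMMA * LAMBDA * z_mu + grad_log_pi
    w_mu += ALPHA_PI * delta * z_mu

# Usage outline (placeholders): per filter epoch, build features from the
# current signal environment, pick theta, run the mixed H2/H-infinity
# measurement update with that emphasis, and score it with a reward
# derived from a Cramér-Rao-lower-bound comparison.
x = features(rng.normal(size=N_FEATURES))
theta, mu = select_emphasis(x)
reward = 0.0  # placeholder for crlb_based_reward(filter_output), hypothetical
x_next = features(rng.normal(size=N_FEATURES))
update(x, theta, mu, reward, x_next, done=False)
```

The design choice mirrored here is that only the mixing emphasis, not the full filter gain, is learned online, which keeps the action space one-dimensional and lets a simple linear actor-critic adapt it per trajectory segment.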