{"title":"基于h -∞滤波和强化学习算法的城市环境智能导航","authors":"Ivan Smolyakov, R. Langley","doi":"10.1109/PLANS46316.2020.9109948","DOIUrl":null,"url":null,"abstract":"In urban areas, robustness of a positioning solution suffers from relatively unpredictable reception of attenuated, non-line-of-sight and multipath-contaminated signals. To reflect a GNSS signal propagation environment, parameters of a state estimation filter need to be adjusted on-the-fly. A mixed H2/ H∞ filter has been considered here to address the vulnerability of a minimum error variance estimator to measurement outliers. An emphasis between the H2filter and the H∞ filter (minimizing the worst-case error) is continuously adjusted by a reinforcement learning (RL) model. Specifically, a continuous action actor-critic RL model with eligibility traces is implemented. The Cramér-Rao lower bound is considered for the filter performance evaluation allowing for the RL reward computation. The algorithm has been tested on a real-world dataset collected with mass-market hardware applying tightly-coupled IMU/GPS sensor integration. A positive RL model learning trend has been identified in two segments of the trajectory with the highest obstruction environment, suggesting the applicability potential of the technique.","PeriodicalId":273568,"journal":{"name":"2020 IEEE/ION Position, Location and Navigation Symposium (PLANS)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Intelligent Navigation in Urban Environments Based on an H-infinity Filter and Reinforcement Learning Algorithms\",\"authors\":\"Ivan Smolyakov, R. 
Langley\",\"doi\":\"10.1109/PLANS46316.2020.9109948\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In urban areas, robustness of a positioning solution suffers from relatively unpredictable reception of attenuated, non-line-of-sight and multipath-contaminated signals. To reflect a GNSS signal propagation environment, parameters of a state estimation filter need to be adjusted on-the-fly. A mixed H2/ H∞ filter has been considered here to address the vulnerability of a minimum error variance estimator to measurement outliers. An emphasis between the H2filter and the H∞ filter (minimizing the worst-case error) is continuously adjusted by a reinforcement learning (RL) model. Specifically, a continuous action actor-critic RL model with eligibility traces is implemented. The Cramér-Rao lower bound is considered for the filter performance evaluation allowing for the RL reward computation. The algorithm has been tested on a real-world dataset collected with mass-market hardware applying tightly-coupled IMU/GPS sensor integration. 
A positive RL model learning trend has been identified in two segments of the trajectory with the highest obstruction environment, suggesting the applicability potential of the technique.\",\"PeriodicalId\":273568,\"journal\":{\"name\":\"2020 IEEE/ION Position, Location and Navigation Symposium (PLANS)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE/ION Position, Location and Navigation Symposium (PLANS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PLANS46316.2020.9109948\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE/ION Position, Location and Navigation Symposium (PLANS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PLANS46316.2020.9109948","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Intelligent Navigation in Urban Environments Based on an H-infinity Filter and Reinforcement Learning Algorithms
In urban areas, the robustness of a positioning solution suffers from relatively unpredictable reception of attenuated, non-line-of-sight, and multipath-contaminated signals. To reflect the GNSS signal propagation environment, the parameters of a state estimation filter need to be adjusted on the fly. A mixed H2/H∞ filter is considered here to address the vulnerability of a minimum-error-variance estimator to measurement outliers. The emphasis between the H2 filter and the H∞ filter (which minimizes the worst-case error) is continuously adjusted by a reinforcement learning (RL) model; specifically, a continuous-action actor-critic RL model with eligibility traces is implemented. The Cramér-Rao lower bound is used for the filter performance evaluation, allowing for the RL reward computation. The algorithm has been tested on a real-world dataset collected with mass-market hardware applying tightly coupled IMU/GPS sensor integration. A positive RL model learning trend has been identified in the two trajectory segments with the most obstructed environment, suggesting the applicability potential of the technique.
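The core mechanism the abstract describes — a continuous-action actor-critic with eligibility traces tuning a single scalar, the H2/H∞ emphasis — can be sketched in a few lines. The sketch below is illustrative only and not the authors' implementation: the feature map `phi`, the learning rates, and the reward function standing in for the Cramér-Rao-bound-based reward are all assumptions. It uses a Gaussian policy over an unbounded action that is squashed to a blend weight in (0, 1), where 0 means pure H2 behaviour and 1 means pure H∞.

```python
import numpy as np

rng = np.random.default_rng(1)

def phi(obs):
    """Hypothetical feature vector of the signal environment (bias + one scalar)."""
    return np.array([1.0, obs])

# Linear actor (Gaussian policy: mean and log-std weights) and linear critic,
# each with its own eligibility trace. All names and values are illustrative.
w_mu = np.zeros(2); w_s = np.zeros(2); v = np.zeros(2)
z_mu = np.zeros(2); z_s = np.zeros(2); z_v = np.zeros(2)
alpha, beta, gamma, lam = 0.05, 0.1, 0.9, 0.8

def update(obs, next_obs, reward_of_blend):
    """One actor-critic step; returns the chosen H2/H-infinity blend and reward."""
    global w_mu, w_s, v, z_mu, z_s, z_v
    x, x2 = phi(obs), phi(next_obs)
    mu, sigma = x @ w_mu, np.exp(x @ w_s) + 1e-2
    a = rng.normal(mu, sigma)                 # sample a continuous action
    blend = 1.0 / (1.0 + np.exp(-a))          # squash: 0 -> pure H2, 1 -> pure H-infinity
    r = reward_of_blend(blend)                # stand-in for the CRLB-derived reward
    delta = r + gamma * (x2 @ v) - (x @ v)    # one-step TD error
    # Decaying eligibility traces accumulate the critic and policy gradients.
    z_v[:]  = gamma * lam * z_v  + x
    z_mu[:] = gamma * lam * z_mu + (a - mu) / sigma**2 * x
    z_s[:]  = gamma * lam * z_s  + ((a - mu) ** 2 / sigma**2 - 1.0) * x
    v    += beta  * delta * z_v
    w_mu += alpha * delta * z_mu
    w_s  += alpha * delta * z_s
    return blend, r
```

As a usage example, one could simulate a deeply obstructed segment in which heavy multipath favours a large H∞ weight by calling `update(0.5, 0.5, lambda b: -(b - 0.8) ** 2)` repeatedly and watching the blend drift; in the real system the reward would instead compare filter performance against the Cramér-Rao lower bound.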