基于h -∞滤波和强化学习算法的城市环境智能导航

Ivan Smolyakov, R. Langley
{"title":"基于h -∞滤波和强化学习算法的城市环境智能导航","authors":"Ivan Smolyakov, R. Langley","doi":"10.1109/PLANS46316.2020.9109948","DOIUrl":null,"url":null,"abstract":"In urban areas, robustness of a positioning solution suffers from relatively unpredictable reception of attenuated, non-line-of-sight and multipath-contaminated signals. To reflect a GNSS signal propagation environment, parameters of a state estimation filter need to be adjusted on-the-fly. A mixed H2/ H∞ filter has been considered here to address the vulnerability of a minimum error variance estimator to measurement outliers. An emphasis between the H2filter and the H∞ filter (minimizing the worst-case error) is continuously adjusted by a reinforcement learning (RL) model. Specifically, a continuous action actor-critic RL model with eligibility traces is implemented. The Cramér-Rao lower bound is considered for the filter performance evaluation allowing for the RL reward computation. The algorithm has been tested on a real-world dataset collected with mass-market hardware applying tightly-coupled IMU/GPS sensor integration. A positive RL model learning trend has been identified in two segments of the trajectory with the highest obstruction environment, suggesting the applicability potential of the technique.","PeriodicalId":273568,"journal":{"name":"2020 IEEE/ION Position, Location and Navigation Symposium (PLANS)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Intelligent Navigation in Urban Environments Based on an H-infinity Filter and Reinforcement Learning Algorithms\",\"authors\":\"Ivan Smolyakov, R. Langley\",\"doi\":\"10.1109/PLANS46316.2020.9109948\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In urban areas, robustness of a positioning solution suffers from relatively unpredictable reception of attenuated, non-line-of-sight and multipath-contaminated signals. To reflect a GNSS signal propagation environment, parameters of a state estimation filter need to be adjusted on-the-fly. A mixed H2/ H∞ filter has been considered here to address the vulnerability of a minimum error variance estimator to measurement outliers. An emphasis between the H2filter and the H∞ filter (minimizing the worst-case error) is continuously adjusted by a reinforcement learning (RL) model. Specifically, a continuous action actor-critic RL model with eligibility traces is implemented. The Cramér-Rao lower bound is considered for the filter performance evaluation allowing for the RL reward computation. The algorithm has been tested on a real-world dataset collected with mass-market hardware applying tightly-coupled IMU/GPS sensor integration. A positive RL model learning trend has been identified in two segments of the trajectory with the highest obstruction environment, suggesting the applicability potential of the technique.\",\"PeriodicalId\":273568,\"journal\":{\"name\":\"2020 IEEE/ION Position, Location and Navigation Symposium (PLANS)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE/ION Position, Location and Navigation Symposium (PLANS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PLANS46316.2020.9109948\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE/ION Position, Location and Navigation Symposium (PLANS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PLANS46316.2020.9109948","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在城市地区,定位解决方案的鲁棒性受到相对不可预测的衰减、非视距和多路径污染信号接收的影响。为了反映GNSS信号的传播环境,需要动态调整状态估计滤波器的参数。这里考虑了混合H2/ H∞滤波器来解决最小误差方差估计器对测量异常值的脆弱性。H2filter和H∞filter之间的重点(最小化最坏情况误差)通过强化学习(RL)模型不断调整。具体来说,实现了一个具有资格跟踪的连续动作参与者-评论家RL模型。考虑了允许RL奖励计算的滤波器性能评估的cram - rao下界。该算法已在应用紧密耦合IMU/GPS传感器集成的大众市场硬件收集的真实数据集上进行了测试。在阻力最大的两个轨迹段,RL模型的学习趋势为正,表明了该技术的应用潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Intelligent Navigation in Urban Environments Based on an H-infinity Filter and Reinforcement Learning Algorithms
In urban areas, robustness of a positioning solution suffers from relatively unpredictable reception of attenuated, non-line-of-sight and multipath-contaminated signals. To reflect a GNSS signal propagation environment, parameters of a state estimation filter need to be adjusted on-the-fly. A mixed H2/ H∞ filter has been considered here to address the vulnerability of a minimum error variance estimator to measurement outliers. An emphasis between the H2filter and the H∞ filter (minimizing the worst-case error) is continuously adjusted by a reinforcement learning (RL) model. Specifically, a continuous action actor-critic RL model with eligibility traces is implemented. The Cramér-Rao lower bound is considered for the filter performance evaluation allowing for the RL reward computation. The algorithm has been tested on a real-world dataset collected with mass-market hardware applying tightly-coupled IMU/GPS sensor integration. A positive RL model learning trend has been identified in two segments of the trajectory with the highest obstruction environment, suggesting the applicability potential of the technique.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信