Shanli Chen;Yunfei Zheng;Dongyuan Lin;Peng Cai;Yingying Xiao;Shiyuan Wang
{"title":"MAML-KalmanNet: A Neural Network-Assisted Kalman Filter Based on Model-Agnostic Meta-Learning","authors":"Shanli Chen;Yunfei Zheng;Dongyuan Lin;Peng Cai;Yingying Xiao;Shiyuan Wang","doi":"10.1109/TSP.2025.3540018","DOIUrl":null,"url":null,"abstract":"Neural network-assisted (NNA) Kalman filters provide an effective solution to addressing the filtering issues involving partially unknown system information by incorporating neural networks to compute the intermediate values influenced by unknown data, such as the Kalman gain in the filtering process. However, whenever there are slight changes in the state-space model (SSM), previously trained networks used in NNA Kalman filters become outdated, necessitating extensive time and data for retraining. Furthermore, obtaining sufficient labeled data for supervised learning is costly, and the effectiveness of unsupervised learning can be inconsistent. To this end, to address the inflexibility of neural network architecture and the scarcity of training data, we propose a model-agnostic meta-learning based neural network-assisted Kalman filter in this paper, called MAML-KalmanNet, by employing a limited amount of labeled data and training rounds to achieve desirable outcomes comparable to the supervised NNA Kalman filters with sufficient training. MAML-KalmanNet utilizes a pre-training approach based on specifically tailored meta-learning, enabling the network to adapt to model changes with minimal data and time without the requirement of retraining. Simultaneously, by fully leveraging the information from the SSM, MAML-KalmanNet eliminates the requirement of a large amount of labeled data to train the meta-learning initialization network. Simulations show that MAML-KalmanNet can mitigate the shortcomings existing in NNA Kalman filters regarding the requirements of abundant training data and sensitive network architecture, while providing real-time state estimation across a range of noise distributions.","PeriodicalId":13330,"journal":{"name":"IEEE Transactions on Signal Processing","volume":"73 ","pages":"988-1003"},"PeriodicalIF":4.6000,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10883047/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Neural network-assisted (NNA) Kalman filters provide an effective solution to addressing the filtering issues involving partially unknown system information by incorporating neural networks to compute the intermediate values influenced by unknown data, such as the Kalman gain in the filtering process. However, whenever there are slight changes in the state-space model (SSM), previously trained networks used in NNA Kalman filters become outdated, necessitating extensive time and data for retraining. Furthermore, obtaining sufficient labeled data for supervised learning is costly, and the effectiveness of unsupervised learning can be inconsistent. To this end, to address the inflexibility of neural network architecture and the scarcity of training data, we propose a model-agnostic meta-learning based neural network-assisted Kalman filter in this paper, called MAML-KalmanNet, by employing a limited amount of labeled data and training rounds to achieve desirable outcomes comparable to the supervised NNA Kalman filters with sufficient training. MAML-KalmanNet utilizes a pre-training approach based on specifically tailored meta-learning, enabling the network to adapt to model changes with minimal data and time without the requirement of retraining. Simultaneously, by fully leveraging the information from the SSM, MAML-KalmanNet eliminates the requirement of a large amount of labeled data to train the meta-learning initialization network. Simulations show that MAML-KalmanNet can mitigate the shortcomings existing in NNA Kalman filters regarding the requirements of abundant training data and sensitive network architecture, while providing real-time state estimation across a range of noise distributions.
期刊介绍:
The IEEE Transactions on Signal Processing covers novel theory, algorithms, performance analyses and applications of techniques for the processing, understanding, learning, retrieval, mining, and extraction of information from signals. The term “signal” includes, among others, audio, video, speech, image, communication, geophysical, sonar, radar, medical and musical signals. Examples of topics of interest include, but are not limited to, information processing and the theory and application of filtering, coding, transmitting, estimating, detecting, analyzing, recognizing, synthesizing, recording, and reproducing signals.