{"title":"非线性不确定时滞机械臂系统的神经网络强化迭代学习故障估计方法","authors":"Zhengquan Chen;Zhiheng Zhang;Jiayuan Yan;Maiying Zhong;Lingling Lv;Yandong Hou","doi":"10.1109/TII.2025.3575123","DOIUrl":null,"url":null,"abstract":"This article investigates the problem of fault estimation in nonlinear uncertain manipulator systems with time-delay. A novel fault estimation scheme is proposed, which optimizes the iterative learning (IL) estimator performance using a neural network (NN)-based reinforcement learning (RL) approach. Specifically, first, with the purpose of enhancing robustness and adaptability of RL, a new adaptive exponential reward function is designed. Then, to improve the performance of fault estimation, speed and accuracy are designed as optimization objectives. Simultaneously, by leveraging the IL estimator, the NN is continuously optimized in each iteration process to mitigate the issues of gradient vanishing and explosion. Further, by incorporating the H<inline-formula><tex-math>$\\mathrm{\\infty }$</tex-math></inline-formula> performance index into the observer, an asymptotically convergent estimated error can be attained. Finally, numerical simulations are conducted to demonstrate the effectiveness of our method.","PeriodicalId":13301,"journal":{"name":"IEEE Transactions on Industrial Informatics","volume":"21 9","pages":"7287-7298"},"PeriodicalIF":9.9000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Neural Network–Based Reinforcement Iterative Learning Fault Estimation Scheme for Nonlinear Uncertain Manipulator Systems With Time-Delay\",\"authors\":\"Zhengquan Chen;Zhiheng Zhang;Jiayuan Yan;Maiying Zhong;Lingling Lv;Yandong Hou\",\"doi\":\"10.1109/TII.2025.3575123\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article investigates the problem of fault estimation in nonlinear uncertain manipulator systems with time-delay. A novel fault estimation scheme is proposed, which optimizes the iterative learning (IL) estimator performance using a neural network (NN)-based reinforcement learning (RL) approach. Specifically, first, with the purpose of enhancing robustness and adaptability of RL, a new adaptive exponential reward function is designed. Then, to improve the performance of fault estimation, speed and accuracy are designed as optimization objectives. Simultaneously, by leveraging the IL estimator, the NN is continuously optimized in each iteration process to mitigate the issues of gradient vanishing and explosion. Further, by incorporating the H<inline-formula><tex-math>$\\\\mathrm{\\\\infty }$</tex-math></inline-formula> performance index into the observer, an asymptotically convergent estimated error can be attained. Finally, numerical simulations are conducted to demonstrate the effectiveness of our method.\",\"PeriodicalId\":13301,\"journal\":{\"name\":\"IEEE Transactions on Industrial Informatics\",\"volume\":\"21 9\",\"pages\":\"7287-7298\"},\"PeriodicalIF\":9.9000,\"publicationDate\":\"2025-06-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Industrial Informatics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11033185/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Industrial Informatics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11033185/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Neural Network–Based Reinforcement Iterative Learning Fault Estimation Scheme for Nonlinear Uncertain Manipulator Systems With Time-Delay
This article investigates the problem of fault estimation in nonlinear uncertain manipulator systems with time-delay. A novel fault estimation scheme is proposed, which optimizes the iterative learning (IL) estimator performance using a neural network (NN)-based reinforcement learning (RL) approach. Specifically, first, with the purpose of enhancing robustness and adaptability of RL, a new adaptive exponential reward function is designed. Then, to improve the performance of fault estimation, speed and accuracy are designed as optimization objectives. Simultaneously, by leveraging the IL estimator, the NN is continuously optimized in each iteration process to mitigate the issues of gradient vanishing and explosion. Further, by incorporating the H$\mathrm{\infty }$ performance index into the observer, an asymptotically convergent estimated error can be attained. Finally, numerical simulations are conducted to demonstrate the effectiveness of our method.
期刊介绍:
The IEEE Transactions on Industrial Informatics is a multidisciplinary journal dedicated to publishing technical papers that connect theory with practical applications of informatics in industrial settings. It focuses on the utilization of information in intelligent, distributed, and agile industrial automation and control systems. The scope includes topics such as knowledge-based and AI-enhanced automation, intelligent computer control systems, flexible and collaborative manufacturing, industrial informatics in software-defined vehicles and robotics, computer vision, industrial cyber-physical and industrial IoT systems, real-time and networked embedded systems, security in industrial processes, industrial communications, systems interoperability, and human-machine interaction.