基于Actor-Critic深度强化学习的核电厂综合故障诊断方法

IF 2.3 3区 工程技术 Q1 NUCLEAR SCIENCE & TECHNOLOGY
Tong Li , Sichao Tan , Jiangkuan Li , Ruifeng Tian , Jihong Shen , Jiaoshen Xu
{"title":"基于Actor-Critic深度强化学习的核电厂综合故障诊断方法","authors":"Tong Li ,&nbsp;Sichao Tan ,&nbsp;Jiangkuan Li ,&nbsp;Ruifeng Tian ,&nbsp;Jihong Shen ,&nbsp;Jiaoshen Xu","doi":"10.1016/j.anucene.2025.111907","DOIUrl":null,"url":null,"abstract":"<div><div>Although the international community has carried out a lot of research work on fault diagnosis, the nonlinear and high coupling characteristics of nuclear reactors and complex transient conditions put forward higher requirements for fault diagnosis algorithms. In this paper, an integrated method based on reinforcement learning is proposed, in which the base models of the integrated model are pre-trained machine learning units, and Twin delayed deep deterministic policy gradient is used to learn dynamic weight update strategies to adjust the diagnosis probability of the base models based on One-vs-Rest method. An idea of enhanced value propagation is designed and applied to the reinforcement learning. Experimental results show that the fault diagnosis integrated framework based on reinforcement learning has higher accuracy and lower misdiagnosis rate, the enhanced value propagation is proved to be able to obtain higher fault diagnosis accuracy with faster convergence and stronger optimization ability.</div></div>","PeriodicalId":8006,"journal":{"name":"Annals of Nuclear Energy","volume":"227 ","pages":"Article 111907"},"PeriodicalIF":2.3000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Actor-Critic deep reinforcement learning-based integrated fault diagnosis method for nuclear power plants\",\"authors\":\"Tong Li ,&nbsp;Sichao Tan ,&nbsp;Jiangkuan Li ,&nbsp;Ruifeng Tian ,&nbsp;Jihong Shen ,&nbsp;Jiaoshen Xu\",\"doi\":\"10.1016/j.anucene.2025.111907\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Although the international community has carried out a lot of research work on fault diagnosis, the nonlinear and high coupling characteristics of nuclear reactors and complex transient conditions put forward higher requirements for fault diagnosis algorithms. In this paper, an integrated method based on reinforcement learning is proposed, in which the base models of the integrated model are pre-trained machine learning units, and Twin delayed deep deterministic policy gradient is used to learn dynamic weight update strategies to adjust the diagnosis probability of the base models based on One-vs-Rest method. An idea of enhanced value propagation is designed and applied to the reinforcement learning. Experimental results show that the fault diagnosis integrated framework based on reinforcement learning has higher accuracy and lower misdiagnosis rate, the enhanced value propagation is proved to be able to obtain higher fault diagnosis accuracy with faster convergence and stronger optimization ability.</div></div>\",\"PeriodicalId\":8006,\"journal\":{\"name\":\"Annals of Nuclear Energy\",\"volume\":\"227 \",\"pages\":\"Article 111907\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-10-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annals of Nuclear Energy\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306454925007248\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NUCLEAR SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Nuclear Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306454925007248","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NUCLEAR SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

尽管国际上在故障诊断方面开展了大量的研究工作,但核反应堆的非线性和高耦合特性以及复杂的暂态条件对故障诊断算法提出了更高的要求。本文提出了一种基于强化学习的集成方法,该方法将集成模型的基本模型作为预训练的机器学习单元,利用Twin延迟深度确定性策略梯度学习基于One-vs-Rest方法的动态权值更新策略来调整基本模型的诊断概率。提出了一种增强值传播的思想,并将其应用于强化学习中。实验结果表明,基于强化学习的故障诊断集成框架具有较高的准确率和较低的误诊率,增强的值传播能够以更快的收敛速度和更强的优化能力获得较高的故障诊断精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Actor-Critic deep reinforcement learning-based integrated fault diagnosis method for nuclear power plants

Actor-Critic deep reinforcement learning-based integrated fault diagnosis method for nuclear power plants
Although the international community has carried out a lot of research work on fault diagnosis, the nonlinear and high coupling characteristics of nuclear reactors and complex transient conditions put forward higher requirements for fault diagnosis algorithms. In this paper, an integrated method based on reinforcement learning is proposed, in which the base models of the integrated model are pre-trained machine learning units, and Twin delayed deep deterministic policy gradient is used to learn dynamic weight update strategies to adjust the diagnosis probability of the base models based on One-vs-Rest method. An idea of enhanced value propagation is designed and applied to the reinforcement learning. Experimental results show that the fault diagnosis integrated framework based on reinforcement learning has higher accuracy and lower misdiagnosis rate, the enhanced value propagation is proved to be able to obtain higher fault diagnosis accuracy with faster convergence and stronger optimization ability.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Annals of Nuclear Energy
Annals of Nuclear Energy 工程技术-核科学技术
CiteScore
4.30
自引率
21.10%
发文量
632
审稿时长
7.3 months
期刊介绍: Annals of Nuclear Energy provides an international medium for the communication of original research, ideas and developments in all areas of the field of nuclear energy science and technology. Its scope embraces nuclear fuel reserves, fuel cycles and cost, materials, processing, system and component technology (fission only), design and optimization, direct conversion of nuclear energy sources, environmental control, reactor physics, heat transfer and fluid dynamics, structural analysis, fuel management, future developments, nuclear fuel and safety, nuclear aerosol, neutron physics, computer technology (both software and hardware), risk assessment, radioactive waste disposal and reactor thermal hydraulics. Papers submitted to Annals need to demonstrate a clear link to nuclear power generation/nuclear engineering. Papers which deal with pure nuclear physics, pure health physics, imaging, or attenuation and shielding properties of concretes and various geological materials are not within the scope of the journal. Also, papers that deal with policy or economics are not within the scope of the journal.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信