Remaining Useful Life Prediction of Power MOSFETs Based on Deep Reinforcement Learning

IF 4.3 2区综合性期刊 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Sensors Journal Pub Date : 2025-08-11 DOI:10.1109/JSEN.2025.3596121

Yuxuan Xie;Yujie Zhang;Qiang Miao

{"title":"Remaining Useful Life Prediction of Power MOSFETs Based on Deep Reinforcement Learning","authors":"Yuxuan Xie;Yujie Zhang;Qiang Miao","doi":"10.1109/JSEN.2025.3596121","DOIUrl":null,"url":null,"abstract":"The power metal–oxide–semiconductor field-effect transistors (MOSFETs) are the critical components extensively utilized in various fields. To ensure the safety and reliability of MOSFET-based systems, the remaining useful life (RUL) prediction techniques can be applied to calculate the RUL during the early to middle stages of equipment degradation, thereby avoiding more severe failures. Over the past few years, the RUL prediction technology utilizing reinforcement learning (RL) has steadily matured, with the deep deterministic policy gradient (DDPG) algorithm demonstrating excellent predictive performance in time-series data prediction. However, the complicated degradation mechanisms of power MOSFETs and the limited historical degradation data make it challenging for the hyperparameters of DDPG model to effectively converge to the global optimum. Consequently, the DDPG-based prediction method suffers from poor generalization and adaptability, leading to larger prediction errors. To address the issue, we propose a novel prediction method, named GDDPG, which integrates Gaussian process regression (GPR) with DDPG. First, monitoring data of MOSFETs are used to construct state matrix and optimal action space. Second, during the iterative prediction phase, the GPR algorithm is introduced to correct each single-step prediction result from the DDPG model. Finally, the corrected prediction value is incorporated into the next iteration’s state variables for subsequent prediction. Validation on the publicly available NASA prognostics center of excellence (PCoE) dataset “MOSFET Thermal Overstress Aging” demonstrates that the proposed method consistently surpasses all comparison methods across multiple evaluation metrics. Thus, the GDDPG-based method significantly enhances the precision and dependability of RUL prediction for MOSFETs.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"25 18","pages":"35090-35100"},"PeriodicalIF":4.3000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/11122405/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

The power metal–oxide–semiconductor field-effect transistors (MOSFETs) are the critical components extensively utilized in various fields. To ensure the safety and reliability of MOSFET-based systems, the remaining useful life (RUL) prediction techniques can be applied to calculate the RUL during the early to middle stages of equipment degradation, thereby avoiding more severe failures. Over the past few years, the RUL prediction technology utilizing reinforcement learning (RL) has steadily matured, with the deep deterministic policy gradient (DDPG) algorithm demonstrating excellent predictive performance in time-series data prediction. However, the complicated degradation mechanisms of power MOSFETs and the limited historical degradation data make it challenging for the hyperparameters of DDPG model to effectively converge to the global optimum. Consequently, the DDPG-based prediction method suffers from poor generalization and adaptability, leading to larger prediction errors. To address the issue, we propose a novel prediction method, named GDDPG, which integrates Gaussian process regression (GPR) with DDPG. First, monitoring data of MOSFETs are used to construct state matrix and optimal action space. Second, during the iterative prediction phase, the GPR algorithm is introduced to correct each single-step prediction result from the DDPG model. Finally, the corrected prediction value is incorporated into the next iteration’s state variables for subsequent prediction. Validation on the publicly available NASA prognostics center of excellence (PCoE) dataset “MOSFET Thermal Overstress Aging” demonstrates that the proposed method consistently surpasses all comparison methods across multiple evaluation metrics. Thus, the GDDPG-based method significantly enhances the precision and dependability of RUL prediction for MOSFETs.

查看原文本刊更多论文

基于深度强化学习的功率mosfet剩余使用寿命预测

功率金属氧化物半导体场效应晶体管（mosfet）是广泛应用于各个领域的关键器件。为了确保基于mosfet的系统的安全性和可靠性，可以应用剩余使用寿命（RUL）预测技术来计算设备退化早期到中期的RUL，从而避免更严重的故障。近年来，利用强化学习（RL）的RUL预测技术不断成熟，深度确定性策略梯度（DDPG）算法在时间序列数据预测中表现出优异的预测性能。然而，功率mosfet复杂的退化机制和有限的历史退化数据给DDPG模型的超参数有效收敛到全局最优带来了挑战。因此，基于ddpg的预测方法泛化和自适应性差，导致预测误差较大。为了解决这个问题，我们提出了一种新的预测方法，称为GDDPG，它将高斯过程回归（GPR）与DDPG相结合。首先，利用mosfet的监测数据构造状态矩阵和最优动作空间；其次，在迭代预测阶段，引入GPR算法对DDPG模型的各单步预测结果进行校正；最后，将修正后的预测值合并到下一次迭代的状态变量中，用于后续的预测。在公开可用的NASA预测卓越中心（PCoE）数据集“MOSFET热超应力老化”上的验证表明，所提出的方法在多个评估指标上始终优于所有比较方法。因此，基于gddpg的方法显著提高了mosfet RUL预测的精度和可靠性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Sensors Journal 工程技术-工程：电子与电气

CiteScore

7.70

自引率

14.00%

发文量

2058

审稿时长

5.2 months

期刊介绍： The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following: -Sensor Phenomenology, Modelling, and Evaluation -Sensor Materials, Processing, and Fabrication -Chemical and Gas Sensors -Microfluidics and Biosensors -Optical Sensors -Physical Sensors: Temperature, Mechanical, Magnetic, and others -Acoustic and Ultrasonic Sensors -Sensor Packaging -Sensor Networks -Sensor Applications -Sensor Systems: Signals, Processing, and Interfaces -Actuators and Sensor Power Systems -Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting -Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data) -Sensors in Industrial Practice