Dongnian Jiang , Haowen Yang , Huichao Cao , Dezhi Xu
{"title":"A missing data imputation method for industrial soft sensor modeling","authors":"Dongnian Jiang , Haowen Yang , Huichao Cao , Dezhi Xu","doi":"10.1016/j.jprocont.2025.103485","DOIUrl":null,"url":null,"abstract":"<div><div>Data on complex industrial processes are often missing, due to sensor or equipment malfunctions; this poses challenges for the prediction of important quality variables and soft sensor applications, and may have a significant impact on production processes and equipment maintenance. Traditional missing data imputation methods face challenges in terms of acquiring data distributions, structures, etc., and are detached from the downstream soft sensor tasks, as they do not consider the close connections and synergistic relationships between missing data imputation and soft sensors. This affects the filling results for important quality variables, and thus reduces the prediction accuracy of the downstream soft sensors. To address these issues, a missing data imputation method for industrial soft sensor modeling, called PFIDM, is proposed that can realize a customized data imputation process with a progressive feedback strategy. The loss function of the improved diffusion model (IDDPM) is rationally designed to introduce KL dispersion between the noise addition process and the data distribution corresponding to the generation process into the next step of noise prediction, which involves predicting and correcting the noise of the current data state. In addition, a dynamic step decay factor related to the noise intensity is defined in the sampling process, and the sampling step span is adaptively adjusted to reduce the number of sampling steps and to accelerate the sampling time. The superiority of the proposed method is verified by comparing several typical methods and instantiating a dataset.</div></div>","PeriodicalId":50079,"journal":{"name":"Journal of Process Control","volume":"152 ","pages":"Article 103485"},"PeriodicalIF":3.3000,"publicationDate":"2025-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Process Control","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0959152425001131","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Data on complex industrial processes are often missing, due to sensor or equipment malfunctions; this poses challenges for the prediction of important quality variables and soft sensor applications, and may have a significant impact on production processes and equipment maintenance. Traditional missing data imputation methods face challenges in terms of acquiring data distributions, structures, etc., and are detached from the downstream soft sensor tasks, as they do not consider the close connections and synergistic relationships between missing data imputation and soft sensors. This affects the filling results for important quality variables, and thus reduces the prediction accuracy of the downstream soft sensors. To address these issues, a missing data imputation method for industrial soft sensor modeling, called PFIDM, is proposed that can realize a customized data imputation process with a progressive feedback strategy. The loss function of the improved diffusion model (IDDPM) is rationally designed to introduce KL dispersion between the noise addition process and the data distribution corresponding to the generation process into the next step of noise prediction, which involves predicting and correcting the noise of the current data state. In addition, a dynamic step decay factor related to the noise intensity is defined in the sampling process, and the sampling step span is adaptively adjusted to reduce the number of sampling steps and to accelerate the sampling time. The superiority of the proposed method is verified by comparing several typical methods and instantiating a dataset.
期刊介绍:
This international journal covers the application of control theory, operations research, computer science and engineering principles to the solution of process control problems. In addition to the traditional chemical processing and manufacturing applications, the scope of process control problems involves a wide range of applications that includes energy processes, nano-technology, systems biology, bio-medical engineering, pharmaceutical processing technology, energy storage and conversion, smart grid, and data analytics among others.
Papers on the theory in these areas will also be accepted provided the theoretical contribution is aimed at the application and the development of process control techniques.
Topics covered include:
• Control applications• Process monitoring• Plant-wide control• Process control systems• Control techniques and algorithms• Process modelling and simulation• Design methods
Advanced design methods exclude well established and widely studied traditional design techniques such as PID tuning and its many variants. Applications in fields such as control of automotive engines, machinery and robotics are not deemed suitable unless a clear motivation for the relevance to process control is provided.