{"title":"A semi-supervised temporal modeling strategy integrating VAE and Wasserstein GAN under sparse sampling constraints","authors":"Yujie Hu , Changrui Xie , Xi Chen","doi":"10.1016/j.jprocont.2025.103497","DOIUrl":null,"url":null,"abstract":"<div><div>Time series network models are widely applied in process industries for soft sensing, fault monitoring, and real-time optimization, serving as a powerful tool to enhance the safety and efficiency of industrial production. Typically, time series networks require labeled data for supervised learning. However, labeled data often exhibits sparse sampling characteristics in industrial settings, which limits the model's performance. To address this issue, a semi-supervised modeling strategy based on Variational Autoencoder (VAE) and Wasserstein Generative Adversarial Network (WGAN) is proposed in this paper. The strategy consists of three steps. First, for the labeled samples, process data and labeled data are used as input to train a supervised VAE model (SVAE). Upon completion of the training, the posterior distribution of the latent variable <em><strong>z</strong></em><sub><em><strong>S</strong></em></sub> is obtained. Second, in all samples, only process data is used to train an unsupervised VAE model (UVAE) to extract the latent variable <em><strong>z</strong></em><sub><em><strong>U</strong></em></sub>, and the WGAN discriminator is introduced to distinguish between \"fake data\" (<em><strong>z</strong></em><sub><em><strong>U</strong></em></sub>) and \"real data\" (<em><strong>z</strong></em><sub><em><strong>S</strong></em></sub>). Through adversarial learning between the UVAE and WGAN discriminator, the posterior distribution of <em><strong>z</strong></em><sub><em><strong>U</strong></em></sub> is forced to approximate <em><strong>z</strong></em><sub><em><strong>S</strong></em></sub>. Finally, the encoder of UVAE and the decoder of SVAE are combined to form a Semi-Supervised Variational Autoencoder model (SS-VAE), which extracts the latent variable <em><strong>z</strong></em><sub><em><strong>SS</strong></em></sub> and the reconstructed labeled data from the decoder as inputs for the time series network. Long Short-Term Memory (LSTM) and Temporal Convolutional Network (TCN) are selected as two basic time series models, and their performance, both with and without the proposed semi-supervised approach, is compared to assess the effectiveness and robustness of the strategy. The improvements observed in two industrial case studies validate the efficiency of the proposed approach.</div></div>","PeriodicalId":50079,"journal":{"name":"Journal of Process Control","volume":"152 ","pages":"Article 103497"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Process Control","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0959152425001258","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Time series network models are widely applied in process industries for soft sensing, fault monitoring, and real-time optimization, serving as a powerful tool to enhance the safety and efficiency of industrial production. Typically, time series networks require labeled data for supervised learning. However, labeled data often exhibits sparse sampling characteristics in industrial settings, which limits the model's performance. To address this issue, a semi-supervised modeling strategy based on Variational Autoencoder (VAE) and Wasserstein Generative Adversarial Network (WGAN) is proposed in this paper. The strategy consists of three steps. First, for the labeled samples, process data and labeled data are used as input to train a supervised VAE model (SVAE). Upon completion of the training, the posterior distribution of the latent variable zS is obtained. Second, in all samples, only process data is used to train an unsupervised VAE model (UVAE) to extract the latent variable zU, and the WGAN discriminator is introduced to distinguish between "fake data" (zU) and "real data" (zS). Through adversarial learning between the UVAE and WGAN discriminator, the posterior distribution of zU is forced to approximate zS. Finally, the encoder of UVAE and the decoder of SVAE are combined to form a Semi-Supervised Variational Autoencoder model (SS-VAE), which extracts the latent variable zSS and the reconstructed labeled data from the decoder as inputs for the time series network. Long Short-Term Memory (LSTM) and Temporal Convolutional Network (TCN) are selected as two basic time series models, and their performance, both with and without the proposed semi-supervised approach, is compared to assess the effectiveness and robustness of the strategy. The improvements observed in two industrial case studies validate the efficiency of the proposed approach.
期刊介绍:
This international journal covers the application of control theory, operations research, computer science and engineering principles to the solution of process control problems. In addition to the traditional chemical processing and manufacturing applications, the scope of process control problems involves a wide range of applications that includes energy processes, nano-technology, systems biology, bio-medical engineering, pharmaceutical processing technology, energy storage and conversion, smart grid, and data analytics among others.
Papers on the theory in these areas will also be accepted provided the theoretical contribution is aimed at the application and the development of process control techniques.
Topics covered include:
• Control applications• Process monitoring• Plant-wide control• Process control systems• Control techniques and algorithms• Process modelling and simulation• Design methods
Advanced design methods exclude well established and widely studied traditional design techniques such as PID tuning and its many variants. Applications in fields such as control of automotive engines, machinery and robotics are not deemed suitable unless a clear motivation for the relevance to process control is provided.