A semi-supervised temporal modeling strategy integrating VAE and Wasserstein GAN under sparse sampling constraints

IF 3.3 2区计算机科学 Q2 AUTOMATION & CONTROL SYSTEMS

Journal of Process Control Pub Date : 2025-07-05 DOI:10.1016/j.jprocont.2025.103497

Yujie Hu , Changrui Xie , Xi Chen

{"title":"A semi-supervised temporal modeling strategy integrating VAE and Wasserstein GAN under sparse sampling constraints","authors":"Yujie Hu , Changrui Xie , Xi Chen","doi":"10.1016/j.jprocont.2025.103497","DOIUrl":null,"url":null,"abstract":"<div><div>Time series network models are widely applied in process industries for soft sensing, fault monitoring, and real-time optimization, serving as a powerful tool to enhance the safety and efficiency of industrial production. Typically, time series networks require labeled data for supervised learning. However, labeled data often exhibits sparse sampling characteristics in industrial settings, which limits the model's performance. To address this issue, a semi-supervised modeling strategy based on Variational Autoencoder (VAE) and Wasserstein Generative Adversarial Network (WGAN) is proposed in this paper. The strategy consists of three steps. First, for the labeled samples, process data and labeled data are used as input to train a supervised VAE model (SVAE). Upon completion of the training, the posterior distribution of the latent variable zS is obtained. Second, in all samples, only process data is used to train an unsupervised VAE model (UVAE) to extract the latent variable zU, and the WGAN discriminator is introduced to distinguish between \"fake data\" (zU) and \"real data\" (zS). Through adversarial learning between the UVAE and WGAN discriminator, the posterior distribution of zU is forced to approximate zS. Finally, the encoder of UVAE and the decoder of SVAE are combined to form a Semi-Supervised Variational Autoencoder model (SS-VAE), which extracts the latent variable zSS and the reconstructed labeled data from the decoder as inputs for the time series network. Long Short-Term Memory (LSTM) and Temporal Convolutional Network (TCN) are selected as two basic time series models, and their performance, both with and without the proposed semi-supervised approach, is compared to assess the effectiveness and robustness of the strategy. The improvements observed in two industrial case studies validate the efficiency of the proposed approach.</div></div>","PeriodicalId":50079,"journal":{"name":"Journal of Process Control","volume":"152 ","pages":"Article 103497"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Process Control","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0959152425001258","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Time series network models are widely applied in process industries for soft sensing, fault monitoring, and real-time optimization, serving as a powerful tool to enhance the safety and efficiency of industrial production. Typically, time series networks require labeled data for supervised learning. However, labeled data often exhibits sparse sampling characteristics in industrial settings, which limits the model's performance. To address this issue, a semi-supervised modeling strategy based on Variational Autoencoder (VAE) and Wasserstein Generative Adversarial Network (WGAN) is proposed in this paper. The strategy consists of three steps. First, for the labeled samples, process data and labeled data are used as input to train a supervised VAE model (SVAE). Upon completion of the training, the posterior distribution of the latent variable z_S is obtained. Second, in all samples, only process data is used to train an unsupervised VAE model (UVAE) to extract the latent variable z_U, and the WGAN discriminator is introduced to distinguish between "fake data" (z_U) and "real data" (z_S). Through adversarial learning between the UVAE and WGAN discriminator, the posterior distribution of z_U is forced to approximate z_S. Finally, the encoder of UVAE and the decoder of SVAE are combined to form a Semi-Supervised Variational Autoencoder model (SS-VAE), which extracts the latent variable z_SS and the reconstructed labeled data from the decoder as inputs for the time series network. Long Short-Term Memory (LSTM) and Temporal Convolutional Network (TCN) are selected as two basic time series models, and their performance, both with and without the proposed semi-supervised approach, is compared to assess the effectiveness and robustness of the strategy. The improvements observed in two industrial case studies validate the efficiency of the proposed approach.

查看原文本刊更多论文

稀疏采样约束下结合VAE和Wasserstein GAN的半监督时态建模策略

时间序列网络模型广泛应用于过程工业的软测量、故障监测和实时优化，是提高工业生产安全性和效率的有力工具。通常，时间序列网络需要标记数据进行监督学习。然而，在工业环境中，标记数据通常表现出稀疏的采样特征，这限制了模型的性能。为了解决这一问题，本文提出了一种基于变分自编码器（VAE）和Wasserstein生成对抗网络（WGAN）的半监督建模策略。该战略包括三个步骤。首先，对于标记的样本，使用过程数据和标记数据作为输入来训练有监督的VAE模型（SVAE）。训练完成后，得到潜伏变量zS的后验分布。其次，在所有样本中，只使用过程数据训练无监督VAE模型（UVAE）来提取潜在变量zU，并引入WGAN鉴别器来区分“假数据”（zU）和“真实数据”（zS）。通过UVAE和WGAN鉴别器之间的对抗性学习，使zU的后验分布近似于zS。最后，将UVAE的编码器和SVAE的解码器相结合，形成一个半监督变分自编码器模型（SS-VAE），该模型从解码器中提取潜变量zSS和重构的标记数据作为时间序列网络的输入。选择长短期记忆（LSTM）和颞卷积网络（TCN）作为两个基本的时间序列模型，并比较了它们在使用和不使用半监督方法时的性能，以评估该策略的有效性和鲁棒性。在两个工业案例研究中观察到的改进验证了所提出方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Process Control 工程技术-工程：化工

CiteScore

7.00

自引率

11.90%

发文量

159

审稿时长

74 days

期刊介绍： This international journal covers the application of control theory, operations research, computer science and engineering principles to the solution of process control problems. In addition to the traditional chemical processing and manufacturing applications, the scope of process control problems involves a wide range of applications that includes energy processes, nano-technology, systems biology, bio-medical engineering, pharmaceutical processing technology, energy storage and conversion, smart grid, and data analytics among others. Papers on the theory in these areas will also be accepted provided the theoretical contribution is aimed at the application and the development of process control techniques. Topics covered include: • Control applications• Process monitoring• Plant-wide control• Process control systems• Control techniques and algorithms• Process modelling and simulation• Design methods Advanced design methods exclude well established and widely studied traditional design techniques such as PID tuning and its many variants. Applications in fields such as control of automotive engines, machinery and robotics are not deemed suitable unless a clear motivation for the relevance to process control is provided.