{"title":"Data augmentation for forecasting industrial aging processes via conditional multimodal generative time-series models","authors":"Mihail Bogojeski , Nataliya Yakut , Sasho Nedelkoski , Shinichi Nakajima , Klaus-Robert Müller","doi":"10.1016/j.compchemeng.2025.109109","DOIUrl":null,"url":null,"abstract":"<div><div>Data augmentation has shown to be effective for improving generalization performance of deep neural networks, especially in the regime of high noise and scarce data. However, this approach has not been applied to industrial aging processes (IAP) forecasting, where observed data are multimodal time-series, and therefore existing augmentation methods are not suitable for data generation. In this paper, we propose Seq-MVAE, a generative architecture that can generate complex time-series data consisting of multiple heterogeneous modalities. Seq-MVAE is capable of conditional generation, i.e., Seq-MVAE learns the joint distribution across the modalities, and allows users to generate a part of the modalities that are coherent with the other (given) modalities. This enables not only missing value imputation but also conditional generation, which is known to be crucial for data augmentation. We evaluate the generative performance and other aspects of Seq-MVAE on an artificial dataset generated based designed to simulate an industrial aging process, and show the effectiveness of data augmentation by Seq-MVAE on a real-world dataset acquired from an industrial plant.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"199 ","pages":"Article 109109"},"PeriodicalIF":3.9000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425001139","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Data augmentation has shown to be effective for improving generalization performance of deep neural networks, especially in the regime of high noise and scarce data. However, this approach has not been applied to industrial aging processes (IAP) forecasting, where observed data are multimodal time-series, and therefore existing augmentation methods are not suitable for data generation. In this paper, we propose Seq-MVAE, a generative architecture that can generate complex time-series data consisting of multiple heterogeneous modalities. Seq-MVAE is capable of conditional generation, i.e., Seq-MVAE learns the joint distribution across the modalities, and allows users to generate a part of the modalities that are coherent with the other (given) modalities. This enables not only missing value imputation but also conditional generation, which is known to be crucial for data augmentation. We evaluate the generative performance and other aspects of Seq-MVAE on an artificial dataset generated based designed to simulate an industrial aging process, and show the effectiveness of data augmentation by Seq-MVAE on a real-world dataset acquired from an industrial plant.
期刊介绍:
Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.