基于双向f散度的时间序列数据缺失值输入深度生成方法。

IF 0.9 Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Stats Pub Date : 2025-03-01 Epub Date: 2025-01-14 DOI:10.3390/stats8010007

Wen-Shan Liu, Tong Si, Aldas Kriauciunas, Marcus Snell, Haijun Gong

{"title":"基于双向f散度的时间序列数据缺失值输入深度生成方法。","authors":"Wen-Shan Liu, Tong Si, Aldas Kriauciunas, Marcus Snell, Haijun Gong","doi":"10.3390/stats8010007","DOIUrl":null,"url":null,"abstract":"Imputing missing values in high-dimensional time-series data remains a significant challenge in statistics and machine learning. Although various methods have been proposed in recent years, many struggle with limitations and reduced accuracy, particularly when the missing rate is high. In this work, we present a novel f-divergence-based bidirectional generative adversarial imputation network, tf-BiGAIN, designed to address these challenges in time-series data imputation. Unlike traditional imputation methods, tf-BiGAIN employs a generative model to synthesize missing values without relying on distributional assumptions. The imputation process is achieved by training two neural networks, implemented using bidirectional modified gated recurrent units, with f-divergence serving as the objective function to guide optimization. Compared to existing deep learning-based methods, tf-BiGAIN introduces two key innovations. First, the use of f-divergence provides a flexible and adaptable framework for optimizing the model across diverse imputation tasks, enhancing its versatility. Second, the use of bidirectional gated recurrent units allows the model to leverage both forward and backward temporal information. This bidirectional approach enables the model to effectively capture dependencies from both past and future observations, enhancing its imputation accuracy and robustness. We applied tf-BiGAIN to analyze two real-world time-series datasets, demonstrating its superior performance in imputing missing values and outperforming existing methods in terms of accuracy and robustness.","PeriodicalId":93142,"journal":{"name":"Stats","volume":"8 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11793919/pdf/","citationCount":"0","resultStr":"{\"title\":\"Bidirectional f-Divergence-Based Deep Generative Method for Imputing Missing Values in Time-Series Data.\",\"authors\":\"Wen-Shan Liu, Tong Si, Aldas Kriauciunas, Marcus Snell, Haijun Gong\",\"doi\":\"10.3390/stats8010007\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Imputing missing values in high-dimensional time-series data remains a significant challenge in statistics and machine learning. Although various methods have been proposed in recent years, many struggle with limitations and reduced accuracy, particularly when the missing rate is high. In this work, we present a novel f-divergence-based bidirectional generative adversarial imputation network, tf-BiGAIN, designed to address these challenges in time-series data imputation. Unlike traditional imputation methods, tf-BiGAIN employs a generative model to synthesize missing values without relying on distributional assumptions. The imputation process is achieved by training two neural networks, implemented using bidirectional modified gated recurrent units, with f-divergence serving as the objective function to guide optimization. Compared to existing deep learning-based methods, tf-BiGAIN introduces two key innovations. First, the use of f-divergence provides a flexible and adaptable framework for optimizing the model across diverse imputation tasks, enhancing its versatility. Second, the use of bidirectional gated recurrent units allows the model to leverage both forward and backward temporal information. This bidirectional approach enables the model to effectively capture dependencies from both past and future observations, enhancing its imputation accuracy and robustness. We applied tf-BiGAIN to analyze two real-world time-series datasets, demonstrating its superior performance in imputing missing values and outperforming existing methods in terms of accuracy and robustness.\",\"PeriodicalId\":93142,\"journal\":{\"name\":\"Stats\",\"volume\":\"8 1\",\"pages\":\"\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11793919/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stats\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/stats8010007\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/14 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q4\",\"JCRName\":\"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stats","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/stats8010007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/14 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

摘要

在高维时间序列数据中输入缺失值仍然是统计学和机器学习中的重大挑战。尽管近年来提出了各种方法，但许多方法都存在局限性和准确性降低的问题，特别是在缺失率很高的情况下。在这项工作中，我们提出了一种新的基于f散度的双向生成对抗输入网络tf-BiGAIN，旨在解决时间序列数据输入中的这些挑战。与传统的imputation方法不同，tf-BiGAIN采用生成模型来综合缺失值，而不依赖于分布假设。输入过程是通过训练两个神经网络来实现的，使用双向修正门控循环单元，以f-散度作为指导优化的目标函数。与现有的基于深度学习的方法相比，tf-BiGAIN引入了两个关键创新。首先，f-散度的使用提供了一个灵活且适应性强的框架，用于跨不同的输入任务优化模型，增强了其通用性。其次，使用双向门控循环单元允许模型利用前向和后向时间信息。这种双向方法使模型能够有效地从过去和未来的观测中捕获依赖关系，提高其估算精度和鲁棒性。我们应用tf-BiGAIN分析了两个真实世界的时间序列数据集，证明了它在输入缺失值方面的优越性能，并且在准确性和鲁棒性方面优于现有方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Bidirectional f-Divergence-Based Deep Generative Method for Imputing Missing Values in Time-Series Data.

查看原文本刊更多论文

Bidirectional f-Divergence-Based Deep Generative Method for Imputing Missing Values in Time-Series Data.

Imputing missing values in high-dimensional time-series data remains a significant challenge in statistics and machine learning. Although various methods have been proposed in recent years, many struggle with limitations and reduced accuracy, particularly when the missing rate is high. In this work, we present a novel f-divergence-based bidirectional generative adversarial imputation network, tf-BiGAIN, designed to address these challenges in time-series data imputation. Unlike traditional imputation methods, tf-BiGAIN employs a generative model to synthesize missing values without relying on distributional assumptions. The imputation process is achieved by training two neural networks, implemented using bidirectional modified gated recurrent units, with f-divergence serving as the objective function to guide optimization. Compared to existing deep learning-based methods, tf-BiGAIN introduces two key innovations. First, the use of f-divergence provides a flexible and adaptable framework for optimizing the model across diverse imputation tasks, enhancing its versatility. Second, the use of bidirectional gated recurrent units allows the model to leverage both forward and backward temporal information. This bidirectional approach enables the model to effectively capture dependencies from both past and future observations, enhancing its imputation accuracy and robustness. We applied tf-BiGAIN to analyze two real-world time-series datasets, demonstrating its superior performance in imputing missing values and outperforming existing methods in terms of accuracy and robustness.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Stats

CiteScore

0.60

自引率

0.00%

发文量

审稿时长

7 weeks