AutoCTS++: zero-shot joint neural architecture and hyperparameter search for correlated time series forecasting

The VLDB Journal Pub Date : 2024-07-30 DOI:10.1007/s00778-024-00872-x

Xinle Wu, Xingjian Wu, Bin Yang, Lekui Zhou, Chenjuan Guo, Xiangfei Qiu, Jilin Hu, Zhenli Sheng, Christian S. Jensen

{"title":"AutoCTS++: zero-shot joint neural architecture and hyperparameter search for correlated time series forecasting","authors":"Xinle Wu, Xingjian Wu, Bin Yang, Lekui Zhou, Chenjuan Guo, Xiangfei Qiu, Jilin Hu, Zhenli Sheng, Christian S. Jensen","doi":"10.1007/s00778-024-00872-x","DOIUrl":null,"url":null,"abstract":"<p>Sensors in cyber-physical systems often capture interconnected processes and thus emit correlated time series (CTS), the forecasting of which enables important applications. Recent deep learning based forecasting methods show strong capabilities at capturing both the temporal dynamics of time series and the spatial correlations among time series, thus achieving impressive accuracy. In particular, automated CTS forecasting, where a deep learning architecture is configured automatically, enables forecasting accuracy that surpasses what has been achieved by manual approaches. However, automated CTS forecasting remains in its infancy, as existing proposals are only able to find optimal architectures for predefined hyperparameters and for specific datasets and forecasting settings (e.g., short vs. long term forecasting). These limitations hinder real-world industrial application, where forecasting faces diverse datasets and forecasting settings. We propose AutoCTS++, a zero-shot, joint search framework, to efficiently configure effective CTS forecasting models (including both neural architectures and hyperparameters), even when facing unseen datasets and foreacsting settings. Specifically, we propose an architecture-hyperparameter joint search space by encoding candidate architecture and accompanying hyperparameters into a graph representation. We then introduce a zero-shot Task-aware Architecture-Hyperparameter Comparator (T-AHC) to rank architecture-hyperparameter pairs according to different tasks (i.e., datasets and forecasting settings). We propose zero-shot means to train T-AHC, enabling it to rank architecture-hyperparameter pairs given unseen datasets and forecasting settings. A final forecasting model is then selected from the top-ranked pairs. Extensive experiments involving multiple benchmark datasets and forecasting settings demonstrate that AutoCTS++ is able to efficiently devise forecasting models for unseen datasets and forecasting settings that are capable of outperforming existing manually designed and automated models.</p>","PeriodicalId":501532,"journal":{"name":"The VLDB Journal","volume":"20 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The VLDB Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s00778-024-00872-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Sensors in cyber-physical systems often capture interconnected processes and thus emit correlated time series (CTS), the forecasting of which enables important applications. Recent deep learning based forecasting methods show strong capabilities at capturing both the temporal dynamics of time series and the spatial correlations among time series, thus achieving impressive accuracy. In particular, automated CTS forecasting, where a deep learning architecture is configured automatically, enables forecasting accuracy that surpasses what has been achieved by manual approaches. However, automated CTS forecasting remains in its infancy, as existing proposals are only able to find optimal architectures for predefined hyperparameters and for specific datasets and forecasting settings (e.g., short vs. long term forecasting). These limitations hinder real-world industrial application, where forecasting faces diverse datasets and forecasting settings. We propose AutoCTS++, a zero-shot, joint search framework, to efficiently configure effective CTS forecasting models (including both neural architectures and hyperparameters), even when facing unseen datasets and foreacsting settings. Specifically, we propose an architecture-hyperparameter joint search space by encoding candidate architecture and accompanying hyperparameters into a graph representation. We then introduce a zero-shot Task-aware Architecture-Hyperparameter Comparator (T-AHC) to rank architecture-hyperparameter pairs according to different tasks (i.e., datasets and forecasting settings). We propose zero-shot means to train T-AHC, enabling it to rank architecture-hyperparameter pairs given unseen datasets and forecasting settings. A final forecasting model is then selected from the top-ranked pairs. Extensive experiments involving multiple benchmark datasets and forecasting settings demonstrate that AutoCTS++ is able to efficiently devise forecasting models for unseen datasets and forecasting settings that are capable of outperforming existing manually designed and automated models.

Abstract Image

查看原文本刊更多论文

AutoCTS++：用于相关时间序列预测的零点联合神经架构和超参数搜索

网络物理系统中的传感器通常会捕捉相互关联的过程，从而发出相关时间序列（CTS），对其进行预测可实现重要的应用。最近，基于深度学习的预测方法在捕捉时间序列的时间动态和时间序列之间的空间相关性方面显示出强大的能力，从而实现了令人印象深刻的准确性。特别是自动 CTS 预测，即自动配置深度学习架构，其预测准确度超过了人工方法。然而，自动 CTS 预测仍处于起步阶段，因为现有的建议只能为预定义的超参数以及特定的数据集和预测设置（如短期与长期预测）找到最佳架构。这些限制阻碍了现实世界中的工业应用，因为预测面临着不同的数据集和预测设置。我们提出了 AutoCTS++--一种零点联合搜索框架，用于高效配置有效的 CTS 预测模型（包括神经架构和超参数），即使在面对未知数据集和预测设置时也是如此。具体来说，我们提出了一个架构-超参数联合搜索空间，将候选架构和相应的超参数编码成图表示。然后，我们引入了一个零射任务感知架构-超参数比较器（T-AHC），根据不同的任务（即数据集和预测设置）对架构-超参数对进行排序。我们提出了零点训练 T-AHC 的方法，使其能够在未见数据集和预测设置的情况下对架构-参数对进行排序。然后从排名靠前的模型中选出最终的预测模型。涉及多个基准数据集和预测设置的广泛实验表明，AutoCTS++ 能够针对未见数据集和预测设置高效地设计预测模型，其性能优于现有的人工设计和自动模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The VLDB Journal

自引率

0.00%

发文量