MSA-LR: Enhancing multi-scale temporal dynamics in multivariate time series forecasting with low-rank self-attention

Jie Sun, Zhilin Sun, Zhongshan Chen, Mengyang Dong, Xiaozheng Wang, Changwei Chen, Hao Zheng, Xiangjun Zhao

Neural Networks, Volume 194, Article 108150. Published 2025-09-23. DOI: 10.1016/j.neunet.2025.108150
Abstract
Accurately forecasting multivariate time series requires effectively capturing intricate temporal dependencies across diverse scales. Existing deep learning models, while promising, often fall short in this regard. Recurrent architectures like LSTMs struggle with long-range dependencies crucial for multi-scale modeling, while standard Transformers, despite employing attention mechanisms, fail to explicitly differentiate the importance of distinct periodicities, treating all time steps within a fixed window with similar relevance. This limitation hinders their ability to leverage the rich hierarchical structure of real-world time series, particularly in long-term forecasting scenarios. This paper introduces MSA-LR (Multi-Scale Self-Attention with Low-Rank Approximation), a novel architecture explicitly designed to capture multi-scale temporal dynamics. MSA-LR leverages a learnable scale weight matrix and low-rank approximations to directly model the influence of different temporal granularities (e.g., hourly, daily, weekly). This approach not only allows for fine-grained control over multi-scale interactions but also significantly reduces computational complexity compared to standard self-attention, enabling efficient processing of long time series. Empirical evaluations on diverse datasets, including electricity load, traffic flow, and air quality, demonstrate that MSA-LR achieves competitive performance compared to state-of-the-art methods, exhibiting notable improvements in long-term forecasting accuracy. Further analysis reveals MSA-LR's ability to discern and leverage periodic patterns at various resolutions, confirming its effectiveness in capturing the rich multi-scale temporal structure of real-world time series data.
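To make the mechanism concrete, the following PyTorch snippet is a minimal sketch based only on the abstract: queries and keys are projected into a low-rank space (so the attention score matrix has rank at most r), attention is computed at several temporal strides, and the scales are mixed through a learnable per-scale weight, a simplified stand-in for the paper's learnable scale weight matrix. The scale set (1, 24, 168), the rank, and the combination scheme are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of low-rank multi-scale self-attention, assuming hourly data
# where strides 1/24/168 stand in for hourly/daily/weekly granularities.
# This is an illustration of the idea in the abstract, not the MSA-LR code.
import torch
import torch.nn as nn
import torch.nn.functional as F


class LowRankMultiScaleAttention(nn.Module):
    def __init__(self, d_model: int, rank: int, scales=(1, 24, 168)):
        super().__init__()
        self.scales = scales
        self.q_proj = nn.Linear(d_model, rank)      # low-rank query projection
        self.k_proj = nn.Linear(d_model, rank)      # low-rank key projection
        self.v_proj = nn.Linear(d_model, d_model)
        # One learnable weight per temporal scale (a simplified version of
        # the abstract's "learnable scale weight matrix").
        self.scale_weights = nn.Parameter(torch.zeros(len(scales)))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model)
        q, k, v = self.q_proj(x), self.k_proj(x), self.v_proj(x)
        mix = F.softmax(self.scale_weights, dim=0)  # normalized scale weights
        out = torch.zeros_like(x)
        for w, s in zip(mix, self.scales):
            # Subsample keys/values at stride s so attention operates on a
            # coarser view of the sequence; scores are (seq_len, seq_len // s).
            k_s, v_s = k[:, ::s], v[:, ::s]
            scores = q @ k_s.transpose(-2, -1) / k.size(-1) ** 0.5
            attn = torch.softmax(scores, dim=-1)
            out = out + w * (attn @ v_s)
        return out


if __name__ == "__main__":
    layer = LowRankMultiScaleAttention(d_model=64, rank=8)
    x = torch.randn(2, 336, 64)  # two weeks of hourly observations
    print(layer(x).shape)        # torch.Size([2, 336, 64])
```

Subsampling keys and values at each stride is only one way to expose a coarser granularity to attention, and the published architecture may aggregate scales differently; the sketch is meant to show how low-rank projections and a learnable scale mixture can cut the cost of standard self-attention.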
About the Journal
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. The journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussion between biology and technology, it aims to encourage the development of biologically inspired artificial intelligence.