Shuichi Inoue, S. Nobukawa, Haruhiko Nishimura, Eiji Watanabe, T. Isokawa
{"title":"Multi-scale dynamics by adjusting the leaking rate to enhance the performance of deep echo state networks","authors":"Shuichi Inoue, S. Nobukawa, Haruhiko Nishimura, Eiji Watanabe, T. Isokawa","doi":"10.3389/frai.2024.1397915","DOIUrl":null,"url":null,"abstract":"The deep echo state network (Deep-ESN) architecture, which comprises a multi-layered reservoir layer, exhibits superior performance compared to conventional echo state networks (ESNs) owing to the divergent layer-specific time-scale responses in the Deep-ESN. Although researchers have attempted to use experimental trial-and-error grid searches and Bayesian optimization methods to adjust the hyperparameters, suitable guidelines for setting hyperparameters to adjust the time scale of the dynamics in each layer from the perspective of dynamical characteristics have not been established. In this context, we hypothesized that evaluating the dependence of the multi-time-scale dynamical response on the leaking rate as a typical hyperparameter of the time scale in each neuron would help to achieve a guideline for optimizing the hyperparameters of the Deep-ESN.First, we set several leaking rates for each layer of the Deep-ESN and performed multi-scale entropy (MSCE) analysis to analyze the impact of the leaking rate on the dynamics in each layer. Second, we performed layer-by-layer cross-correlation analysis between adjacent layers to elucidate the structural mechanisms to enhance the performance.As a result, an optimum task-specific leaking rate value for producing layer-specific multi-time-scale responses and a queue structure with layer-to-layer signal transmission delays for retaining past applied input enhance the Deep-ESN prediction performance.These findings can help to establish ideal design guidelines for setting the hyperparameters of Deep-ESNs.","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":3.0000,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frai.2024.1397915","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The deep echo state network (Deep-ESN) architecture, which comprises multiple stacked reservoir layers, exhibits superior performance compared with conventional echo state networks (ESNs) owing to its divergent layer-specific time-scale responses. Although researchers have attempted to tune the hyperparameters using experimental trial-and-error grid searches and Bayesian optimization, no suitable guidelines have been established, from the perspective of dynamical characteristics, for setting the hyperparameters that adjust the time scale of the dynamics in each layer. In this context, we hypothesized that evaluating how the multi-time-scale dynamical response depends on the leaking rate, a typical hyperparameter governing the time scale of each neuron, would help establish a guideline for optimizing the hyperparameters of the Deep-ESN.

First, we set several leaking rates for each layer of the Deep-ESN and performed multi-scale entropy (MSCE) analysis to assess the impact of the leaking rate on the dynamics in each layer. Second, we performed layer-by-layer cross-correlation analysis between adjacent layers to elucidate the structural mechanisms that enhance performance.

As a result, we found that an optimal, task-specific leaking rate that produces layer-specific multi-time-scale responses, together with a queue structure in which layer-to-layer signal transmission delays retain past applied inputs, enhances the prediction performance of the Deep-ESN.

These findings can help establish design guidelines for setting the hyperparameters of Deep-ESNs.
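To make the role of the per-layer leaking rate concrete, the following is a minimal NumPy sketch of a leaky-integrator Deep-ESN forward pass. It is not the authors' implementation; the layer sizes, weight scalings, and the leaking-rate values in `leak` are illustrative assumptions. Each layer updates its state with its own leaking rate, so smaller values give slower, longer-memory dynamics in deeper layers.

```python
# Minimal sketch (illustrative, not the paper's code): a Deep-ESN forward pass
# with leaky-integrator reservoir layers, each with its own leaking rate.
import numpy as np

rng = np.random.default_rng(0)
n_layers, n_in, n_res = 3, 1, 100          # hypothetical sizes
leak = [0.9, 0.5, 0.1]                     # hypothetical per-layer leaking rates

def scaled(w, rho=0.9):
    """Rescale a recurrent matrix so that its spectral radius equals rho."""
    return w * (rho / np.abs(np.linalg.eigvals(w)).max())

# Layer 0 reads the external input; layer l > 0 reads the state of layer l-1.
W_in = [rng.uniform(-0.1, 0.1, (n_res, n_in if l == 0 else n_res))
        for l in range(n_layers)]
W = [scaled(rng.uniform(-0.5, 0.5, (n_res, n_res))) for _ in range(n_layers)]
x = [np.zeros(n_res) for _ in range(n_layers)]

def step(u):
    """One time step: propagate the input through the stacked reservoirs."""
    global x
    drive, new_x = u, []
    for l in range(n_layers):
        pre = np.tanh(W_in[l] @ drive + W[l] @ x[l])
        new_x.append((1.0 - leak[l]) * x[l] + leak[l] * pre)  # leaky integration
        drive = new_x[l]                                       # feed next layer
    x = new_x
    return np.concatenate(x)            # concatenated states for a linear readout

# Example: drive the stack with a short sine input and collect the states.
states = np.array([step(np.array([np.sin(0.2 * t)])) for t in range(200)])
```

Under this kind of setup, the leaking rate `leak[l]` directly sets the effective time constant of layer `l`, which is the hyperparameter whose layer-wise choice the abstract relates to multi-time-scale responses and prediction performance.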