递归神经网络在多维时间序列预测中的可视化解释

2020 IEEE Pacific Visualization Symposium (PacificVis) Pub Date : 2020-06-01 DOI:10.1109/PacificVis48177.2020.2785

Qiaomu Shen, Yanhong Wu, Yuzhe Jiang, Wei Zeng, A. Lau, Anna Vianova, Huamin Qu

{"title":"递归神经网络在多维时间序列预测中的可视化解释","authors":"Qiaomu Shen, Yanhong Wu, Yuzhe Jiang, Wei Zeng, A. Lau, Anna Vianova, Huamin Qu","doi":"10.1109/PacificVis48177.2020.2785","DOIUrl":null,"url":null,"abstract":"Recent attempts at utilizing visual analytics to interpret Recurrent Neural Networks (RNNs) mainly focus on natural language processing (NLP) tasks that take symbolic sequences as input. However, many real-world problems like environment pollution forecasting apply RNNs on sequences of multi-dimensional data where each dimension represents an individual feature with semantic meaning such as PM2.5 and SO2. RNN interpretation on multi-dimensional sequences is challenging as users need to analyze what features are important at different time steps to better understand model behavior and gain trust in prediction. This requires effective and scalable visualization methods to reveal the complex many-to-many relations between hidden units and features. In this work, we propose a visual analytics system to interpret RNNs on multi-dimensional time-series forecasts. Specifically, to provide an overview to reveal the model mechanism, we propose a technique to estimate the hidden unit response by measuring how different feature selections affect the hidden unit output distribution. We then cluster the hidden units and features based on the response embedding vectors. Finally, we propose a visual analytics system which allows users to visually explore the model behavior from the global and individual levels. We demonstrate the effectiveness of our approach with case studies using air pollutant forecast applications.","PeriodicalId":322092,"journal":{"name":"2020 IEEE Pacific Visualization Symposium (PacificVis)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":"{\"title\":\"Visual Interpretation of Recurrent Neural Network on Multi-dimensional Time-series Forecast\",\"authors\":\"Qiaomu Shen, Yanhong Wu, Yuzhe Jiang, Wei Zeng, A. Lau, Anna Vianova, Huamin Qu\",\"doi\":\"10.1109/PacificVis48177.2020.2785\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent attempts at utilizing visual analytics to interpret Recurrent Neural Networks (RNNs) mainly focus on natural language processing (NLP) tasks that take symbolic sequences as input. However, many real-world problems like environment pollution forecasting apply RNNs on sequences of multi-dimensional data where each dimension represents an individual feature with semantic meaning such as PM2.5 and SO2. RNN interpretation on multi-dimensional sequences is challenging as users need to analyze what features are important at different time steps to better understand model behavior and gain trust in prediction. This requires effective and scalable visualization methods to reveal the complex many-to-many relations between hidden units and features. In this work, we propose a visual analytics system to interpret RNNs on multi-dimensional time-series forecasts. Specifically, to provide an overview to reveal the model mechanism, we propose a technique to estimate the hidden unit response by measuring how different feature selections affect the hidden unit output distribution. We then cluster the hidden units and features based on the response embedding vectors. Finally, we propose a visual analytics system which allows users to visually explore the model behavior from the global and individual levels. We demonstrate the effectiveness of our approach with case studies using air pollutant forecast applications.\",\"PeriodicalId\":322092,\"journal\":{\"name\":\"2020 IEEE Pacific Visualization Symposium (PacificVis)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"25\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Pacific Visualization Symposium (PacificVis)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PacificVis48177.2020.2785\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Pacific Visualization Symposium (PacificVis)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PacificVis48177.2020.2785","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 25

摘要

最近利用视觉分析来解释递归神经网络(rnn)的尝试主要集中在以符号序列作为输入的自然语言处理(NLP)任务上。然而，许多现实世界的问题，如环境污染预测，将rnn应用于多维数据序列，其中每个维度代表具有语义的单个特征，如PM2.5和SO2。RNN对多维序列的解释具有挑战性，因为用户需要分析在不同的时间步长哪些特征是重要的，以更好地理解模型行为并获得对预测的信任。这需要有效和可扩展的可视化方法来揭示隐藏单元和特征之间复杂的多对多关系。在这项工作中，我们提出了一个可视化分析系统来解释rnn在多维时间序列预测上的作用。具体来说，为了提供揭示模型机制的概述，我们提出了一种通过测量不同特征选择如何影响隐藏单元输出分布来估计隐藏单元响应的技术。然后基于响应嵌入向量对隐藏单元和特征进行聚类。最后，我们提出了一个可视化分析系统，允许用户从全局和个人层面可视化地探索模型行为。我们以应用空气污染物预测的个案研究，证明我们方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Visual Interpretation of Recurrent Neural Network on Multi-dimensional Time-series Forecast

Recent attempts at utilizing visual analytics to interpret Recurrent Neural Networks (RNNs) mainly focus on natural language processing (NLP) tasks that take symbolic sequences as input. However, many real-world problems like environment pollution forecasting apply RNNs on sequences of multi-dimensional data where each dimension represents an individual feature with semantic meaning such as PM2.5 and SO2. RNN interpretation on multi-dimensional sequences is challenging as users need to analyze what features are important at different time steps to better understand model behavior and gain trust in prediction. This requires effective and scalable visualization methods to reveal the complex many-to-many relations between hidden units and features. In this work, we propose a visual analytics system to interpret RNNs on multi-dimensional time-series forecasts. Specifically, to provide an overview to reveal the model mechanism, we propose a technique to estimate the hidden unit response by measuring how different feature selections affect the hidden unit output distribution. We then cluster the hidden units and features based on the response embedding vectors. Finally, we propose a visual analytics system which allows users to visually explore the model behavior from the global and individual levels. We demonstrate the effectiveness of our approach with case studies using air pollutant forecast applications.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 IEEE Pacific Visualization Symposium (PacificVis)

自引率

0.00%

发文量