Proper orthogonal decomposition based parallel compression for visualizing big data on the K computer

2013 IEEE Symposium on Large-Scale Data Analysis and Visualization (LDAV) Pub Date : 2013-12-02 DOI:10.1109/LDAV.2013.6675169

Chongke Bi, K. Ono, K. Ma, Haiyuan Wu, Toshiyuki Imamura

{"title":"Proper orthogonal decomposition based parallel compression for visualizing big data on the K computer","authors":"Chongke Bi, K. Ono, K. Ma, Haiyuan Wu, Toshiyuki Imamura","doi":"10.1109/LDAV.2013.6675169","DOIUrl":null,"url":null,"abstract":"The development of supercomputers has greatly help us to carry on large-scale computing for dealing with various problems through simulating and analyzing them. Visualization is an indispensable tool to understand the properties of the data from supercomputers. Especially, interactive visualization can help us to analyze data from various viewpoints and even to find out some local small but important features. However, it is still difficult to interactively visualize such kind of big data directly due to the slow file I/O problem and the limitation of memory size. For resolving these problems, we proposed a parallel compression method to reduce the data size with low computational cost. Furthermore, the fast linear decompression process is another merit for interactive visualization. Our method uses proper orthogonal decomposition (POD) to compress data because it can effectively extract important features from the data and the resulting compressed data can also be linearly decompressed. Our implementation achieves high parallel efficiency with a binary load-distributed approach, which is similar to the binary-swap image composition used in parallel volume rendering [2]. This approach allows us to effectively utilize all the processing nodes and reduce the interprocessor communication cost throughout the parallel compression calculations. Our test results on the K computer demonstrate superior performance of our design and implementation.","PeriodicalId":266607,"journal":{"name":"2013 IEEE Symposium on Large-Scale Data Analysis and Visualization (LDAV)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE Symposium on Large-Scale Data Analysis and Visualization (LDAV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LDAV.2013.6675169","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

The development of supercomputers has greatly help us to carry on large-scale computing for dealing with various problems through simulating and analyzing them. Visualization is an indispensable tool to understand the properties of the data from supercomputers. Especially, interactive visualization can help us to analyze data from various viewpoints and even to find out some local small but important features. However, it is still difficult to interactively visualize such kind of big data directly due to the slow file I/O problem and the limitation of memory size. For resolving these problems, we proposed a parallel compression method to reduce the data size with low computational cost. Furthermore, the fast linear decompression process is another merit for interactive visualization. Our method uses proper orthogonal decomposition (POD) to compress data because it can effectively extract important features from the data and the resulting compressed data can also be linearly decompressed. Our implementation achieves high parallel efficiency with a binary load-distributed approach, which is similar to the binary-swap image composition used in parallel volume rendering [2]. This approach allows us to effectively utilize all the processing nodes and reduce the interprocessor communication cost throughout the parallel compression calculations. Our test results on the K computer demonstrate superior performance of our design and implementation.

查看原文本刊更多论文

基于正交分解的并行压缩在K计算机上实现大数据可视化

超级计算机的发展极大地帮助我们通过模拟和分析来进行大规模计算，处理各种各样的问题。可视化是理解超级计算机数据属性不可或缺的工具。特别是交互式可视化可以帮助我们从不同的角度分析数据，甚至可以发现一些局部的小而重要的特征。但是，由于文件I/O速度慢和内存大小的限制，这种大数据仍然难以直接交互可视化。为了解决这些问题，我们提出了一种并行压缩方法，以减少数据量和降低计算成本。此外，快速的线性解压缩过程是交互式可视化的另一个优点。我们的方法使用适当的正交分解(POD)来压缩数据，因为它可以有效地从数据中提取重要的特征，并且压缩后的数据也可以线性解压缩。我们的实现通过二进制负载分布式方法实现了高并行效率，这类似于并行体渲染[2]中使用的二进制交换图像组合。这种方法使我们能够有效地利用所有的处理节点，并在并行压缩计算过程中减少处理器间的通信成本。我们在K计算机上的测试结果表明我们的设计和实现具有优越的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 IEEE Symposium on Large-Scale Data Analysis and Visualization (LDAV)

自引率

0.00%

发文量