基于HPC的图像分析生成对抗网络的单节点加速

Proceedings of the 2022 5th International Conference on Computational Intelligence and Intelligent Systems Pub Date : 2022-11-04 DOI:10.1145/3581792.3581802

A. Ravikumar, H. Sriraman

{"title":"基于HPC的图像分析生成对抗网络的单节点加速","authors":"A. Ravikumar, H. Sriraman","doi":"10.1145/3581792.3581802","DOIUrl":null,"url":null,"abstract":"Generative Adversarial Networks (GAN) are approaches that are utilized for data augmentation, which facilitates the development of more accurate detection models for unusual or unbalanced datasets. Computer-assisted diagnostic methods may be made more reliable by using synthetic pictures generated by GAN. Generative adversarial networks are challenging to train because too unpredictable training dynamics may occur throughout the learning process, such as model collapse and vanishing gradients. For accurate and faster results the GAN network need to trained in parallel and distributed manner. We enhance the speed and precision of the Deep Convolutional Generative Adversarial Networks (DCGAN) architecture by using its parallelism and executing it on High-Performance Computing platforms. The effective analysis of a DCGAN in Graphic Processing Unit and Tensor Processing Unit platforms in which each layer execution pattern is analyzed. The bottleneck is identified for the GAN structure for each execution platforms. The Central Processing Unit is capable of processing neural network models, but it requires a great deal of time to do it. Graphic Processing Unit in contrast, side, are a hundred times quicker than CPUs for Neural Networks, however, they are prohibitively expensive compared to CPUs. Using the systolic array structure, TPU performs well on neural networks with high batch sizes but in GAN the shift between CPU and TPU is huge so it does not perform well.","PeriodicalId":436413,"journal":{"name":"Proceedings of the 2022 5th International Conference on Computational Intelligence and Intelligent Systems","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Single Node Acceleration of Generative Adversarial Networks using HPC for Image Analytics\",\"authors\":\"A. Ravikumar, H. Sriraman\",\"doi\":\"10.1145/3581792.3581802\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Generative Adversarial Networks (GAN) are approaches that are utilized for data augmentation, which facilitates the development of more accurate detection models for unusual or unbalanced datasets. Computer-assisted diagnostic methods may be made more reliable by using synthetic pictures generated by GAN. Generative adversarial networks are challenging to train because too unpredictable training dynamics may occur throughout the learning process, such as model collapse and vanishing gradients. For accurate and faster results the GAN network need to trained in parallel and distributed manner. We enhance the speed and precision of the Deep Convolutional Generative Adversarial Networks (DCGAN) architecture by using its parallelism and executing it on High-Performance Computing platforms. The effective analysis of a DCGAN in Graphic Processing Unit and Tensor Processing Unit platforms in which each layer execution pattern is analyzed. The bottleneck is identified for the GAN structure for each execution platforms. The Central Processing Unit is capable of processing neural network models, but it requires a great deal of time to do it. Graphic Processing Unit in contrast, side, are a hundred times quicker than CPUs for Neural Networks, however, they are prohibitively expensive compared to CPUs. Using the systolic array structure, TPU performs well on neural networks with high batch sizes but in GAN the shift between CPU and TPU is huge so it does not perform well.\",\"PeriodicalId\":436413,\"journal\":{\"name\":\"Proceedings of the 2022 5th International Conference on Computational Intelligence and Intelligent Systems\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 5th International Conference on Computational Intelligence and Intelligent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3581792.3581802\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Computational Intelligence and Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3581792.3581802","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

生成对抗网络(GAN)是用于数据增强的方法，它有助于为异常或不平衡数据集开发更准确的检测模型。使用GAN生成的合成图像可以使计算机辅助诊断方法更加可靠。生成对抗网络的训练具有挑战性，因为在整个学习过程中可能会出现不可预测的训练动态，例如模型崩溃和梯度消失。为了获得准确和快速的结果，GAN网络需要采用并行和分布式的方式进行训练。我们利用深度卷积生成对抗网络(Deep Convolutional Generative Adversarial Networks, DCGAN)架构的并行性，并在高性能计算平台上执行该架构，以提高其速度和精度。在图形处理单元和张量处理单元平台上对DCGAN进行了有效的分析，分析了各层的执行模式。确定了每个执行平台的GAN结构的瓶颈。中央处理单元能够处理神经网络模型，但需要大量的时间来完成。相比之下，图形处理单元比用于神经网络的cpu快100倍，然而，与cpu相比，它们的价格昂贵得令人望而却步。采用收缩阵列结构，TPU在高批处理的神经网络上表现良好，但在GAN中CPU和TPU之间的转换很大，因此性能不佳。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Single Node Acceleration of Generative Adversarial Networks using HPC for Image Analytics

Generative Adversarial Networks (GAN) are approaches that are utilized for data augmentation, which facilitates the development of more accurate detection models for unusual or unbalanced datasets. Computer-assisted diagnostic methods may be made more reliable by using synthetic pictures generated by GAN. Generative adversarial networks are challenging to train because too unpredictable training dynamics may occur throughout the learning process, such as model collapse and vanishing gradients. For accurate and faster results the GAN network need to trained in parallel and distributed manner. We enhance the speed and precision of the Deep Convolutional Generative Adversarial Networks (DCGAN) architecture by using its parallelism and executing it on High-Performance Computing platforms. The effective analysis of a DCGAN in Graphic Processing Unit and Tensor Processing Unit platforms in which each layer execution pattern is analyzed. The bottleneck is identified for the GAN structure for each execution platforms. The Central Processing Unit is capable of processing neural network models, but it requires a great deal of time to do it. Graphic Processing Unit in contrast, side, are a hundred times quicker than CPUs for Neural Networks, however, they are prohibitively expensive compared to CPUs. Using the systolic array structure, TPU performs well on neural networks with high batch sizes but in GAN the shift between CPU and TPU is huge so it does not perform well.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2022 5th International Conference on Computational Intelligence and Intelligent Systems

自引率

0.00%

发文量