{"title":"Analyzing I/O Performance of a Hierarchical HPC Storage System for Distributed Deep Learning","authors":"Takaaki Fukai, Kento Sato, Takahiro Hirofuchi","doi":"10.48550/arXiv.2301.01494","DOIUrl":"https://doi.org/10.48550/arXiv.2301.01494","url":null,"abstract":"Today, deep learning is an essential technology for our life. To solve more complex problems with deep learning, both sizes of training datasets and neural networks are increasing. To train a model with large datasets and networks, distributed deep neural network (DDNN) training technique is necessary. For large-scale DDNN training, HPC clusters are a promising computation environment. In large-scale DDNN on HPC clusters, I/O performance is critical because it is becoming a bottleneck. Most flagship-class HPC clusters have hierarchical storage systems. For designing future HPC storage systems, it is necessary to quantify the performance improvement effect of the hierarchical storage system on the workloads. This paper demonstrates the quantitative performance analysis of the hierarchical storage system for DDNN workload in a flagship-class supercomputer. Our analysis shows how much performance improvement and volume increment of the storage will be required to meet the performance goal.","PeriodicalId":110399,"journal":{"name":"International Conference on Parallel and Distributed Computing: Applications and Technologies","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133883208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DEEPFAKE CLI: Accelerated Deepfake Detection using FPGAs","authors":"Omkar Bhilare, Rahul Singh, V. Paranjape, Sravan Chittupalli, Shraddha Suratkar, F. Kazi","doi":"10.48550/arXiv.2210.14743","DOIUrl":"https://doi.org/10.48550/arXiv.2210.14743","url":null,"abstract":"Because of the availability of larger datasets and recent improvements in the generative model, more realistic Deepfake videos are being produced each day. People consume around one billion hours of video on social media platforms every day, and that's why it is very important to stop the spread of fake videos as they can be damaging, dangerous, and malicious. There has been a significant improvement in the field of deepfake classification, but deepfake detection and inference have remained a difficult task. To solve this problem in this paper, we propose a novel DEEPFAKE C-L-I (Classification-Localization-Inference) in which we have explored the idea of accelerating Quantized Deepfake Detection Models using FPGAs due to their ability of maximum parallelism and energy efficiency compared to generalized GPUs. In this paper, we have used light MesoNet with EFF-YNet structure and accelerated it on VCK5000 FPGA, powered by state-of-the-art VC1902 Versal Architecture which uses AI, DSP, and Adaptable Engines for acceleration. We have benchmarked our inference speed with other state-of-the-art inference nodes, got 316.8 FPS on VCK5000 while maintaining 93% Accuracy.","PeriodicalId":110399,"journal":{"name":"International Conference on Parallel and Distributed Computing: Applications and Technologies","volume":"128 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131835086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"LightLayers: Parameter Efficient Dense and Convolutional Layers for Image Classification","authors":"Debesh Jha, A. Yazidi, M. Riegler, Dag Johansen, Haavard D. Johansen, P. Halvorsen","doi":"10.1007/978-3-030-69244-5_25","DOIUrl":"https://doi.org/10.1007/978-3-030-69244-5_25","url":null,"abstract":"","PeriodicalId":110399,"journal":{"name":"International Conference on Parallel and Distributed Computing: Applications and Technologies","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127225949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Blood Leukocyte Object Detection According to Model Parameter-Transfer and Deformable Convolution","authors":"Kaizhi Chen, Wencheng Wei, Shangping Zhong, Longkun Guo","doi":"10.1007/978-3-030-69244-5_1","DOIUrl":"https://doi.org/10.1007/978-3-030-69244-5_1","url":null,"abstract":"","PeriodicalId":110399,"journal":{"name":"International Conference on Parallel and Distributed Computing: Applications and Technologies","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131875526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analysis of Massive E-learning Processes: An Approach Based on Big Association Rules Mining","authors":"Asma Hassani, Sonia Ayachi Ghannouchi","doi":"10.1007/978-981-13-5907-1_20","DOIUrl":"https://doi.org/10.1007/978-981-13-5907-1_20","url":null,"abstract":"","PeriodicalId":110399,"journal":{"name":"International Conference on Parallel and Distributed Computing: Applications and Technologies","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124934451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Green vs Revenue: Data Center Profit Maximization Under Green Degree Constraints","authors":"Huaiwen He, Hong Shen","doi":"10.1007/978-981-13-5907-1_3","DOIUrl":"https://doi.org/10.1007/978-981-13-5907-1_3","url":null,"abstract":"","PeriodicalId":110399,"journal":{"name":"International Conference on Parallel and Distributed Computing: Applications and Technologies","volume":"213 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116999264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Privacy Preserving Classification Based on Perturbation for Network Traffic","authors":"Yue Lu, Hui Tian, Hong Shen, Dongdong Xu","doi":"10.1007/978-981-13-5907-1_13","DOIUrl":"https://doi.org/10.1007/978-981-13-5907-1_13","url":null,"abstract":"","PeriodicalId":110399,"journal":{"name":"International Conference on Parallel and Distributed Computing: Applications and Technologies","volume":"51 14","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114021616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Security Vulnerability Analysis of Wi-Fi Connection Hijacking on the Linux-Based Robot Operating System for Drone Systems","authors":"Jinyeong Kang, I. Joe","doi":"10.1007/978-981-13-5907-1_49","DOIUrl":"https://doi.org/10.1007/978-981-13-5907-1_49","url":null,"abstract":"","PeriodicalId":110399,"journal":{"name":"International Conference on Parallel and Distributed Computing: Applications and Technologies","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129382488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Parallelization of the DIANA Algorithm in OpenMP","authors":"Hethini Ribeiro, R. S. Ulson, A. Manacero, R. S. Lobato","doi":"10.1007/978-981-13-5907-1_18","DOIUrl":"https://doi.org/10.1007/978-981-13-5907-1_18","url":null,"abstract":"","PeriodicalId":110399,"journal":{"name":"International Conference on Parallel and Distributed Computing: Applications and Technologies","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127630404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Memory Contention Aware Power Management for High Performance GPUs","authors":"H. Choi, D. Son, C. Kim","doi":"10.1007/978-981-13-5907-1_23","DOIUrl":"https://doi.org/10.1007/978-981-13-5907-1_23","url":null,"abstract":"","PeriodicalId":110399,"journal":{"name":"International Conference on Parallel and Distributed Computing: Applications and Technologies","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131885360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}