EDAML 2022 Invited Speaker 3: Scalable ML Architectures for Real-time Energy-efficient Computing

2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) Pub Date : 2022-05-01 DOI:10.1109/IPDPSW55747.2022.00196

R. I. Bahar

{"title":"EDAML 2022 Invited Speaker 3: Scalable ML Architectures for Real-time Energy-efficient Computing","authors":"R. I. Bahar","doi":"10.1109/IPDPSW55747.2022.00196","DOIUrl":null,"url":null,"abstract":"Technological advancements have led to a proliferation machine learning systems to assist humans in a wide range of tasks. However, we are still far from accurate, reliable, and resource-efficient operations for many of these systems. Despite the strengths of convolutional neural networks (CNNs) for object recognition, these discriminative techniques have several shortcomings that leave them vulnerable to exploitation from adversaries. In addition, the computational cost incurred to train these discriminative models can be quite significant. Discriminative-generative approaches offers a promising avenue for robust perception and action. Such methods combine inference by deep learning with sampling and probabilistic inference models to achieve robust and adaptive understanding. In this talk, I will present our work on implementing a scalable, computationally efficient generative inference algorithm in hardware that can achieve real-time results in an energy efficient manner. I will also discuss future directions in designing scalable and efficient ML algorithms in hardware more broadly.","PeriodicalId":286968,"journal":{"name":"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPSW55747.2022.00196","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Technological advancements have led to a proliferation machine learning systems to assist humans in a wide range of tasks. However, we are still far from accurate, reliable, and resource-efficient operations for many of these systems. Despite the strengths of convolutional neural networks (CNNs) for object recognition, these discriminative techniques have several shortcomings that leave them vulnerable to exploitation from adversaries. In addition, the computational cost incurred to train these discriminative models can be quite significant. Discriminative-generative approaches offers a promising avenue for robust perception and action. Such methods combine inference by deep learning with sampling and probabilistic inference models to achieve robust and adaptive understanding. In this talk, I will present our work on implementing a scalable, computationally efficient generative inference algorithm in hardware that can achieve real-time results in an energy efficient manner. I will also discuss future directions in designing scalable and efficient ML algorithms in hardware more broadly.

查看原文本刊更多论文

EDAML 2022特邀演讲3:面向实时节能计算的可扩展机器学习架构

技术进步导致机器学习系统的扩散，以帮助人类完成各种任务。然而，对于许多这样的系统，我们仍然远远不能做到准确、可靠和资源高效的操作。尽管卷积神经网络(cnn)在物体识别方面具有优势，但这些判别技术有几个缺点，使它们容易被对手利用。此外，训练这些判别模型的计算成本可能相当可观。判别生成方法为稳健的感知和行动提供了一条有希望的途径。这些方法将深度学习推理与抽样和概率推理模型相结合，以实现鲁棒性和自适应理解。在这次演讲中，我将介绍我们在硬件上实现可扩展，计算效率高的生成推理算法的工作，该算法可以以节能的方式实现实时结果。我还将讨论在更广泛的硬件中设计可扩展和高效的ML算法的未来方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

自引率

0.00%

发文量