Reinforcement Learning for Placement Optimization

Proceedings of the 2021 International Symposium on Physical Design Pub Date : 2021-03-22 DOI:10.1145/3439706.3446883

Anna Goldie, Azalia Mirhoseini

{"title":"Reinforcement Learning for Placement Optimization","authors":"Anna Goldie, Azalia Mirhoseini","doi":"10.1145/3439706.3446883","DOIUrl":null,"url":null,"abstract":"In the past decade, computer systems and chips have played a key role in the success of artificial intelligence (AI). Our vision in Google Brain's Machine Learning for Systems team is to use AI to transform the way in which computer systems and chips are designed. Many core problems in systems and hardware design are combinatorial optimization or decision making tasks with state and action spaces that are orders of magnitude larger than that of standard AI benchmarks in robotics and games. In this talk, we will describe some of our latest learning based approaches to tackling such large-scale optimization problems. We will discuss our work on a new domain-transferable reinforcement learning (RL) method for optimizing chip placement [1], a long pole in hardware design. Our approach is capable of learning from past experience and improving over time, resulting in more optimized placements on unseen chip blocks as the RL agent is exposed to a larger volume of data. Our objective is to minimize power, performance, and area. We show that, in under six hours, our method can generate placements that are superhuman or comparable on modern accelerator chips, whereas existing baselines require human experts in the loop and can take several weeks.","PeriodicalId":184050,"journal":{"name":"Proceedings of the 2021 International Symposium on Physical Design","volume":"184 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 International Symposium on Physical Design","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3439706.3446883","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

In the past decade, computer systems and chips have played a key role in the success of artificial intelligence (AI). Our vision in Google Brain's Machine Learning for Systems team is to use AI to transform the way in which computer systems and chips are designed. Many core problems in systems and hardware design are combinatorial optimization or decision making tasks with state and action spaces that are orders of magnitude larger than that of standard AI benchmarks in robotics and games. In this talk, we will describe some of our latest learning based approaches to tackling such large-scale optimization problems. We will discuss our work on a new domain-transferable reinforcement learning (RL) method for optimizing chip placement [1], a long pole in hardware design. Our approach is capable of learning from past experience and improving over time, resulting in more optimized placements on unseen chip blocks as the RL agent is exposed to a larger volume of data. Our objective is to minimize power, performance, and area. We show that, in under six hours, our method can generate placements that are superhuman or comparable on modern accelerator chips, whereas existing baselines require human experts in the loop and can take several weeks.

查看原文本刊更多论文

用于布局优化的强化学习

在过去的十年中，计算机系统和芯片在人工智能(AI)的成功中发挥了关键作用。在Google Brain的机器学习系统团队中，我们的愿景是使用人工智能来改变计算机系统和芯片的设计方式。系统和硬件设计中的许多核心问题是组合优化或具有状态和行动空间的决策任务，这些任务比机器人和游戏中的标准AI基准要大几个数量级。在这次演讲中，我们将介绍一些最新的基于学习的方法来解决这种大规模的优化问题。我们将讨论我们在优化芯片放置的新领域可转移强化学习(RL)方法上的工作[1]，这是硬件设计中的一个重要方面。我们的方法能够从过去的经验中学习并随着时间的推移而改进，当RL代理暴露于更大的数据量时，可以在未见过的芯片块上进行更优化的放置。我们的目标是最小化功率、性能和面积。我们证明，在不到6小时的时间里，我们的方法可以在现代加速器芯片上生成超人或可比的位置，而现有的基线需要人类专家参与，可能需要几周的时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2021 International Symposium on Physical Design

自引率

0.00%

发文量