{"title":"CGRA4HPC 2022 Invited Speaker: Mapping ML to the AMD/Xilinx AIE-ML architecture","authors":"Elliott Delaye","doi":"10.1109/IPDPSW55747.2022.00109","DOIUrl":null,"url":null,"abstract":"In the field of compute acceleration, machine learning model acceleration is one of the fastest growing areas of focus. ML model complexity in both compute and memory have driven the latest accelerator architectures and with that, developing ways to efficiently use these new architectures is the key to unlocking their potential. At AMD the AIE-ML architecture is our second generation AI-Engine architecture and we will dive into some of the ways we map the most important ML compute/bandwidth requirements to this architecture.","PeriodicalId":286968,"journal":{"name":"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPDPSW55747.2022.00109","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In the field of compute acceleration, machine learning model acceleration is one of the fastest growing areas of focus. ML model complexity in both compute and memory have driven the latest accelerator architectures and with that, developing ways to efficiently use these new architectures is the key to unlocking their potential. At AMD the AIE-ML architecture is our second generation AI-Engine architecture and we will dive into some of the ways we map the most important ML compute/bandwidth requirements to this architecture.