{"title":"SambaNova SN10 RDU:一种加速软件2.0的7nm数据流架构","authors":"R. Prabhakar, Sumti Jairath, Jinuk Luke Shin","doi":"10.1109/ISSCC42614.2022.9731612","DOIUrl":null,"url":null,"abstract":"The availability of large amounts of data and advances in modern machine-learning algorithms are pushing the limits of computing systems and redefining the way software is written. This new software paradigm, termed “Software 2.0”, is a departure from deterministic computing - centered around exact specifications and expert-created algorithms to accomplish a task - to probabilistic computing, centered around methods that learn to accomplish the same task using several examples. Software 2.0 contains a graph of operations that is rich in data locality and has abundant data, task, and hierarchical pipeline parallelism. Consequently, Software 2.0 can be accelerated by building custom dataflow pipelines. However, conventional GPU systems provide limited flexibility to build such dataflow pipelines. As a result, they suffer from poor device utilization and require a high-bandwidth off-chip memory system, which results in lower memory capacity. Memory capacity limitations impose serious challenges for increasingly larger models and data sets common in the fields of Natural Language Processing (NLP), high-resolution computer vision, and large recommender systems. SambaNova Systems Cardinal SN10 is a Reconfigurable Dataflow Unit (RDU) that enables accelerating Software 2.0 with the flexibility to build custom dataflow pipelines as well as large memory capacity to run big models efficiently [1].","PeriodicalId":6830,"journal":{"name":"2022 IEEE International Solid- State Circuits Conference (ISSCC)","volume":"40 1","pages":"350-352"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"SambaNova SN10 RDU: A 7nm Dataflow Architecture to Accelerate Software 2.0\",\"authors\":\"R. Prabhakar, Sumti Jairath, Jinuk Luke Shin\",\"doi\":\"10.1109/ISSCC42614.2022.9731612\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The availability of large amounts of data and advances in modern machine-learning algorithms are pushing the limits of computing systems and redefining the way software is written. This new software paradigm, termed “Software 2.0”, is a departure from deterministic computing - centered around exact specifications and expert-created algorithms to accomplish a task - to probabilistic computing, centered around methods that learn to accomplish the same task using several examples. Software 2.0 contains a graph of operations that is rich in data locality and has abundant data, task, and hierarchical pipeline parallelism. Consequently, Software 2.0 can be accelerated by building custom dataflow pipelines. However, conventional GPU systems provide limited flexibility to build such dataflow pipelines. As a result, they suffer from poor device utilization and require a high-bandwidth off-chip memory system, which results in lower memory capacity. Memory capacity limitations impose serious challenges for increasingly larger models and data sets common in the fields of Natural Language Processing (NLP), high-resolution computer vision, and large recommender systems. SambaNova Systems Cardinal SN10 is a Reconfigurable Dataflow Unit (RDU) that enables accelerating Software 2.0 with the flexibility to build custom dataflow pipelines as well as large memory capacity to run big models efficiently [1].\",\"PeriodicalId\":6830,\"journal\":{\"name\":\"2022 IEEE International Solid- State Circuits Conference (ISSCC)\",\"volume\":\"40 1\",\"pages\":\"350-352\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Solid- State Circuits Conference (ISSCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISSCC42614.2022.9731612\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Solid- State Circuits Conference (ISSCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSCC42614.2022.9731612","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
摘要
大量数据的可用性和现代机器学习算法的进步正在推动计算系统的极限,并重新定义软件的编写方式。这种新的软件范式被称为“软件2.0”,它从确定性计算(以精确的规范和专家创建的算法为中心来完成任务)转向概率计算(以学习使用几个示例来完成相同任务的方法为中心)。软件2.0包含一个操作图,它具有丰富的数据局部性,并具有丰富的数据、任务和分层管道并行性。因此,可以通过构建定制的数据流管道来加速软件2.0。然而,传统的GPU系统提供有限的灵活性来构建这样的数据流管道。因此,它们的设备利用率很低,并且需要高带宽的片外存储系统,这导致内存容量较低。在自然语言处理(NLP)、高分辨率计算机视觉和大型推荐系统等领域中,内存容量的限制给越来越大的模型和数据集带来了严峻的挑战。SambaNova Systems Cardinal SN10是一款可重构数据流单元(RDU),它可以加速Software 2.0,灵活地构建自定义数据流管道,以及大容量内存,从而有效地运行大型模型。
SambaNova SN10 RDU: A 7nm Dataflow Architecture to Accelerate Software 2.0
The availability of large amounts of data and advances in modern machine-learning algorithms are pushing the limits of computing systems and redefining the way software is written. This new software paradigm, termed “Software 2.0”, is a departure from deterministic computing - centered around exact specifications and expert-created algorithms to accomplish a task - to probabilistic computing, centered around methods that learn to accomplish the same task using several examples. Software 2.0 contains a graph of operations that is rich in data locality and has abundant data, task, and hierarchical pipeline parallelism. Consequently, Software 2.0 can be accelerated by building custom dataflow pipelines. However, conventional GPU systems provide limited flexibility to build such dataflow pipelines. As a result, they suffer from poor device utilization and require a high-bandwidth off-chip memory system, which results in lower memory capacity. Memory capacity limitations impose serious challenges for increasingly larger models and data sets common in the fields of Natural Language Processing (NLP), high-resolution computer vision, and large recommender systems. SambaNova Systems Cardinal SN10 is a Reconfigurable Dataflow Unit (RDU) that enables accelerating Software 2.0 with the flexibility to build custom dataflow pipelines as well as large memory capacity to run big models efficiently [1].