用于模拟内存中基于计算的加速器的深度学习软件堆栈

Nature Reviews Electrical Engineering Pub Date : 2025-07-02 DOI:10.1038/s44287-025-00187-1

Corey Lammie, Hadjer Benmeziane, William Simon, Elena Ferro, Athanasios Vasilopoulos, Julian Büchel, Manuel Le Gallo, Irem Boybat, Abu Sebastian

{"title":"用于模拟内存中基于计算的加速器的深度学习软件堆栈","authors":"Corey Lammie, Hadjer Benmeziane, William Simon, Elena Ferro, Athanasios Vasilopoulos, Julian Büchel, Manuel Le Gallo, Irem Boybat, Abu Sebastian","doi":"10.1038/s44287-025-00187-1","DOIUrl":null,"url":null,"abstract":"Analogue in-memory computing (AIMC) is an emerging computational paradigm that can efficiently accelerate the key operations in deep learning (DL) inference workloads. Heterogeneous architectures, which integrate both AIMC tiles and digital processing units, have been proposed to enable the end-to-end execution of various deep neural network models. However, developing a software stack for these architectures is challenging, owing to their distinct characteristics — such as the need for extensive or complete weight stationarity and pipelined execution across layers, if maximum performance is to be achieved. Moreover, AIMC tiles are inherently stochastic and hence introduce a combination of stochastic and deterministic noise, which adversely affects accuracy. As a result, existing tools for software stack development are not directly applicable. In this Perspective, we give an overview of the key attributes of DL software stacks and AIMC-based accelerators, outline the challenges associated with designing DL software stacks for AIMC-based accelerators and present opportunities for future research. Analogue in-memory computing (AIMC), with digital processing, forms a useful architecture for performant end-to-end execution of deep neural network models, but requires the development of sophisticated software stacks. This Perspective outlines the challenges in designing deep learning software stacks for AIMC-based accelerators, and suggests directions for future research.","PeriodicalId":501701,"journal":{"name":"Nature Reviews Electrical Engineering","volume":"2 9","pages":"621-633"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep learning software stacks for analogue in-memory computing-based accelerators\",\"authors\":\"Corey Lammie, Hadjer Benmeziane, William Simon, Elena Ferro, Athanasios Vasilopoulos, Julian Büchel, Manuel Le Gallo, Irem Boybat, Abu Sebastian\",\"doi\":\"10.1038/s44287-025-00187-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Analogue in-memory computing (AIMC) is an emerging computational paradigm that can efficiently accelerate the key operations in deep learning (DL) inference workloads. Heterogeneous architectures, which integrate both AIMC tiles and digital processing units, have been proposed to enable the end-to-end execution of various deep neural network models. However, developing a software stack for these architectures is challenging, owing to their distinct characteristics — such as the need for extensive or complete weight stationarity and pipelined execution across layers, if maximum performance is to be achieved. Moreover, AIMC tiles are inherently stochastic and hence introduce a combination of stochastic and deterministic noise, which adversely affects accuracy. As a result, existing tools for software stack development are not directly applicable. In this Perspective, we give an overview of the key attributes of DL software stacks and AIMC-based accelerators, outline the challenges associated with designing DL software stacks for AIMC-based accelerators and present opportunities for future research. Analogue in-memory computing (AIMC), with digital processing, forms a useful architecture for performant end-to-end execution of deep neural network models, but requires the development of sophisticated software stacks. This Perspective outlines the challenges in designing deep learning software stacks for AIMC-based accelerators, and suggests directions for future research.\",\"PeriodicalId\":501701,\"journal\":{\"name\":\"Nature Reviews Electrical Engineering\",\"volume\":\"2 9\",\"pages\":\"621-633\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nature Reviews Electrical Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.nature.com/articles/s44287-025-00187-1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Reviews Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.nature.com/articles/s44287-025-00187-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

模拟内存计算（AIMC）是一种新兴的计算范式，可以有效地加速深度学习（DL）推理工作负载中的关键操作。异构架构，集成了AIMC块和数字处理单元，已被提出，以实现各种深度神经网络模型的端到端执行。然而，为这些架构开发软件堆栈是具有挑战性的，因为它们具有不同的特征——例如，如果要实现最大性能，需要广泛或完全的权重平稳性和跨层的流水线执行。此外，AIMC瓷砖本身是随机的，因此引入了随机和确定性噪声的组合，这对准确性产生不利影响。因此，现有的软件堆栈开发工具不能直接应用。在本展望中，我们概述了深度学习软件栈和基于aimc的加速器的关键属性，概述了为基于aimc的加速器设计深度学习软件栈所面临的挑战，并提出了未来研究的机会。模拟内存计算（AIMC）与数字处理形成了一种有用的架构，用于深度神经网络模型的高性能端到端执行，但需要开发复杂的软件堆栈。本展望概述了为基于aimc的加速器设计深度学习软件栈所面临的挑战，并提出了未来的研究方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Deep learning software stacks for analogue in-memory computing-based accelerators

查看原文本刊更多论文

Deep learning software stacks for analogue in-memory computing-based accelerators

Analogue in-memory computing (AIMC) is an emerging computational paradigm that can efficiently accelerate the key operations in deep learning (DL) inference workloads. Heterogeneous architectures, which integrate both AIMC tiles and digital processing units, have been proposed to enable the end-to-end execution of various deep neural network models. However, developing a software stack for these architectures is challenging, owing to their distinct characteristics — such as the need for extensive or complete weight stationarity and pipelined execution across layers, if maximum performance is to be achieved. Moreover, AIMC tiles are inherently stochastic and hence introduce a combination of stochastic and deterministic noise, which adversely affects accuracy. As a result, existing tools for software stack development are not directly applicable. In this Perspective, we give an overview of the key attributes of DL software stacks and AIMC-based accelerators, outline the challenges associated with designing DL software stacks for AIMC-based accelerators and present opportunities for future research. Analogue in-memory computing (AIMC), with digital processing, forms a useful architecture for performant end-to-end execution of deep neural network models, but requires the development of sophisticated software stacks. This Perspective outlines the challenges in designing deep learning software stacks for AIMC-based accelerators, and suggests directions for future research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Nature Reviews Electrical Engineering

自引率

0.00%

发文量