Nihar Athreyas, Wenhao Song, B. Perot, Q. Xia, Abbie Mathew, J. Gupta, D. Gupta, J. Yang
{"title":"Memristor-CMOS Analog Co-Processor for Acceleration of High Performance Computing Applications","authors":"Nihar Athreyas, Wenhao Song, B. Perot, Q. Xia, Abbie Mathew, J. Gupta, D. Gupta, J. Yang","doi":"10.1145/3269985","DOIUrl":null,"url":null,"abstract":"Vector matrix multiplication computation underlies major applications in machine vision, deep learning and scientific simulation. These applications require high computational speed and are run on platforms that are size, weight and power constrained. With the transistor scaling coming to an end, existing digital hardware architectures will not be able to meet this increasing demand. Analog computation with its rich set of primitives and inherent parallel architecture can be faster, more efficient and compact for some of these applications. One such primitive is a memristor-CMOS crossbar array based vector matrix multiplication. In this paper, we develop a memristor-CMOS analog co-processor architecture that can handle floating point computation. The crossbar array is based on a $\\mathbf{TaO}_{\\mathbf{x}}$ memristor device. To demonstrate the working of the analog co-processor at a system level, we use a new tool developed by Cadence and Mathworks called PSpice Systems Option which performs integrated co-simulation of MATLAB/Simulink and PSpice. It is shown that the analog co-processor has a superior performance when compared to other processors. Using the new PSpice Systems Option tool, various application simulations for image processing and solution to partial differential equations are performed on the analog coprocessor model.","PeriodicalId":344188,"journal":{"name":"2018 International Conference on Current Trends towards Converging Technologies (ICCTCT)","volume":"212 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Current Trends towards Converging Technologies (ICCTCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3269985","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Vector matrix multiplication computation underlies major applications in machine vision, deep learning and scientific simulation. These applications require high computational speed and are run on platforms that are size, weight and power constrained. With the transistor scaling coming to an end, existing digital hardware architectures will not be able to meet this increasing demand. Analog computation with its rich set of primitives and inherent parallel architecture can be faster, more efficient and compact for some of these applications. One such primitive is a memristor-CMOS crossbar array based vector matrix multiplication. In this paper, we develop a memristor-CMOS analog co-processor architecture that can handle floating point computation. The crossbar array is based on a $\mathbf{TaO}_{\mathbf{x}}$ memristor device. To demonstrate the working of the analog co-processor at a system level, we use a new tool developed by Cadence and Mathworks called PSpice Systems Option which performs integrated co-simulation of MATLAB/Simulink and PSpice. It is shown that the analog co-processor has a superior performance when compared to other processors. Using the new PSpice Systems Option tool, various application simulations for image processing and solution to partial differential equations are performed on the analog coprocessor model.
向量矩阵乘法计算是机器视觉、深度学习和科学模拟等主要应用的基础。这些应用程序需要高计算速度,并且运行在尺寸、重量和功率受限的平台上。随着晶体管规模的终结,现有的数字硬件架构将无法满足这一日益增长的需求。模拟计算具有丰富的原语集和固有的并行架构,可以更快,更高效和紧凑的一些应用程序。其中一种基元是基于矢量矩阵乘法的忆阻器- cmos交叉棒阵列。在本文中,我们开发了一种可以处理浮点运算的忆阻器- cmos模拟协处理器架构。交叉条数组基于$\mathbf{TaO}_{\mathbf{x}}$忆阻器器件。为了演示模拟协处理器在系统级的工作,我们使用了由Cadence和Mathworks开发的名为PSpice Systems Option的新工具,该工具执行MATLAB/Simulink和PSpice的集成联合仿真。实验结果表明,与其他处理器相比,该模拟协处理器具有优越的性能。使用新的PSpice系统选项工具,在模拟协处理器模型上进行了图像处理和偏微分方程求解的各种应用模拟。