{"title":"在详细的GPU模拟器上表征卷积神经网络工作负载","authors":"Kwanghee Chang, Minsik Kim, Kyungah Kim, W. Ro","doi":"10.1109/ISOCC.2017.8368781","DOIUrl":null,"url":null,"abstract":"Recent frameworks on convolutional neural networks (CNNs) such as Caffe and MXNet have focused primarily on being compatible with CUDA software and hardware application. However, it was designed for GPU architecture of compute capability 3.0 and above. Therefore, it needs verification of function to perform GPGPU-Sim which is implemented as NVIDIA compute capability devices 2.x. We developed a framework which can make inferencing AlexNet on GPGPU-Sim. We also analyze the execution results of the GPGPU-Sim. The number of lines in one set of the L1 data cache is sensitive to influence performance of AlexNet inference.","PeriodicalId":248826,"journal":{"name":"2017 International SoC Design Conference (ISOCC)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Characterizing convolutional neural network workloads on a detailed GPU simulator\",\"authors\":\"Kwanghee Chang, Minsik Kim, Kyungah Kim, W. Ro\",\"doi\":\"10.1109/ISOCC.2017.8368781\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent frameworks on convolutional neural networks (CNNs) such as Caffe and MXNet have focused primarily on being compatible with CUDA software and hardware application. However, it was designed for GPU architecture of compute capability 3.0 and above. Therefore, it needs verification of function to perform GPGPU-Sim which is implemented as NVIDIA compute capability devices 2.x. We developed a framework which can make inferencing AlexNet on GPGPU-Sim. We also analyze the execution results of the GPGPU-Sim. The number of lines in one set of the L1 data cache is sensitive to influence performance of AlexNet inference.\",\"PeriodicalId\":248826,\"journal\":{\"name\":\"2017 International SoC Design Conference (ISOCC)\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International SoC Design Conference (ISOCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISOCC.2017.8368781\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International SoC Design Conference (ISOCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISOCC.2017.8368781","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Characterizing convolutional neural network workloads on a detailed GPU simulator
Recent frameworks on convolutional neural networks (CNNs) such as Caffe and MXNet have focused primarily on being compatible with CUDA software and hardware application. However, it was designed for GPU architecture of compute capability 3.0 and above. Therefore, it needs verification of function to perform GPGPU-Sim which is implemented as NVIDIA compute capability devices 2.x. We developed a framework which can make inferencing AlexNet on GPGPU-Sim. We also analyze the execution results of the GPGPU-Sim. The number of lines in one set of the L1 data cache is sensitive to influence performance of AlexNet inference.