Hasindu Gamaarachchi, Mohamed Fawsan, F. Fasna, D. Elkaduwe
{"title":"用户友好的GPGPU编程界面","authors":"Hasindu Gamaarachchi, Mohamed Fawsan, F. Fasna, D. Elkaduwe","doi":"10.1109/NCTM.2017.7872835","DOIUrl":null,"url":null,"abstract":"Compute Unified Device Architecture (CUDA) is an attractive alternative for our ever growing need for high performance computing. However to extract the full potential of CUDA one should, at the least be familiar with the programming model and should have a fair understanding of the memory and the cache architecture. Yet most of the domain experts from domains that warrant high performance computing are ill trained to develop efficient CUDA programs that would extract the necessary performance. In this paper we argue that this gap can be bridged by exposing the CUDA architecture as an API for manipulating matrices. We observe that many of the high demanding scientific computations can be expressed as matrix manipulations, where the need for high performance stems for the size of the matrix. We present a Software as a Service (SaaS) solution to bridge this gap where a domain specialist uploads the data as matrices and specify the operations as an equation involving the uploaded matrices via web GUI. Then the back end will process the request using CUDA and return the results via the GUI. The CUDA code for handling matrix operations are highly optimized and the domain specialist can simply use them without knowing the underlying intricate details.","PeriodicalId":343372,"journal":{"name":"2017 6th National Conference on Technology and Management (NCTM)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"User-friendly interface for GPGPU programming\",\"authors\":\"Hasindu Gamaarachchi, Mohamed Fawsan, F. Fasna, D. Elkaduwe\",\"doi\":\"10.1109/NCTM.2017.7872835\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Compute Unified Device Architecture (CUDA) is an attractive alternative for our ever growing need for high performance computing. However to extract the full potential of CUDA one should, at the least be familiar with the programming model and should have a fair understanding of the memory and the cache architecture. Yet most of the domain experts from domains that warrant high performance computing are ill trained to develop efficient CUDA programs that would extract the necessary performance. In this paper we argue that this gap can be bridged by exposing the CUDA architecture as an API for manipulating matrices. We observe that many of the high demanding scientific computations can be expressed as matrix manipulations, where the need for high performance stems for the size of the matrix. We present a Software as a Service (SaaS) solution to bridge this gap where a domain specialist uploads the data as matrices and specify the operations as an equation involving the uploaded matrices via web GUI. Then the back end will process the request using CUDA and return the results via the GUI. The CUDA code for handling matrix operations are highly optimized and the domain specialist can simply use them without knowing the underlying intricate details.\",\"PeriodicalId\":343372,\"journal\":{\"name\":\"2017 6th National Conference on Technology and Management (NCTM)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 6th National Conference on Technology and Management (NCTM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCTM.2017.7872835\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 6th National Conference on Technology and Management (NCTM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCTM.2017.7872835","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Compute Unified Device Architecture (CUDA) is an attractive alternative for our ever growing need for high performance computing. However to extract the full potential of CUDA one should, at the least be familiar with the programming model and should have a fair understanding of the memory and the cache architecture. Yet most of the domain experts from domains that warrant high performance computing are ill trained to develop efficient CUDA programs that would extract the necessary performance. In this paper we argue that this gap can be bridged by exposing the CUDA architecture as an API for manipulating matrices. We observe that many of the high demanding scientific computations can be expressed as matrix manipulations, where the need for high performance stems for the size of the matrix. We present a Software as a Service (SaaS) solution to bridge this gap where a domain specialist uploads the data as matrices and specify the operations as an equation involving the uploaded matrices via web GUI. Then the back end will process the request using CUDA and return the results via the GUI. The CUDA code for handling matrix operations are highly optimized and the domain specialist can simply use them without knowing the underlying intricate details.