Runtime Behavior Comparison of Modern Accelerators and Coprocessors
Ayman Tarakji, Niels Ole Salscheider
2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014-05-19
DOI: 10.1109/IPDPSW.2014.16 (https://doi.org/10.1109/IPDPSW.2014.16)
A variety of accelerator architectures have recently become available in the field of high-performance computing. Intel's MIC (Many Integrated Core) and two GPU architectures, NVIDIA's Kepler and AMD's Graphics Core Next, represent the latest innovations in general-purpose computing accelerators. This paper explores several important characteristics of these architectures and investigates the impact of certain design factors on achieved performance using the uCLbench micro-benchmarks, the NPB (NAS Parallel Benchmarks) suite, and diverse real-world applications from the field of physics. Using OpenCL as a single unified programming interface, we observe the run-time behavior of each test program on several test platforms. Major architectural discrepancies are studied, and a higher-level examination is discussed in detail.