高性能计算技术最新文献

筛选
英文 中文
Session details: 1 会话详情:1
高性能计算技术 Pub Date : 2011-11-13 DOI: 10.1145/3256283
A. Herdman
{"title":"Session details: 1","authors":"A. Herdman","doi":"10.1145/3256283","DOIUrl":"https://doi.org/10.1145/3256283","url":null,"abstract":"","PeriodicalId":59014,"journal":{"name":"高性能计算技术","volume":"8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2011-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75525424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Session details: 4 会话详情:4
高性能计算技术 Pub Date : 2011-11-13 DOI: 10.1145/3256286
Todd Gamblin
{"title":"Session details: 4","authors":"Todd Gamblin","doi":"10.1145/3256286","DOIUrl":"https://doi.org/10.1145/3256286","url":null,"abstract":"","PeriodicalId":59014,"journal":{"name":"高性能计算技术","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2011-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76904389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Session details: 3 会话详情:3
高性能计算技术 Pub Date : 2011-11-13 DOI: 10.1145/3256285
I. Miller
{"title":"Session details: 3","authors":"I. Miller","doi":"10.1145/3256285","DOIUrl":"https://doi.org/10.1145/3256285","url":null,"abstract":"","PeriodicalId":59014,"journal":{"name":"高性能计算技术","volume":"35 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2011-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74442417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating resilient high performance reconfigurable computing with minimally-invasive system monitoring 研究具有最小侵入性系统监控的弹性高性能可重构计算
高性能计算技术 Pub Date : 2010-12-17 DOI: 10.1109/HPRCTA.2010.5670795
Bin Huang, A. Schmidt, Ashwin A. Mendon, R. Sass
{"title":"Investigating resilient high performance reconfigurable computing with minimally-invasive system monitoring","authors":"Bin Huang, A. Schmidt, Ashwin A. Mendon, R. Sass","doi":"10.1109/HPRCTA.2010.5670795","DOIUrl":"https://doi.org/10.1109/HPRCTA.2010.5670795","url":null,"abstract":"As researchers push for Exascale computing, one of the emerging challenges is system resilience. Unlike fault-tolerance which corrects errors, recent reports suggest that resilient systems will need to continue to make progress on an application despite faults. A first step in developing a resilient system is to have robust, scalable system monitoring. The work described here presents a novel, minimally-invasive system monitor that operates over a separate network. We analytically characterize the performance for an arbitrary set of nodes and demonstrate a working implementation of the design. We argue that the hardware approach is inherently superior to the ad hoc, software techniques currently employed in practice.","PeriodicalId":59014,"journal":{"name":"高性能计算技术","volume":"102 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2010-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79431484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A parallel hardware architecture for information-theoretic adaptive filtering 一种用于信息理论自适应滤波的并行硬件结构
高性能计算技术 Pub Date : 2010-12-17 DOI: 10.1109/HPRCTA.2010.5670798
S. Craciun, A. George, H. Lam, J. Príncipe
{"title":"A parallel hardware architecture for information-theoretic adaptive filtering","authors":"S. Craciun, A. George, H. Lam, J. Príncipe","doi":"10.1109/HPRCTA.2010.5670798","DOIUrl":"https://doi.org/10.1109/HPRCTA.2010.5670798","url":null,"abstract":"Information-theoretic cost functions such as minimization of the error entropy (MEE) can extract more structure from the error signal, yielding better results in many realistic problems. However, adaptive filters (AFs) using MEE methods are more computationally intensive when compared to conventional, mean-squared error (MSE) methods employed in the well-known, least mean squares (LMS) algorithm. This paper presents a novel, parallel hardware architecture for MEE adaptive filtering. The design has been implemented and evaluated in realtime on one of the servers of the Novo-G machine in the NSF CHREC Center at the University of Florida, believed to be the most powerful reconfigurable supercomputer in academia. By pipelining the design and parallelizing independent computations within the algorithm, our proposed hardware architecture successfully achieves a speedup of 5800 on one FPGA, 23200 on one quad-FPGA board, and 46400 on two quad-FPGA boards, as compared to the same algorithm running in software (optimized C program) on a single CPU core. Just as important, our results show that this reconfigurable design does not lose precision while converging to the optimum solution in the same number of steps as the software version. As a result, our approach makes it possible for AFs using the MEE cost function to adapt in real-time for signals that require a sampling rate in excess of 400 kHz and thus can target a much wider range of applications.","PeriodicalId":59014,"journal":{"name":"高性能计算技术","volume":"15 1","pages":"1-10"},"PeriodicalIF":0.0,"publicationDate":"2010-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87510297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Towards production FPGA-accelerated molecular dynamics: Progress and challenges 迈向生产fpga加速分子动力学:进展与挑战
高性能计算技术 Pub Date : 2010-12-17 DOI: 10.1109/HPRCTA.2010.5670800
Matt Chiu, M. Herbordt
{"title":"Towards production FPGA-accelerated molecular dynamics: Progress and challenges","authors":"Matt Chiu, M. Herbordt","doi":"10.1109/HPRCTA.2010.5670800","DOIUrl":"https://doi.org/10.1109/HPRCTA.2010.5670800","url":null,"abstract":"Recent work in the FPGA acceleration of molecular dynamics simulation has shown that including on-the-fly neighbor list calculation (particle filtering) in the device has the potential for an 80× per core speed-up over the CPU-based reference code and so to make the approach competitive with other computing technologies. In this paper we report on progress and challenges in advancing this work towards the creation of a production system, especially one capable of running on a large-scale system such as the Novo-G. The current version consists of an FPGA-accelerated NAMD-lite running on a PC with a Gidel PROCStar III. The most important implementation issues include software integration, handling exclusion, and modifying the force pipeline. In the last of these we have added support for Particle-Mesh-Ewald and augmented the Lennard-Jones calculation with a switching function. In experiments, we find that energy stability so far appears to be acceptable, but that longer simulations are needed. Due primarily to the added complexity of the force pipelines, performance is somewhat diminished from the previous study; we find, however, that porting to a newer (existing) device will more than compensate for this loss.","PeriodicalId":59014,"journal":{"name":"高性能计算技术","volume":"13 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2010-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84957481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Optimization and performance study of large-scale biological networks for reconfigurable computing 面向可重构计算的大规模生物网络优化与性能研究
高性能计算技术 Pub Date : 2010-12-17 DOI: 10.1109/HPRCTA.2010.5670796
M. Bhuiyan, Ananth Nallamuthu, M. C. Smith, V. Pallipuram
{"title":"Optimization and performance study of large-scale biological networks for reconfigurable computing","authors":"M. Bhuiyan, Ananth Nallamuthu, M. C. Smith, V. Pallipuram","doi":"10.1109/HPRCTA.2010.5670796","DOIUrl":"https://doi.org/10.1109/HPRCTA.2010.5670796","url":null,"abstract":"Field-programmable gate arrays (FPGAs) can provide an efficient programmable resource for implementing hardware-based spiking neural networks (SNN). In this paper we present a hardware-software design that makes it possible to simulate large-scale (2 million neurons) biologically plausible SNNs on an FPGA-based system. We have chosen three SNN models from the various models available in the literature, the Hodgkin-Huxley (HH), Wilson and Izhikevich models, for implementation on the SRC 7 H MAP FPGA-based system. The models have various computation and communication requirements making them good candidates for a performance and optimization study of SNNs on an FPGA-based system. Significant acceleration of the SNN models using the FPGA is achieved: 38x for the HH model. This paper also provides insights into the factors affecting the speedup achieved such as FLOP:Byte ratio of the application, the problem size, and the optimization techniques available.","PeriodicalId":59014,"journal":{"name":"高性能计算技术","volume":"77 1","pages":"1-9"},"PeriodicalIF":0.0,"publicationDate":"2010-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83968160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Comparative analysis of HPC and accelerator devices: Computation, memory, I/O, and power HPC和加速器设备的比较分析:计算、内存、I/O和功率
高性能计算技术 Pub Date : 2010-12-17 DOI: 10.1109/HPRCTA.2010.5670797
J. Richardson, Steven Fingulin, Diwakar Raghunathan, Chris Massie, A. George, H. Lam
{"title":"Comparative analysis of HPC and accelerator devices: Computation, memory, I/O, and power","authors":"J. Richardson, Steven Fingulin, Diwakar Raghunathan, Chris Massie, A. George, H. Lam","doi":"10.1109/HPRCTA.2010.5670797","DOIUrl":"https://doi.org/10.1109/HPRCTA.2010.5670797","url":null,"abstract":"The computing market constantly experiences the introduction of new devices, architectures, and enhancements to existing ones. Due to the number and diversity of processor and accelerator devices available, it is important to be able to objectively compare them based upon their capabilities regarding computation, I/O, power, and memory interfacing. This paper presents an extension to our existing suite of metrics to quantify additional characteristics of devices and highlight tradeoffs that exist between architectures and specific products. These metrics are applied to a large group of modern devices to evaluate their computational density, power consumption, I/O bandwidth, internal memory bandwidth, and external memory bandwidth.","PeriodicalId":59014,"journal":{"name":"高性能计算技术","volume":"69 1","pages":"1-10"},"PeriodicalIF":0.0,"publicationDate":"2010-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87117576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
An application of high-performance reconfigurable computing in radio astronomy signal processing 高性能可重构计算在射电天文信号处理中的应用
高性能计算技术 Pub Date : 2010-12-17 DOI: 10.1109/HPRCTA.2010.5670794
J. Ford, J. Ray
{"title":"An application of high-performance reconfigurable computing in radio astronomy signal processing","authors":"J. Ford, J. Ray","doi":"10.1109/HPRCTA.2010.5670794","DOIUrl":"https://doi.org/10.1109/HPRCTA.2010.5670794","url":null,"abstract":"Reconfigurable Computing has been making inroads in the front-end digital signal processing systems deployed at radio telescopes around the world. The National Radio Astronomy Observatory (NRAO) at Green Bank has developed a signal processing system expressly for pulsar search and timing observations. These observations are among the most demanding experiments in terms of real-time computational and data rate requirements. In this paper, we describe the application domain, the challenges in designing a system to meet these demands, and the resulting heterogeneous computing system.","PeriodicalId":59014,"journal":{"name":"高性能计算技术","volume":"41 1","pages":"1-7"},"PeriodicalIF":0.0,"publicationDate":"2010-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89391103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
FPGA-based acceleration of CHARMM-potential minimization 基于fpga的charmm -势能最小化加速
高性能计算技术 Pub Date : 2009-11-15 DOI: 10.1145/1646461.1646462
Bharat Sukhwani, M. Herbordt
{"title":"FPGA-based acceleration of CHARMM-potential minimization","authors":"Bharat Sukhwani, M. Herbordt","doi":"10.1145/1646461.1646462","DOIUrl":"https://doi.org/10.1145/1646461.1646462","url":null,"abstract":"Energy minimization is an important step in molecular modeling, with applications in molecular docking and in mapping binding sites. Minimization involves repeated evaluation of various bonded and non-bonded energies of a protein complex. It is a computationally expensive process, with runtimes typically being many hours on a desktop system. In the current article, we present acceleration of the energy evaluation phase of minimization using Field Programmable Gate Arrays. We project a multiple orders-of-magnitude speed-up over a single CPU core and a factor of 8 speed-up over our previous acceleration using an NVIDIA Tesla 1060 GPU.","PeriodicalId":59014,"journal":{"name":"高性能计算技术","volume":"99 1","pages":"1-10"},"PeriodicalIF":0.0,"publicationDate":"2009-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83125086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信