S. Turek, Dominik Göddeke, S. Buijssen, Hilmar Wobker
{"title":"Hardware-Oriented Multigrid Finite Element Solvers on GPU-Accelerated Clusters","authors":"S. Turek, Dominik Göddeke, S. Buijssen, Hilmar Wobker","doi":"10.1201/B10376-17","DOIUrl":"https://doi.org/10.1201/B10376-17","url":null,"abstract":"The accurate simulation of real-world phenomena in computational science is often based on an underlying mathematical model comprising a system of partial differential equations (PDEs). Important research fields that we pursue in this setting are computational solid mechanics and computational fluid dynamics (CSM and CFD, see Section 3). Practical applications range from material failure tests, such as crash tests in the automotive industry, to fluid and gas flow of any kind, for instance in chemical or medical engineering (e.g., simulation of blood flow in the human body to predict aneurysms) or flow around cars and aircraft to minimize drag and lift forces. Moreover, the coupling of both models is essential for fluid-structure interaction (FSI) settings, which represent problem fields of very high technological importance. Such configurations include polymer processing or microfluidic problems exhibiting very complex multiscale behavior due to nonlinear rheological or non-isothermal constitutive laws, and also due to self-induced oscillations of the structural parts in the flow field. In all these cases, the fluid part is mostly laminar, but highly viscous.","PeriodicalId":411793,"journal":{"name":"Scientific Computing with Multicore and Accelerators","volume":"13 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125637218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Sengupta, Mark J. Harris, M. Garland, John Douglas Owens
{"title":"Efficient Parallel Scan Algorithms for Manycore GPUs","authors":"S. Sengupta, Mark J. Harris, M. Garland, John Douglas Owens","doi":"10.1201/B10376-29","DOIUrl":"https://doi.org/10.1201/B10376-29","url":null,"abstract":"Author(s): Sengupta, Shubhabrata; Harris, Mark; Garland, Michael; Owens, John D. | Editor(s): Kurzak, Jakub; Bader, David A.; Dongarra, Jack","PeriodicalId":411793,"journal":{"name":"Scientific Computing with Multicore and Accelerators","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133954773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Mixed-Precision GPU-Multigrid Solvers with Strong Smoothers","authors":"Dominik Göddeke, R. Strzodka","doi":"10.1201/B10376-11","DOIUrl":"https://doi.org/10.1201/B10376-11","url":null,"abstract":"• Sparse iterative linear solvers are the most important building block in (implicit) schemes for PDE problems • In FD, FV and FE discretisations • Lots of research on GPUs so far for Krylov subspace methods, ADI approaches and multigrid • But: Limited to simple preconditioners and smoothing operators • Numerically strong smoothers exhibit inherently sequential data dependencies (impossible to parallelise?) • Strong smoothers required in practice: Anisotropies (mesh, operator), localised nonlinearities from the PDEs etc. increase ill-conditioning of the systems drastically • Multigrid is asymptotically optimal, all other iterative schemes suffer from h-dependencies • In our context: Multigrid = geometric multigrid","PeriodicalId":411793,"journal":{"name":"Scientific Computing with Multicore and Accelerators","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129645152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Stone, David J. Hardy, B. Isralewitz, K. Schulten
{"title":"GPU Algorithms for Molecular Modeling","authors":"J. Stone, David J. Hardy, B. Isralewitz, K. Schulten","doi":"10.1201/B10376-32","DOIUrl":"https://doi.org/10.1201/B10376-32","url":null,"abstract":"","PeriodicalId":411793,"journal":{"name":"Scientific Computing with Multicore and Accelerators","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124606513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Cecilia González-Alvarez, Harald Servat, Daniel Cabrera-Benitez, Xavier Aguilar, Carles Pons, J. Fernández-Recio, Daniel Jiménez-González
{"title":"Drug Design on the Cell BE","authors":"Cecilia González-Alvarez, Harald Servat, Daniel Cabrera-Benitez, Xavier Aguilar, Carles Pons, J. Fernández-Recio, Daniel Jiménez-González","doi":"10.1201/B10376-24","DOIUrl":"https://doi.org/10.1201/B10376-24","url":null,"abstract":"","PeriodicalId":411793,"journal":{"name":"Scientific Computing with Multicore and Accelerators","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127264631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ümit V. Çatalyürek, Renato Ferreira, Timothy D. R. Hartley, George Teodoro, R. S. Oliveira
{"title":"Data Flow Frameworks for Emerging Heterogeneous Architectures and Their Application to Biomedicine","authors":"Ümit V. Çatalyürek, Renato Ferreira, Timothy D. R. Hartley, George Teodoro, R. S. Oliveira","doi":"10.1201/b10376-27","DOIUrl":"https://doi.org/10.1201/b10376-27","url":null,"abstract":"","PeriodicalId":411793,"journal":{"name":"Scientific Computing with Multicore and Accelerators","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128932051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
David A. Bader, Virat Agarwal, Kamesh Madduri, F. Petrini
{"title":"Combinatorial Algorithm Design on the Cell/B.E. Processor","authors":"David A. Bader, Virat Agarwal, Kamesh Madduri, F. Petrini","doi":"10.1201/b10376-16","DOIUrl":"https://doi.org/10.1201/b10376-16","url":null,"abstract":"","PeriodicalId":411793,"journal":{"name":"Scientific Computing with Multicore and Accelerators","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124231573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Pairwise Computations on the Cell Processor","authors":"Abhinav Sarje, J. Zola, S. Aluru","doi":"10.1201/b10376-22","DOIUrl":"https://doi.org/10.1201/b10376-22","url":null,"abstract":"","PeriodicalId":411793,"journal":{"name":"Scientific Computing with Multicore and Accelerators","volume":"112 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124151835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Implementing FFTs on Multicore Architectures","authors":"A. Chow, G. Fossum, Daniel A. Brokenshire","doi":"10.1201/b10376-14","DOIUrl":"https://doi.org/10.1201/b10376-14","url":null,"abstract":"","PeriodicalId":411793,"journal":{"name":"Scientific Computing with Multicore and Accelerators","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123825640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}