Fast Semi-Iterative Finite Element Poisson Solvers for Tensor Core GPUs Based on Prehandling

dc.contributor.authorRuda, Dustin
dc.contributor.authorTurek, Stefan
dc.contributor.authorRibbrock, Dirk
dc.date.accessioned2024-02-11T19:58:31Z
dc.date.available2024-02-11T19:58:31Z
dc.date.issued2024-01
dc.description.abstractThe impetus for the research presented in this work is provided by recent developments in the field of GPU computing. Nvidia GPUs that are equipped with Tensor Cores, such as the A100 or the latest H100, promise an immense computing power of 156 and 495 TFLOPS, respectively, but only for dense matrix operations carried out in single precision (with even higher rates in half precision), since this serves their actual purpose of accelerating AI training. It is shown that this performance can also be exploited to a large extent in the domain of matrix-based finite element methods for solving PDEs, if specially tailored, hardware-oriented methods are used. Such methods need to preserve sufficient accuracy, even if single precision is used, and mostly consist of dense matrix operations. A semi-iterative method for solving Poisson’s equation in 2D and 3D based on prehandling, i.e., explicit preconditioning, by means of hierarchical finite elements or generating systems, that satisfies these requirements, is derived and analyzed.Actual benchmark results on an H100 allow the determination of optimal solver configurations in terms of performance, which ultimately exceeds that of a standard geometric multigrid solver on CPU.en
dc.identifier.issn2190-1767
dc.identifier.urihttp://hdl.handle.net/2003/42317
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-24154
dc.language.isoen
dc.relation.ispartofseriesErgebnisberichte des Instituts für Angewandte Mathematik;671
dc.subject.ddc610
dc.titleFast Semi-Iterative Finite Element Poisson Solvers for Tensor Core GPUs Based on Prehandlingen
dc.typeText
dc.type.publicationtypePreprint
dcterms.accessRightsopen access
eldorado.dnb.deposittruede
eldorado.secondarypublicationfalse

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
Ergebnisbericht Nr. 671.pdf
Größe:
306.4 KB
Format:
Adobe Portable Document Format
Beschreibung:
DNB

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
license.txt
Größe:
4.85 KB
Format:
Item-specific license agreed upon to submission
Beschreibung: