Very Fast Finite Element Poisson Solvers on Lower Precision Accelerator Hardware - A “Proof-of-Concept” Study for NVIDIA Tesla V100

Ruda, Dustin; Turek, Stefan; Ribbrock, Dirk; Zajac, Peter

Full metadata record

DC Field	Value	Language
dc.contributor.author	Ruda, Dustin	-
dc.contributor.author	Turek, Stefan	-
dc.contributor.author	Ribbrock, Dirk	-
dc.contributor.author	Zajac, Peter	-
dc.date.accessioned	2021-07-30T12:48:16Z	-
dc.date.available	2021-07-30T12:48:16Z	-
dc.date.issued	2021-07	-
dc.identifier.issn	2190-1767	-
dc.identifier.uri	http://hdl.handle.net/2003/40355	-
dc.identifier.uri	http://dx.doi.org/10.17877/DE290R-22230	-
dc.description.abstract	Recently, accelerator hardware in the form of graphics cards including Tensor Cores, specialized for AI, has significantly gained in importance in the domain of high performance computing. For example, NVIDIA’s Tesla V100 promises a com-puting power of up to 125 TFLOP/s achieved by Tensor Cores, but only if half precision floating point format is used. We describe the diÿculties and discrepancy between theoretical and actual computing power if one seeks to use such hardware for numerical simulations, i.e., solving partial di˙erential equations with a matrix-based finite element method, with numerical examples. If certain requirements, namely low condition numbers and many dense matrix operations, are met, the indicated high performance can be reached without an excessive loss of accuracy. A new method to solve linear systems arising from Poisson’s equation in 2D that meets these re-quirements, based on “prehandling” by means of hierarchical finite elements and an additional Schur complement approach, is presented and analyzed. We provide numerical results illustrating the computational performance of this method and compare it to a commonly used (geometric) multigrid solver on standard hardware. It turns out that we can exploit nearly the full computational power of Tensor Cores and achieve a significant speed-up compared to the standard methodology without losing accuracy.	en
dc.language.iso	en	-
dc.relation.ispartofseries	Ergebnisberichte des Instituts für Angewandte Mathematik;647	-
dc.subject.ddc	610	-
dc.title	Very Fast Finite Element Poisson Solvers on Lower Precision Accelerator Hardware - A “Proof-of-Concept” Study for NVIDIA Tesla V100	en
dc.type	Text	-
dc.type.publicationtype	preprint	-
dc.subject.rswk	Finite Elemente	de
dcterms.accessRights	open access	-
eldorado.secondarypublication	false	-
Appears in Collections:	Ergebnisberichte des Instituts für Angewandte Mathematik

Files in This Item:

File	Description	Size	Format
Ergebnisbericht Nr. 647.pdf	DNB	921.41 kB	Adobe PDF	View/Open

This item is protected by original copyright

View License

Show simple item record

This item is protected by original copyright rightsstatements.org