An extension of a very fast direct finite element Poisson solver on lower precision accelerator hardware towards semi-structured grids

dc.contributor.authorRuda, Dustin
dc.contributor.authorTurek, Stefan
dc.contributor.authorRibbrock, Dirk
dc.contributor.authorZajac, Peter
dc.date.accessioned2022-08-12T13:22:30Z
dc.date.available2022-08-12T13:22:30Z
dc.date.issued2022-07
dc.description.abstractGraphics cards that are equipped with Tensor Core units designed for AI applica tions, for example the NVIDIA Ampere A100, promise very high peak rates concerning their computing power (156 TFLOP/s in single and 312 TFLOP/s in half precision in the case of the A100). This is only achieved when performing arithmetically intensive operations such as dense matrix multiplications in the aforementioned lower precision, which is an obstacle when trying to use this hardware for solving linear systems arising from PDEs discretized with the finite element method. In previous works, we delivered a proof of concept that the predecessor of the A100, the V100 and its Tensor Cores, can be exploited to a great extent when solving Poisson’s equation on the unit square if a hardware-oriented direct solver based on prehandling via hierarchical finite elements and a Schur complement approach is used. In this work, using numerical results on an A100 graphics card, we show that the method also achieves a very high performance if Poisson’s equation, which is discretized by linear finite elements, is solved on a more complex domain corresponding to a flow around a square configuration.en
dc.identifier.issn2190-1767
dc.identifier.urihttp://hdl.handle.net/2003/41031
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-22879
dc.language.isoen
dc.relation.ispartofseriesErgebnisberichte des Instituts fĂĽr Angewandte Mathematik;654
dc.subjectaccelerator hardwareen
dc.subjectlower precisionen
dc.subjecthierarchical finite elementsen
dc.subjectprehandlingen
dc.subjectNVIDIA A100en
dc.subjecttensor core GPUsen
dc.subject.ddc610
dc.titleAn extension of a very fast direct finite element Poisson solver on lower precision accelerator hardware towards semi-structured gridsen
dc.typeText
dc.type.publicationtypepreprint
dcterms.accessRightsopen access
eldorado.dnb.deposittruede
eldorado.secondarypublicationfalse

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ergebnisbericht Nr. 654.pdf
Size:
2.54 MB
Format:
Adobe Portable Document Format
Description:
DNB
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
4.85 KB
Format:
Item-specific license agreed upon to submission
Description: