Publications by the same author
plus in the repository
plus in Google Scholar

Bibliografische Daten exportieren
 

Analyzing performance portability for a SYCL implementation of the 2D shallow water equations

DOI zum Zitieren der Version auf EPub Bayreuth: https://doi.org/10.15495/EPub_UBT_00008913
URN to cite this document: urn:nbn:de:bvb:703-epub-8913-9

Title data

Büttner, Markus ; Alt, Christoph ; Kenter, Tobias ; Köstler, Harald ; Plessl, Christian ; Aizinger, Vadym:
Analyzing performance portability for a SYCL implementation of the 2D shallow water equations.
In: The Journal of Supercomputing. Vol. 81 (2025) . - 772.
ISSN 1573-0484
DOI der Verlagsversion: https://doi.org/10.1007/s11227-025-07063-7

[thumbnail of s11227-025-07063-7.pdf]
Format: PDF
Name: s11227-025-07063-7.pdf
Version: Published Version
Available under License Creative Commons BY 4.0: Attribution
Download (2MB)

Project information

Project title:
Project's official title
Project's id
Performance-optimiertes Co-Design von Ozeanmodellierungssoftware auf FPGAs
502500606
Open Access Publizieren
No information

Project financing: Deutsche Forschungsgemeinschaft

Abstract

SYCL is an open standard for targeting heterogeneous hardware from C++. In this work, we evaluate a SYCL implementation for a discontinuous Galerkin discretization of the 2D shallow water equations targeting CPUs, GPUs, and also FPGAs. The discretization uses polynomial orders zero to two on unstructured triangular meshes. Separating memory accesses from the numerical code allow us to optimize data accesses for the target architecture. A performance analysis shows good portability across x86 and ARM CPUs, GPUs from different vendors, and even two variants of Intel Stratix 10 FPGAs. Measuring the energy to solution shows that GPUs yield an up to 10x higher energy efficiency in terms of degrees of freedom per joule compared to CPUs. With custom designed caches, FPGAs offer a meaningful complement to the other architectures with particularly good computational performance on smaller meshes. FPGAs with High Bandwidth Memory are less affected by bandwidth issues and have similar energy efficiency as latest generation CPUs.

Further data

Item Type: Article in a journal
DDC Subjects: 000 Computer Science, information, general works > 004 Computer science
500 Science > 510 Mathematics
500 Science > 550 Earth sciences, geology
Institutions of the University: Faculties > Faculty of Mathematics, Physics und Computer Science > Department of Mathematics > Chair Scientific Computing
Faculties > Faculty of Mathematics, Physics und Computer Science > Department of Mathematics > Chair Scientific Computing > Chair Scientific Computing - Univ.-Prof. Dr. Mario Bebendorf
Faculties > Faculty of Mathematics, Physics und Computer Science > Department of Mathematics > Chair Numerics of Partial Differential Equations
Faculties > Faculty of Mathematics, Physics und Computer Science > Department of Mathematics > Chair Numerics of Partial Differential Equations > Chair Numerics of Partial Differential Equations - Univ.-Prof. Dr. Vadym Aizinger
Profile Fields > Advanced Fields > Nonlinear Dynamics
Research Institutions > Central research institutes > Bayreuth Research Center for Modeling and Simulation - MODUS
Research Institutions > Central research institutes > Forschungszentrum für Wissenschaftliches Rechnen an der Universität Bayreuth - HPC-Forschungszentrum
Faculties
Faculties > Faculty of Mathematics, Physics und Computer Science
Faculties > Faculty of Mathematics, Physics und Computer Science > Department of Mathematics
Profile Fields
Profile Fields > Advanced Fields
Research Institutions
Research Institutions > Central research institutes
Language: English
Originates at UBT: Yes
URN: urn:nbn:de:bvb:703-epub-8913-9
Date Deposited: 23 Feb 2026 12:34
Last Modified: 23 Feb 2026 12:48
URI: https://epub.uni-bayreuth.de/id/eprint/8913

Downloads

Downloads per month over past year