OpenMP Offloading

Overview

This (example) project focused on benchmarking GPU OpenMP-offloading performance, using a variety of preexisting benchmarks. This project used a variety of:

Benchmarks

This work used 6 benchmarking applications, all of which support OpenMP-offloading, whilst others offer also CUDA and HIP versions.

BabelStream

A GPU port of the STREAMS benchmark

XSBench

A neutron scattering kernels

RSBench

A neutron scattering kernels

SU3Bench

SU(3) kernels from QCD code

miniBUDE

Kernels from the BUDE protein simulation code

miniQMC

Kernels from the QMCPACK, quantum Monte Carlo code

Compilers

ARCHER2 - AMD

ARCHER2 is a Cray built system and so makes use of the Cray programming environment and so, along with it's AMD architecture has two current OpenMP-offload enable compilers:

AMD
Cray

Cirrus - Nvidia

Cirrus supports several Nvidia GPU V100 nodes and so the OpenMP-offload enabled compilers are:

Nvidia compilers (nvc, nvc++, nvfortran) - supplied via the Nvidia HPC-SDK (Software Developer Kit)
GCC compilers

EIDF - Nvidia

Nvidia compilers (nvc, nvc++, nvfortran) - supplied via the Nvidia HPC-SDK (Software Developer Kit) container image

Compute systems

This project relied on two traditional HPC systems Cirrus and ARCHER2, and a cloud platform: the Edinburgh International Data Facility (EIDF)

ARCHER2

The UK National supercomputing service

Cirrus

An older Tier-2 system, which has a substantial number of Nvidia V100 GPUs.

EIDF

A cloud platform focused on machine learning applications, with a GPU service that consists of both Nvidia A100 and H100 GPUs

Deployment

SLURM

One of the traditional schedulers for submitting jobs to HPC systems.

Kubernetes

Kubernetes is a system for managing cloud resources and deploying containerised applications. It is notorious for having a steep learning curve for beginners; however, for getting started on the EIDF GPU service, there are useful getting started examples.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OpenMP Offloading

Overview

Benchmarks

BabelStream

XSBench

RSBench

SU3Bench

miniBUDE

miniQMC

Compilers

ARCHER2 - AMD

Cirrus - Nvidia

EIDF - Nvidia

Compute systems

ARCHER2

Cirrus

EIDF

Deployment

SLURM

Kubernetes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Clone this wiki locally