Leveraging the NVIDIA A100 GPU for AI and HPC

The NVIDIA A100 GPU will soon be found across the majority of the Research Computing clusters. This powerful accelerator offers a theoretical performance of 9.7 TFLOPS in double precision and 19.5 in single. Specialized hardware units on the GPUs called Tensor Cores allow for even faster speeds. In order to take full advantage of the A100, most applications require users to modify their input scripts.

This workshop provides an overview of the features of the A100 GPU along with specific use cases for deep learning (PyTorch and TensorFlow) and HPC. Tools for performance profiling and for measuring data transfer rates will be presented.

Local Self-Study (Non-Slurm)

If you already have a local machine with an NVIDIA A100, use LOCAL_SELF_STUDY.md. It adapts the workshop into a workstation-friendly path with local wrapper scripts under scripts/local/.

Princeton / Slurm Workshop Flow

If you are following the original cluster-based workshop flow, start with setup.md and then continue through the numbered directories in order.

Getting Help

If you encounter any difficulties with the material in this guide then please send an email to cses@princeton.edu or attend a help session.

Authorship

This guide was created by Xuefei Zhang, Jonathan Halverson, and members of Princeton Research Computing.

Name		Name	Last commit message	Last commit date
Latest commit History 165 Commits
01_a100_overview		01_a100_overview
02_adroit_gpu_nodes		02_adroit_gpu_nodes
03_gpu_programming_review		03_gpu_programming_review
04_floating_point_formats		04_floating_point_formats
05_mps_gpudirect_cuda_aware_mpi		05_mps_gpudirect_cuda_aware_mpi
06_cupy		06_cupy
07_pytorch		07_pytorch
08_tensorflow		08_tensorflow
09_takeaways		09_takeaways
scripts/local		scripts/local
tests		tests
.gitignore		.gitignore
LOCAL_SELF_STUDY.md		LOCAL_SELF_STUDY.md
README.md		README.md
setup.md		setup.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Leveraging the NVIDIA A100 GPU for AI and HPC

Local Self-Study (Non-Slurm)

Princeton / Slurm Workshop Flow

Getting Help

Authorship

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Leveraging the NVIDIA A100 GPU for AI and HPC

Local Self-Study (Non-Slurm)

Princeton / Slurm Workshop Flow

Getting Help

Authorship

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages