Skip to content

eunomia-bpf/basic-cuda-tutorial

Repository files navigation

basic-cuda-tutorial

You can find the code in https://github.com/eunomia-bpf/basic-cuda-tutorial

A collection of CUDA programming examples to learn GPU programming with NVIDIA CUDA.

make sure change the gpu architecture sm_61 to your own gpu architecture in Makefile

Examples and tutorials

  • basic01.cu and basic01.md: Introduction to CUDA programming with a vector addition example
  • basic02.cu and basic02.md: Demonstration of CUDA PTX inline assembly with a vector multiplication example
  • basic03.cu and basic03.md: Comprehensive comparison of GPU programming methods including CUDA, PTX, Thrust, Unified Memory, Shared Memory, CUDA Streams, and Dynamic Parallelism using matrix multiplication
  • basic04.cu and basic04.md: Detailed exploration of GPU organization hierarchy including hardware architecture, thread/block/grid structure, memory hierarchy, and execution model
  • basic05.cu and basic05.md: Implementing a basic neural network forward pass on GPU with CUDA
  • basic06.cu and basic06.md: GPU-accelerated convolution operations for CNN with shared memory optimization
  • basic07.cu and basic07.md: CUDA implementation of attention mechanism for transformer models

Upcoming ML/AI GPU Tutorials

  • basic08.cu: Mixed-precision training with Tensor Cores using CUDA

Each tutorial will include comprehensive documentation explaining the concepts, implementation details, and optimization techniques used in ML/AI workloads on GPUs.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Sponsor this project

  •  
  •  

Packages

No packages published

Contributors 2

  •  
  •