Skip to content

Starmys/TritonStudyGroup

Repository files navigation

Triton Kernel Study Group

Introduction

Triton crash course prepared for beginners.

Outline

  1. Introduction to GPU architecture
  2. Write a simple CUDA kernel: softmax
  3. Introduction to Triton and Triton softmax kernel
  4. Tensor Core and Triton matrix multiplication
  5. Debugging kernels using NVIDIA NCU
  6. Flash-Attention algorithm
  7. Triton Flash-Attention kernels (fwd & bwd)
  8. Triton kernel examples #1
  9. Triton kernel examples #2

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages