Advanced-Topics-in-Neural-Networks-Template-2025/Lab04 at main · Tensor-Reloaded/Advanced-Topics-in-Neural-Networks-Template-2025 · GitHub

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md

README.md

Lab 4

Lab Notebook

Using C++ Modules in PyTorch

Bonus points

You will get bonus points if you implement the torchvision transforms from "Complex Yet Simple Training Pipeline" in C++ and submit them until Lab 6.

For self-study (for students who want to pass):

Foundational CNN papers:
- AlexNet: https://www.cs.toronto.edu/~hinton/absps/imagenet.pdf
- ResNet: https://arxiv.org/abs/1512.03385
- BatchNorm: https://arxiv.org/abs/1502.03167
Advanced optimizers:
- SAM Optimizer: https://github.com/davda54/sam
- Muon Optimizer: https://kellerjordan.github.io/posts/muon/
Hyperparameter tuning / experiment tracking:
- Tensorboard: https://pytorch.org/docs/stable/tensorboard.html
- Weights and Biases: https://docs.wandb.ai/guides/integrations/pytorch
Parallelism: https://docs.pytorch.org/tutorials/beginner/dist_overview.html
- Tensor parallelism: https://docs.pytorch.org/docs/stable/distributed.tensor.parallel.html
- Distributed Data Parallel: https://docs.pytorch.org/tutorials/beginner/ddp_series_theory.html

Advanced (for students who want to learn more):

C++ & CUDA:
- Introduction to CUDA: https://developer.nvidia.com/blog/even-easier-introduction-cuda
- Optimizing preprocessing pipelines with C++ modules: https://medium.com/data-science/how-to-optimize-your-dl-data-input-pipeline-with-a-custom-pytorch-operator-7f8ea2da5206
SAM Optimizer:
- Sharpness-Aware Minimization for Efficiently Improving Generalization: https://arxiv.org/abs/2010.01412
Muon Optimizer:
- Pytorch implementation: https://docs.pytorch.org/docs/stable/generated/torch.optim.Muon.html
- Muon is Scalable for LLM Training: https://arxiv.org/pdf/2502.16982
- Use the Muon implementation from timm if you have >2D weight matrices in your network.
Parallelism tutorials: