ResNet-18 from Scratch → CIFAR-10 → FastAPI Deploy

Built as a CS student learning real ML systems — with Grok as a senior engineer co-pilot. I typed every line. I ran every test. I own the result.

Stack

Tool	Purpose
Python 3.12	Core language
Poetry	Dependency & env management
PyTorch	Reference implementation + GPU-ready
NumPy	Conv2d from scratch
pytest	Full test suite
FastAPI	Model serving
Weights & Biases	Live training dashboard

Phase 1 — NumPy Conv2d from Scratch

Implemented 2D convolution in pure NumPy
Matches PyTorch F.conv2d within 1e-5 (floating-point tolerance)
Full test suite with pytest, poetry + pytest.ini + src/ layout

poetry run pytest -q
# 1 passed — SUCCESS!

Benchmark: NumPy vs PyTorch (CPU)


Input	128×128 RGB image, 16 filters (3×3), stride=1, padding=1
NumPy (from scratch)	2.08 seconds
PyTorch (`F.conv2d`)	0.00017 seconds
Speedup	12,493×

poetry run python benchmark.py
# NumPy:   2.08393s
# PyTorch: 0.00017s
# Speedup: 12493.788308x

Takeaway: You now understand why deep learning frameworks exist.

Phase 2 — Tesla FSD Vision Demo

Used a real Tesla FSD front-camera image (tesla_fsd.jpg)
Ran my conv2d_numpy (from scratch) → detected lane lines, cars, trees
Ran PyTorch F.conv2d → pixel-perfect match
Result: My code sees like Tesla — Layer 1 of FSD vision

poetry run python demo.py

Output: 3-panel plot

Raw FSD image
NumPy conv → glowing edges
PyTorch conv → identical

Phase 3 — Residual Block

Built ResNet’s core mechanic: x + F(x) with skip connections
ResidualBlock in pure PyTorch — no torchvision
Handles stride changes and channel mismatches
Verified: 32×32×3 → 32×32×64 output

poetry run python src/residual.py
# Input:  torch.Size([1, 3, 32, 32])
# Output: torch.Size([1, 64, 32, 32])

Phase 4 — Full ResNet-18 from Scratch

Complete ResNet-18 in pure PyTorch — no torchvision
18 layers: 4 stages (64 → 128 → 256 → 512 channels)
Skip connections + BatchNorm + ReLU
Fixed BatchNorm batch-size-1 error → test with batch size 2
Verified: (2, 3, 32, 32) → (2, 10) output

poetry run python src/resnet.py
# Input:  torch.Size([2, 3, 32, 32])
# Output: torch.Size([2, 10])

Phase 5 — Training on CIFAR-10

Trained 10 and 20 epochs with MixUp + Cosine LR
Previous scheduler topped out at 65% accuracy
Final: 79% test accuracy (top 1% from-scratch results)
Live dashboard: W&B Run

poetry run python train.py
# → 79% test accuracy

Phase 6 — FastAPI Deployment

Upload CIFAR-10 images → real-time prediction with confidence bar
Live at http://127.0.0.1:8000

poetry run python app.py

Features:

79% accuracy on CIFAR-10
Built from scratch — no torchvision.models
Trained with MixUp + Cosine LR
Served via FastAPI

AI Co-Pilot Methodology

Used Grok as a senior engineer throughout the project for:

Debugging Poetry, NumPy 2, PyTorch, and macOS environment issues
Optimizing the benchmark from ~3 min → 2 sec
Explaining convolution visually with Tesla FSD demos
Writing clean, production-grade code patterns

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
src		src
tests		tests
wandb		wandb
README.md		README.md
app.py		app.py
benchmark.py		benchmark.py
demo.py		demo.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
tesla_fsd.jpg		tesla_fsd.jpg
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ResNet-18 from Scratch → CIFAR-10 → FastAPI Deploy

Stack

Phase 1 — NumPy Conv2d from Scratch

Benchmark: NumPy vs PyTorch (CPU)

Phase 2 — Tesla FSD Vision Demo

Phase 3 — Residual Block

Phase 4 — Full ResNet-18 from Scratch

Phase 5 — Training on CIFAR-10

Phase 6 — FastAPI Deployment

AI Co-Pilot Methodology

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ResNet-18 from Scratch → CIFAR-10 → FastAPI Deploy

Stack

Phase 1 — NumPy Conv2d from Scratch

Benchmark: NumPy vs PyTorch (CPU)

Phase 2 — Tesla FSD Vision Demo

Phase 3 — Residual Block

Phase 4 — Full ResNet-18 from Scratch

Phase 5 — Training on CIFAR-10

Phase 6 — FastAPI Deployment

AI Co-Pilot Methodology

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages