Director of Machine Learning at Cloudastructure. Computer vision for physical security; GPU performance and the unglamorous parts of ML systems.
I write about CUDA, NVIDIA driver internals, and numerical stability at abhik.ai.
- [Best Resources for Learning CUDA Matrix Multiplication Optimization](https://www.abhik.ai/articles/best-resources-cuda-matmul-optimization) Jun 03, 2026
- [C++ Build Pipeline: Compilation vs Linking vs Loading Explained](https://www.abhik.ai/articles/cpp-build-pipeline) Jun 03, 2026
- [H.264 vs H.265 vs AV1: Comparing Modern Video Codecs](https://www.abhik.ai/articles/h264-vs-h265-vs-av1) Jun 03, 2026
- [CUDA Matrix Multiplication Optimization: From Naive to Near-cuBLAS](https://www.abhik.ai/articles/cuda-matrix-multiplication-optimization) Apr 07, 2026
- [The Complete NVIDIA Xid Error Field Guide](https://www.abhik.ai/articles/nvidia-xid-errors) Mar 13, 2026

- Building pylings.
- Going deeper on CUDA kernel authoring and Nsight Systems workflows.
- Researching failure modes in ML training clusters (GPU, network, and storage); wrote the NVIDIA Xid Error Field Guide from that work.
- Upcoming: talk at EuroPython 2026 (July, Kraków).
- Workshop at PyCon Italia 2026: "Write Your First High-Performance GPU Kernel in Python!" (github).
- Workshop at PyCon India 2025: "ArrPy: rebuilding NumPy from scratch".





