The goal here was pretty straightforward: implement functions that use the parallel programming capabilities of the NVIDIA GPU in my computer to perform matrix addition and multiplication far more efficiently than would be possible using the CPU alone.
These computations are especially popular in both the machine learning and crypto mining spaces for the simple reason that GPUs make them fast -- VERY fast. Admittedly, there are far more advanced applications of this technology beyond matrix operations, such as real-time image/video processing, data analytics, cryptography, and deep learning, among many other use cases.
Regardless, both files in this repository demonstrate the immediate advantages of CUDA programming, and will be explained in depth.
This is about as simple as implementing functions with CUDA can get. Typically, on a CPU, every element of the two matrices being added would have to be computed sequentially -- one after the other -- before a valid result could be produced. Using the cores in my NVIDIA GPU, however, these additions can actually occur AT THE SAME TIME. This is the essence of multithreading on the GPU: take a simple operation to be applied across a dataset, then compute all of the elements in parallel to one another in order to save on runtime.
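To make the idea concrete, here is a minimal sketch of what an element-wise addition kernel and its launch can look like. This is illustrative, not the repository's actual code -- the kernel name, the 1024x1024 size, and the 16x16 block shape are all assumptions for the example.

```cuda
#include <cuda_runtime.h>
#include <cstdio>
#include <cstdlib>

// Each thread adds exactly one element; the grid of blocks covers the whole matrix.
__global__ void matrixAdd(const float *A, const float *B, float *C, int rows, int cols) {
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (row < rows && col < cols) {
        int idx = row * cols + col;
        C[idx] = A[idx] + B[idx];
    }
}

int main() {
    const int rows = 1024, cols = 1024;          // illustrative size
    size_t bytes = (size_t)rows * cols * sizeof(float);

    // Host buffers with sample data.
    float *hA = (float *)malloc(bytes), *hB = (float *)malloc(bytes), *hC = (float *)malloc(bytes);
    for (int i = 0; i < rows * cols; ++i) { hA[i] = 1.0f; hB[i] = 2.0f; }

    // Device buffers and host-to-device copies.
    float *dA, *dB, *dC;
    cudaMalloc(&dA, bytes); cudaMalloc(&dB, bytes); cudaMalloc(&dC, bytes);
    cudaMemcpy(dA, hA, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(dB, hB, bytes, cudaMemcpyHostToDevice);

    // Launch a 2D grid so every element gets its own thread.
    dim3 block(16, 16);
    dim3 grid((cols + block.x - 1) / block.x, (rows + block.y - 1) / block.y);
    matrixAdd<<<grid, block>>>(dA, dB, dC, rows, cols);
    cudaDeviceSynchronize();

    cudaMemcpy(hC, dC, bytes, cudaMemcpyDeviceToHost);
    printf("C[0] = %f\n", hC[0]);                // expect 3.0

    cudaFree(dA); cudaFree(dB); cudaFree(dC);
    free(hA); free(hB); free(hC);
    return 0;
}
```

Every one of the million additions is an independent thread, which is exactly why the GPU can chew through them simultaneously instead of one after the other.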
Same idea here, but this also introduces the idea of tiled matrix multiplication using shared memory (wowwwww). This is even more efficient than the standard multithreaded approach because each thread block loads small tiles of the input matrices into fast shared memory and reuses them, cutting way down on slow global-memory reads. And sure enough, it actually does provide a significant improvement over the standard GPU method.
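Below is a minimal sketch of what a shared-memory tiled multiply can look like, assuming square N x N matrices and a 16-element tile width; the names `tiledMatMul` and `TILE_WIDTH` are illustrative and not necessarily what the repository's file uses.

```cuda
#include <cuda_runtime.h>

#define TILE_WIDTH 16

// Tiled multiply: each block stages TILE_WIDTH x TILE_WIDTH sub-matrices of A and B
// in shared memory, so every value fetched from global memory is reused TILE_WIDTH times.
__global__ void tiledMatMul(const float *A, const float *B, float *C, int N) {
    __shared__ float tileA[TILE_WIDTH][TILE_WIDTH];
    __shared__ float tileB[TILE_WIDTH][TILE_WIDTH];

    int row = blockIdx.y * TILE_WIDTH + threadIdx.y;
    int col = blockIdx.x * TILE_WIDTH + threadIdx.x;
    float acc = 0.0f;

    // Walk across the shared dimension one tile at a time.
    for (int t = 0; t < (N + TILE_WIDTH - 1) / TILE_WIDTH; ++t) {
        int aCol = t * TILE_WIDTH + threadIdx.x;
        int bRow = t * TILE_WIDTH + threadIdx.y;

        // Load this tile into shared memory, zero-padding past the matrix edges.
        tileA[threadIdx.y][threadIdx.x] = (row < N && aCol < N) ? A[row * N + aCol] : 0.0f;
        tileB[threadIdx.y][threadIdx.x] = (bRow < N && col < N) ? B[bRow * N + col] : 0.0f;
        __syncthreads();  // wait until the whole tile is loaded before using it

        for (int k = 0; k < TILE_WIDTH; ++k)
            acc += tileA[threadIdx.y][k] * tileB[k][threadIdx.x];
        __syncthreads();  // don't overwrite the tile while other threads still read it
    }

    if (row < N && col < N)
        C[row * N + col] = acc;
}

// Launch example (dA, dB, dC already allocated and filled on the device):
//   dim3 block(TILE_WIDTH, TILE_WIDTH);
//   dim3 grid((N + TILE_WIDTH - 1) / TILE_WIDTH, (N + TILE_WIDTH - 1) / TILE_WIDTH);
//   tiledMatMul<<<grid, block>>>(dA, dB, dC, N);
```

The two `__syncthreads()` calls are what make the shared-memory trick safe: all threads in a block finish loading a tile before anyone multiplies with it, and finish multiplying before the next tile overwrites it.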
This will only verifiably work on NVIDIA GPUs with the processing power of the GeForce 1660 Super or higher. In fact, on the 30-series architectures and beyond, which are considerably more powerful than my personal hardware, the speedups should be even more noteworthy.

