benchmarks

Failed to load latest commit information.

Cannot retrieve latest commit at this time.

Name		Name	Last commit message	Last commit date
parent directory ..
Artifacts.toml		Artifacts.toml
Manifest.toml		Manifest.toml
Project.toml		Project.toml
README.md		README.md
benchmarks_cufft.jl		benchmarks_cufft.jl
benchmarks_rocfft.jl		benchmarks_rocfft.jl
cpu_vs_gpu.jl		cpu_vs_gpu.jl
crystal.jl		crystal.jl

README.md

Benchmarks

This directory contains all the scripts and configuration files needed to reproduce the numerical results presented in the paper Recovering Sparse DFT from Missing Signals via Interior Point Method on GPU.

Overview

Each script in this folder benchmarks different aspects of our GPU-accelerated interior-point solver and FFT implementations, comparing them against CPU-based references.

Requirements

Ensure you have the appropriate hardware drivers installed (CUDA for NVIDIA GPUs, ROCm for AMD GPUs).

Installation

Launch Julia with the project environment:

julia --project=.

Instantiate the environment:

using Pkg
Pkg.instantiate()

Usage

To run a benchmark script, use one of the following commands:

julia --project=. -e 'include("benchmarks_cufft.jl")'
julia --project=. -e 'include("benchmarks_rocfft.jl")'
julia --project=. -e 'include("cpu_vs_gpu.jl")'
julia --project=. -e 'include("crystal.jl")'

Scripts

benchmarks_cufft.jl

Compares cuFFT (via CUDA.jl) against FFTW (via FFTW.jl) on problems of various sizes. Measures execution time for fft and ifft operations on random data.

benchmarks_rocfft.jl

Compares rocFFT (via AMDGPU.jl) against FFTW (via FFTW.jl). Similar to the cuFFT benchmarks; results were not included in the final paper.

cpu_vs_gpu.jl

Benchmarks our compressed sensing solver on CPU vs GPU across a range of problem sizes (artificial test cases).

crystal.jl

Applies the same solver to a real-world problem of 104 million variables, comparing CPU and GPU performance on a crystallographic dataset.

Preferences

To enable unified memory by default on the GH200, create a file named LocalPreferences.toml in this directory with the following content:

[CUDA]
default_memory = "unified"

Acknowledgments

We thank JLSE for providing access to the NVIDIA GH200 used in our experiments.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

benchmarks

benchmarks

README.md

Benchmarks

Overview

Requirements

Installation

Usage

Scripts

Preferences

Acknowledgments

Files

benchmarks

Directory actions

More options

Directory actions

More options

Latest commit

History

benchmarks

Folders and files

parent directory

README.md

Benchmarks

Overview

Requirements

Installation

Usage

Scripts

Preferences

Acknowledgments