Using 2D grid in cudaKronprod for state initialization by sacpis · Pull Request #4130 · NVIDIA/cuda-quantum

sacpis · 2026-03-10T17:08:44Z

Using 2D grid in cudaKronprod to parallelize over both state dimensions.

Fixes QA bug 5961696.

The bug is in kronprod which calls cudaKronprod as a 1D grid. The kernel uses blockIdx.y / gridDim.y to go over the user state (for n = 30 qubits, it will be 1 billion elments), which is tsize2 parameter. Now, when gridDim.y = 1, only the first block ran. It went over all 2^n elements sequentially. Other blocks just launched and exited.

The 2D grid shows that there are 65535 blocks each handling 4 chunks in parallel.

Used the code below

import cudaq
import numpy as np

cudaq.set_target("nvidia")

@cudaq.kernel
def kernel(vec : cudaq.State):
    p = cudaq.qubit()
    q = cudaq.qvector(vec)

    mz(p)
    mz(q)

n = 30
v = np.zeros(2**n, dtype=cudaq.complex())
v[-1] = 1.
state = cudaq.State.from_data(v)

print(cudaq.sample(kernel, state))

This change now executes the kernel within 1 minute.

Signed-off-by: Sachin Pisal <spisal@nvidia.com>

github-actions · 2026-03-10T18:45:21Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

runtime/nvqir/custatevec/CuStateVecCircuitSimulator.cu

github-actions · 2026-03-10T20:53:20Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

Signed-off-by: Sachin Pisal <spisal@nvidia.com>

github-actions · 2026-03-10T22:36:55Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

Using 2D grid in cudaKronprod to parallelize over both state dimensions

a76612f

Signed-off-by: Sachin Pisal <spisal@nvidia.com>

sacpis requested review from 1tnguyen, bettinaheim and bmhowe23 March 10, 2026 17:08

Merge branch 'main' into 0.14_qvector_initialization_segfault

9a0a113

copy-pr-bot bot temporarily deployed to ghcr-ci March 10, 2026 17:09 Inactive

sacpis changed the title ~~Using 2D grid in cudaKronprod~~ Using 2D grid in cudaKronprod for State vector initialization Mar 10, 2026

copy-pr-bot bot temporarily deployed to ghcr-ci March 10, 2026 17:09 Inactive

copy-pr-bot bot had a problem deploying to ghcr-ci March 10, 2026 17:09 Error

copy-pr-bot bot temporarily deployed to ghcr-ci March 10, 2026 17:09 Inactive

copy-pr-bot bot temporarily deployed to ghcr-ci March 10, 2026 17:23 Inactive

github-actions bot pushed a commit that referenced this pull request Mar 10, 2026

Docs preview for PR #4130.

ecf9e49

Merge branch 'main' into 0.14_qvector_initialization_segfault

6e20ae5

copy-pr-bot bot temporarily deployed to ghcr-ci March 10, 2026 19:08 Inactive

copy-pr-bot bot temporarily deployed to ghcr-ci March 10, 2026 19:09 Inactive

schweitzpgi reviewed Mar 10, 2026

View reviewed changes

runtime/nvqir/custatevec/CuStateVecCircuitSimulator.cu Outdated Show resolved Hide resolved

putting out the constant

af1a5ec

Signed-off-by: Sachin Pisal <spisal@nvidia.com>

schweitzpgi approved these changes Mar 10, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using 2D grid in cudaKronprod for state initialization#4130

Using 2D grid in cudaKronprod for state initialization#4130
sacpis merged 4 commits intoNVIDIA:mainfrom
sacpis:0.14_qvector_initialization_segfault

sacpis commented Mar 10, 2026

Uh oh!

github-actions bot commented Mar 10, 2026

Uh oh!

Uh oh!

github-actions bot commented Mar 10, 2026

Uh oh!

github-actions bot commented Mar 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sacpis commented Mar 10, 2026

Uh oh!

github-actions bot commented Mar 10, 2026

Uh oh!

Uh oh!

github-actions bot commented Mar 10, 2026

Uh oh!

github-actions bot commented Mar 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants