Skip to content

Jacobi#779

Open
yodada wants to merge 6 commits into
mainfrom
lc873-03-29-merge-jacobi
Open

Jacobi#779
yodada wants to merge 6 commits into
mainfrom
lc873-03-29-merge-jacobi

Conversation

@yodada

@yodada yodada commented Mar 29, 2022

Copy link
Copy Markdown
Collaborator

This PR merges Jacobi device code, which can be found at examples/cuda/test_jacobi/

Jacobi 3D takes an input of Nx * Ny * Nz. This implementation is unrolled along Nx, and distributes Ny and Nz along tileX and tileY respectively. So the minimal valid input is 64 * 18 * 10. Another valid input is 126 * 18 *10. Note there along Nx it reads 64 inputs and generates 62 outputs. There is overlapping between steps.

@drichmond

Copy link
Copy Markdown
Collaborator

Merged kernel code from manycore

@drichmond drichmond force-pushed the lc873-03-29-merge-jacobi branch from 211c3c9 to 6df3a7e Compare May 17, 2022 23:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants