Jacobi#641

Closed
yodada wants to merge 1 commit into master from lc873-03-29-merge-jacobi

yodada commented Mar 29, 2022

Collaborator

This PR merges the Jacobi kernel code, which can be found at software/spmd/bsg_cuda_lite_runtime/jacobi/.

Jacobi 3D takes an input of size Nx * Ny * Nz. This implementation is unrolled along Nx and distributes Ny and Nz across tileX and tileY, respectively, so the minimal valid input is 64 * 18 * 10. Another valid input is 126 * 18 * 10. Note that along Nx each step reads 64 inputs and generates 62 outputs, so consecutive steps overlap.
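
To illustrate the Nx constraint, here is a minimal Python sketch (not the kernel code itself). It assumes that each step advances by 62 elements, which is inferred from the two example sizes (126 = 64 + 62), so that valid Nx values have the form 64 + 62 * k. The names `READ_WIDTH`, `WRITE_WIDTH`, `is_valid_nx`, and `read_windows` are hypothetical and do not appear in the kernel.

```python
# Sketch only: models the read windows along Nx described in this PR.
# Assumption: consecutive steps are offset by WRITE_WIDTH elements.

READ_WIDTH = 64   # inputs read per step along Nx (from the PR description)
WRITE_WIDTH = 62  # outputs generated per step along Nx (from the PR description)


def is_valid_nx(nx):
    """Return True if nx can be covered exactly by overlapping 64-wide reads."""
    return nx >= READ_WIDTH and (nx - READ_WIDTH) % WRITE_WIDTH == 0


def read_windows(nx):
    """Return the half-open [start, end) input range read by each step."""
    if not is_valid_nx(nx):
        raise ValueError(f"Nx={nx} is not of the form 64 + 62 * k")
    steps = (nx - READ_WIDTH) // WRITE_WIDTH + 1
    return [(s * WRITE_WIDTH, s * WRITE_WIDTH + READ_WIDTH) for s in range(steps)]


if __name__ == "__main__":
    print(read_windows(64))   # [(0, 64)]
    print(read_windows(126))  # [(0, 64), (62, 126)]
```

Under this assumption, the two windows for Nx = 126 share inputs 62 and 63, which is the overlap between steps mentioned above.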

@drichmond

Contributor

Merged kernel code into bespoke-silicon-group/bsg_replicant#779

drichmond closed this Mar 29, 2022