-
Notifications
You must be signed in to change notification settings - Fork 2.9k
[FPSAN] Load TCGen shared operands directly #10473
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
26 commits
Select commit
Hold shift + click to select a range
45e2e4b
Accelerate FPSan MMA emulation with i8 decomposition
jeffniu-openai a784db0
Test FPSan TCGen MMA in warp partitions
jeffniu-openai 333348b
[FPSan] Address i8 decomposition review comments
jeffniu-openai 4dd75e4
Support multi-CTA local gather and scatter
jeffniu-openai 80bf7cc
Simplify multi-CTA gather and scatter lowering
jeffniu-openai 1ffc0a1
Preserve explicit cluster gather codegen
jeffniu-openai 6dc53ac
Apply pre-commit formatting
jeffniu-openai 4869637
Add instrumentation local gather for FPSan
jeffniu-openai 2c9405f
Simplify instrumentation local gather
jeffniu-openai b1d02d7
Apply pre-commit formatting
jeffniu-openai 8cc70b2
Apply post-restack formatting
jeffniu-openai 5194768
Merge remote-tracking branch 'refs/remotes/github/main' into jeffniu/…
jeffniu-openai abde891
[NVIDIA] Address multi-CTA gather review
jeffniu-openai 68037d4
[NVIDIA] Minimize multi-CTA shared dispatch
jeffniu-openai 3d0f65f
[NVIDIA] Trim multi-CTA gather changes
jeffniu-openai d098a0e
[NVIDIA] Restore multi-CTA lowering coverage
jeffniu-openai 96e29f0
[NVIDIA] Simplify multi-CTA runtime test setup
jeffniu-openai 1e55b3c
[NVIDIA] Use nullable values for distributed shared memory
jeffniu-openai d9bd4b7
merge
jeffniu-openai 2396557
Merge branch 'jeffniu/local-gather-scatter-multicta' of https://githu…
jeffniu-openai ac28f5d
cleanup
jeffniu-openai ac6263d
[NVIDIA] Always map distributed shared accesses
jeffniu-openai 32f914d
[GPUToLLVM] Lookup local address outputs by name
jeffniu-openai a54b8e9
[NVIDIA] Relax local gather barrier check
jeffniu-openai 87bbf02
Merge branch 'jeffniu/local-gather-scatter-multicta' into jeffniu/tti…
jeffniu-openai 967ee8f
Merge triton-lang/triton main into jeffniu/tti-experimental-local-gather
jeffniu-openai File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why can't just slice and local_load?