Skip to content

Conversation

@jeremylt
Copy link
Member

@jeremylt jeremylt commented Jul 2, 2025

Switching to gen QF assembly, which should decrease memory required and assembly time

  • force non-collocated strategy
  • create C side logic for calling build fn, launching kernel
  • copy to HIP

@jeremylt jeremylt self-assigned this Jul 2, 2025
@jeremylt jeremylt force-pushed the jeremy/gen-qf-assemble branch from 506967d to d990c3d Compare July 3, 2025 16:12
@jeremylt jeremylt changed the title WIP GPU Gen QFunction Assembly Jul 7, 2025
@jeremylt jeremylt force-pushed the jeremy/gen-qf-assemble branch 5 times, most recently from 894c9cc to 2d6696e Compare July 8, 2025 16:38
@jeremylt jeremylt force-pushed the jeremy/gen-qf-assemble branch from 2d6696e to af34f19 Compare July 9, 2025 15:15
@jeremylt jeremylt force-pushed the jeremy/gen-qf-assemble branch from 773b7f1 to ca38d01 Compare July 9, 2025 16:16
@jeremylt jeremylt merged commit 6d997e5 into main Jul 9, 2025
29 checks passed
@jeremylt jeremylt deleted the jeremy/gen-qf-assemble branch July 9, 2025 16:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants