Conversation

@jeremylt
Member

@jeremylt jeremylt commented Jul 9, 2025

@zatkins-dev does this change the plots significantly? We weren't counting all the GPU FLOPs for at-points basis evaluation quite correctly.

@jeremylt jeremylt force-pushed the jeremy/fix-flop-count branch from 04a4994 to 5278038 Compare July 9, 2025 20:48
@jeremylt
Member Author

OK, this should give a mild decrease in the number of FLOPs.

@jeremylt
Member Author

The new setup is noticeably faster on small laptop runs:

/gpu/cuda/shared

old
7.09
7.50
7.18

new
6.81
6.96
6.94
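
For a quick read of the numbers above, here is a minimal sketch that averages the three old and three new runs and reports the relative change. The comment does not state units, so the values are treated as unitless timings where lower is better (an assumption), and the variable names are purely illustrative.

```python
from statistics import mean

# Timings copied from the comment above for /gpu/cuda/shared.
# Units are not stated in the thread; assumed to be times (lower is better).
old = [7.09, 7.50, 7.18]
new = [6.81, 6.96, 6.94]

old_mean = mean(old)
new_mean = mean(new)

# Relative reduction of the new mean with respect to the old mean.
reduction = (old_mean - new_mean) / old_mean

print(f"old mean: {old_mean:.2f}")
print(f"new mean: {new_mean:.2f}")
print(f"relative reduction: {reduction:.1%}")
```

With only three runs per configuration on a laptop, this is a rough indication (about a 5% reduction in the mean) rather than a rigorous benchmark.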

@jeremylt jeremylt force-pushed the jeremy/fix-flop-count branch from d998b37 to 56a8e69 Compare July 10, 2025 16:39
@jeremylt jeremylt force-pushed the jeremy/fix-flop-count branch from 56a8e69 to 8aca9eb Compare July 10, 2025 16:44
@jeremylt jeremylt force-pushed the jeremy/fix-flop-count branch from 6ddf54c to 802d760 Compare July 10, 2025 19:34
@jeremylt jeremylt merged commit 65d1306 into main Jul 10, 2025
27 of 29 checks passed
@jeremylt jeremylt deleted the jeremy/fix-flop-count branch July 10, 2025 19:49
3 participants