Moving evaluation inside generation class and enforcing empty generations when remove_thinking=True #3157
gpu_tests.yml
on: pull_request
gpu-tests-llama
0s
gpu-tests-qwen
0s