v0.4.3: fix for Llama4, device memory usage details, vLLM container accepts params

@tengomucho tengomucho released this 10 Dec 16:23

What's Changed

Inference

Other

Full Changelog: v0.4.2...v0.4.3