Skip to content

v0.4.0

Latest

Choose a tag to compare

@dmahan93 dmahan93 released this 10 Mar 04:20
· 8 commits to main since this release
1d78069

Highlights

New example trainer

Weights are shared between vLLM and the trainer, no comms needed to sync weights, and memory saved by using only one copy of the weights!

On Policy/Self Distillation Support

Now support logprobs from a teacher/prompted endpoint, fully supporting on policy distillation/self distillation!

OpenAI Endpoint for managed server

Launch an openai endpoint and collect rollouts from any program that takes in an openai endpoint!

What's Changed

New Contributors

Full Changelog: v0.3.0...v0.4.0