feat: add post_training RuntimeConfig #2036

cdoern · 2025-04-26T14:45:57Z

What does this PR do?

certain APIs require a bunch of runtime arguments per-provider. The best way currently to pass these arguments in is via the provider config. This is tricky because it requires a provider to be pre-configured with certain arguments that a client side user should be able to pass in at runtime

Especially with the advent of out-of-tree providers, it would be great for a generic RuntimeConfig class to allow for providers to add and validate their own runtime arguments for things like supervised_fine_tune

For example: https://github.com/opendatahub-io/llama-stack-provider-kft has things like input-pvc, model-path, etc in the Provider Config. This is not sustainable nor is adding each and every field needed to the post_training API spec. RuntimeConfig has a sub-class called Config which allows for extra fields to arbitrarily be specified. It is the providers job to create its own class based on this one and add valid options, parse them, etc

certain APIs require a bunch of runtime arguments per-provider. The best way currently to pass these arguments in is via the provider config. This is tricky because it requires a provider to be pre-configured with certain arguments that a client side user should be able to pass in at runtime Especially with the advent of out-of-tree providers, it would be great for a generic RuntimeConfig class to allow for providers to add and validate their own runtime arguments for things like supervised_fine_tune For example: https://github.com/opendatahub-io/llama-stack-provider-kft has things like `input-pvc`, `model-path`, etc in the Provider Config. This is not sustainable nor is adding each and every field needed to the post_training API spec. RuntimeConfig has a sub-class called Config which allows for extra fields to arbitrarily be specified. It is the providers job to create its own class based on this one and add valid options, parse them, etc Signed-off-by: Charlie Doern <[email protected]>

ashwinb · 2025-04-26T17:26:30Z

This is tricky territory and I want to be extremely careful going down this path. At a certain point, the API will stop having meaning and only specific provider combinations will work because client code will be tied to the provider. Why have a generic API at that point? We have kept these escape hatches in the safety.run_shield() method for example, but I think that is an anti-pattern really. I would rather pull up whatever is needed and generalize those pieces and see if we can make them part of the API.

cdoern · 2025-04-26T18:20:37Z

good point @ashwinb, maybe I could approach a larger refactor to some of the args/class structures of the post training API to better fit more generic implementations whether they be distributed or single node. Implementing some out of tree providers has led me to discover some things I think would be generally helpful, (S3 compatibility, K8s native handling at times, etc)

github-actions · 2025-06-26T00:13:18Z

This pull request has been automatically marked as stale because it has not had activity within 60 days. It will be automatically closed if no further activity occurs within 30 days.

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Apr 26, 2025

cdoern force-pushed the runtime-args branch from 9ecc6da to 0ec5151 Compare April 26, 2025 14:47

github-actions bot added the stale label Jun 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add post_training RuntimeConfig #2036

feat: add post_training RuntimeConfig #2036

cdoern commented Apr 26, 2025

Uh oh!

ashwinb commented Apr 26, 2025

Uh oh!

cdoern commented Apr 26, 2025

Uh oh!

github-actions bot commented Jun 26, 2025

Uh oh!

Uh oh!

feat: add post_training RuntimeConfig #2036

Are you sure you want to change the base?

feat: add post_training RuntimeConfig #2036

Conversation

cdoern commented Apr 26, 2025

What does this PR do?

Uh oh!

ashwinb commented Apr 26, 2025

Uh oh!

cdoern commented Apr 26, 2025

Uh oh!

github-actions bot commented Jun 26, 2025

Uh oh!

Uh oh!