feat: support dynamic configuration for dynamic batching without service restart #448

@wenzhaojie

Describe the feature

We can set parameters such as `num=1`, `max_batch_size=4`, and `max_wait_time=10` when calling `append_worker()`. However, these parameters cannot be modified while the service is running. Would it be possible to provide an API for adjusting them at runtime?
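
To make the request concrete, here is a minimal sketch of what a runtime-adjustable batching configuration could look like. It is not the library's actual API: `BatchParams`, `DynamicBatcher`, `update()`, and `next_batch()` are hypothetical names invented for illustration, and `max_wait_time` is treated as milliseconds in this sketch. The idea is that the batching loop takes a snapshot of the current parameters at the start of every batch, so an update takes effect on the next batch without a restart.

```python
import dataclasses
import queue
import threading
import time
from dataclasses import dataclass


@dataclass(frozen=True)
class BatchParams:
    """Hypothetical, immutable bundle of the batching parameters."""

    max_batch_size: int = 4
    max_wait_time: int = 10  # treated as milliseconds in this sketch

    def __post_init__(self) -> None:
        # Reject values that would make batching meaningless.
        if self.max_batch_size < 1:
            raise ValueError("max_batch_size must be >= 1")
        if self.max_wait_time < 0:
            raise ValueError("max_wait_time must be >= 0")


class DynamicBatcher:
    """Hypothetical batcher whose parameters can be replaced at runtime."""

    def __init__(self, params: BatchParams) -> None:
        self._params = params
        self._lock = threading.Lock()
        self._queue: queue.Queue = queue.Queue()

    @property
    def params(self) -> BatchParams:
        with self._lock:
            return self._params

    def update(self, **changes) -> BatchParams:
        """Swap in new parameters; they apply from the next batch on."""
        with self._lock:
            # replace() re-runs validation, so a bad value leaves the
            # current parameters untouched.
            self._params = dataclasses.replace(self._params, **changes)
            return self._params

    def submit(self, item) -> None:
        self._queue.put(item)

    def next_batch(self) -> list:
        params = self.params  # snapshot, so one batch sees one consistent config
        batch = [self._queue.get()]  # block until at least one item arrives
        deadline = time.monotonic() + params.max_wait_time / 1000
        while len(batch) < params.max_batch_size:
            remaining = deadline - time.monotonic()
            if remaining <= 0:
                break
            try:
                batch.append(self._queue.get(timeout=remaining))
            except queue.Empty:
                break
        return batch


if __name__ == "__main__":
    batcher = DynamicBatcher(BatchParams(max_batch_size=4, max_wait_time=500))
    for i in range(6):
        batcher.submit(i)
    print(batcher.next_batch())        # batch built with max_batch_size=4
    batcher.update(max_batch_size=2)   # changed while "running"
    print(batcher.next_batch())        # batch built with max_batch_size=2
```

In a real service, `update()` would presumably sit behind an admin endpoint or a signal handler, and the number of worker processes (`num`) would need separate handling, since it involves starting or stopping processes rather than swapping a value.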

Why do you need this feature?

With such a control API, we could adjust resource configurations without restarting the service and change QoS (Quality of Service) dynamically.

Additional context

No response


Labels: enhancement
