Skip to content

[Feature] Parse the vLLM baseline log for the default values. #106

@thameem-abbas

Description

@thameem-abbas

The values of a number of parameters is taken from definitions in vLLM and vary on a model-to-model basis. We need to extract this information from the vLLM logs (for the time-being) to get the right parameter information.

Example: cuda-graph-sizes, attention backends, etc,.....

Metadata

Metadata

Assignees

Labels

kind/featureCategorizes issue or PR as related to a new feature.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions