LocalVLLMModel and deployment handler #102

Merged
merged 14 commits into from
Mar 17, 2025

michaelharrisonmai
Collaborator

This PR introduces LocalVLLMModel and LocalVLLMDeploymentHandler. The former is a regular Model (i.e., it implements generate); the latter is a new class that handles the deployment side: launching the servers, running health checks on your deployment, and shutting the servers down.

To use LocalVLLMModel, either define a ModelConfig or pass the deployment info on the command line, so that vllm can recognize your existing deployment or even deploy for you.

If you have already deployed, pass the "ports" parameter. Otherwise, LocalVLLMDeploymentHandler will spin up "num_servers" servers (default: 1) for you, wait for deployment to finish, and then continue with the eval pipeline.
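
For reviewers who want a mental model of that flow, here is a minimal sketch of the "reuse ports or deploy, then wait" logic. It is not the code in this PR: `get_ready_ports`, `launch_vllm_server`, `http_health_check`, `base_port`, `timeout_s`, and `poll_s` are hypothetical names chosen for illustration, and it assumes the `vllm` CLI is installed and that each server answers `GET /health` on localhost once it is ready.

```python
import http.client
import subprocess
import time
import urllib.request
from typing import Callable, List, Optional


def http_health_check(port: int, timeout: float = 2.0) -> bool:
    """Return True if a server on localhost answers GET /health with HTTP 200."""
    url = f"http://localhost:{port}/health"  # assumption: health endpoint path
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except (OSError, http.client.HTTPException):
        # Connection refused, timeout, or a malformed reply: not ready yet.
        return False


def launch_vllm_server(model_name: str, port: int) -> subprocess.Popen:
    """Start one server process (assumption: `vllm serve MODEL --port PORT`)."""
    return subprocess.Popen(["vllm", "serve", model_name, "--port", str(port)])


def get_ready_ports(
    model_name: str,
    ports: Optional[List[int]] = None,
    num_servers: int = 1,
    base_port: int = 8000,  # hypothetical default, not taken from the PR
    launch: Callable = launch_vllm_server,
    is_healthy: Callable[[int], bool] = http_health_check,
    timeout_s: float = 600.0,
    poll_s: float = 5.0,
) -> List[int]:
    """Return ports of healthy servers, deploying new ones only if none are given."""
    if ports:
        # Already deployed: verify the given ports instead of launching anything.
        unhealthy = [p for p in ports if not is_healthy(p)]
        if unhealthy:
            raise RuntimeError(f"No healthy server on ports: {unhealthy}")
        return list(ports)

    # Not deployed yet: launch num_servers servers on consecutive ports.
    new_ports = [base_port + i for i in range(num_servers)]
    procs = [launch(model_name, p) for p in new_ports]

    # Poll until every server reports healthy, or give up at the deadline.
    deadline = time.monotonic() + timeout_s
    pending = set(new_ports)
    while pending:
        pending = {p for p in pending if not is_healthy(p)}
        if not pending:
            break
        if time.monotonic() > deadline:
            for proc in procs:
                proc.terminate()  # shut down the servers we started
            raise TimeoutError(f"Servers not ready on ports: {sorted(pending)}")
        time.sleep(poll_s)
    return new_ports
```

With an existing deployment this would be called as `get_ready_ports("my-model", ports=[8000, 8001])`; without one, `get_ready_ports("my-model", num_servers=2)`. The launcher and health check are passed in as parameters so the control flow can be exercised without a GPU or a real vllm install.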

gugarosa
gugarosa previously approved these changes Mar 5, 2025
@michaelharrisonmai
Collaborator Author

@microsoft-github-policy-service agree company="Microsoft"

Michael Harrison added 2 commits March 15, 2025 18:27
@michaelharrisonmai michaelharrisonmai merged commit bd1a02b into main Mar 17, 2025
6 checks passed
4 participants