added modelcar #149

rcarrata · 2024-09-27T18:24:23Z

This PR aims to:

Enable ModelCars in the cluster
Add an InferenceService (using ModelCars) and a vLLM-based ServingRuntime within RHOAI to deploy Granite in the ic-shared-llm namespace
Remove the "old" deployment of the Mistral LLM that used plain k8s resources
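
For readers unfamiliar with ModelCars, the two pieces above can be sketched roughly as follows. This is an illustration only, not the code in this PR. It assumes the upstream KServe behaviour where ModelCars are switched on through the `enableModelcar` field of the `storageInitializer` JSON in the `inferenceservice-config` ConfigMap, and where a ModelCar is referenced with an `oci://` storageUri. The function names, the runtime name, the model format name, and the image reference are hypothetical. The Argo CD `IgnoreExtraneous` compare option is included because the commits mention it; where exactly the PR places it is not shown here.

```python
import json
from typing import Tuple


def enable_modelcar(storage_initializer_json: str) -> Tuple[str, bool]:
    """Set enableModelcar to true in a storageInitializer JSON string.

    Returns (json_text, changed). If the flag is already true, the input
    is returned untouched, which mirrors a "check beforehand" step.
    """
    config = json.loads(storage_initializer_json)
    if config.get("enableModelcar") is True:
        return storage_initializer_json, False
    config["enableModelcar"] = True
    return json.dumps(config, indent=4), True


def build_inference_service(name: str, namespace: str, runtime: str, oci_image: str) -> dict:
    """Build an InferenceService manifest (as a dict) that pulls the model from an OCI image."""
    if not oci_image.startswith("oci://"):
        raise ValueError("a ModelCar storageUri is expected to start with oci://")
    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {
            "name": name,
            "namespace": namespace,
            "annotations": {
                # Ask Argo CD not to report extraneous resources as out-of-sync.
                "argocd.argoproj.io/compare-options": "IgnoreExtraneous",
            },
        },
        "spec": {
            "predictor": {
                "model": {
                    # Hypothetical format name; it must match the ServingRuntime.
                    "modelFormat": {"name": "vLLM"},
                    "runtime": runtime,
                    "storageUri": oci_image,
                }
            }
        },
    }


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    patched, changed = enable_modelcar('{"enableModelcar": false}')
    print("storageInitializer changed:", changed)
    isvc = build_inference_service(
        "granite", "ic-shared-llm", "vllm", "oci://registry.example.com/models/granite:1.0"
    )
    print(json.dumps(isvc, indent=2))
```

The dict printed at the end can be dumped to YAML or JSON and applied with the usual tooling; the check in `enable_modelcar` keeps the patch step idempotent when a job runs it repeatedly.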

rcarrata added 10 commits September 27, 2024 20:22

added modelcar and job/rbac

c8fae53

fix script

c62645d

check if the modelcar is enabled beforehand

1847067

add ignoreextraneous for avoid out-of-sync

6e2c4c3

add ignoreextraneous for avoid out-of-sync

e56e253

add ignoreextraneous for avoid out-of-sync

958d901

add ignoreextraneous for avoid out-of-sync

f8a6832

add ignoreextraneous for avoid out-of-sync

ee7ec0b

add ignoreextraneous at inferenceservice level

8d44ee5

add ignoreextraneous at inferenceservice level

44ddb79

rcarrata closed this Sep 27, 2024

rcarrata deleted the fix/modelcars branch September 27, 2024 19:47
