llama.cpp supports partially offloading model layers to the CPU, so those layers sit in system RAM instead of VRAM. It would be great if LocalAI showed RAM usage alongside VRAM usage.
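
For reference, a minimal sketch (in Go, since LocalAI is written in Go) of one way the RAM side of this could be gathered: reading a process's resident set size (`VmRSS`) from `/proc/<pid>/status` on Linux. The function name and the idea of polling a backend process's pid are assumptions for illustration, not LocalAI's actual API; the VRAM figure would still come from whatever GPU tooling the existing VRAM display already uses.

```go
// Sketch only: report resident memory (RAM) for a process on Linux by
// parsing the VmRSS line of /proc/<pid>/status. Which pid to poll (e.g.
// the llama.cpp backend process) is an assumption, not LocalAI's API.
package main

import (
	"bufio"
	"fmt"
	"os"
	"strconv"
	"strings"
)

// residentMemoryKB returns the VmRSS value (in kB) for the given pid.
func residentMemoryKB(pid int) (int64, error) {
	f, err := os.Open(fmt.Sprintf("/proc/%d/status", pid))
	if err != nil {
		return 0, err
	}
	defer f.Close()

	scanner := bufio.NewScanner(f)
	for scanner.Scan() {
		line := scanner.Text()
		if strings.HasPrefix(line, "VmRSS:") {
			// Line looks like: "VmRSS:   123456 kB"
			fields := strings.Fields(line)
			if len(fields) >= 2 {
				return strconv.ParseInt(fields[1], 10, 64)
			}
		}
	}
	return 0, fmt.Errorf("VmRSS not found for pid %d", pid)
}

func main() {
	// Example: report this process's own RAM usage.
	kb, err := residentMemoryKB(os.Getpid())
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	fmt.Printf("RAM (RSS): %.1f MiB\n", float64(kb)/1024.0)
}
```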