@kemalcanbora

Fix: Resolve SyntaxError caused by invalid f-string with unquoted square-bracket expression

Feat: Add llama_cpp backend using llama-cpp-python bindings

  • Register LlamaCpp class as 'llama_cpp' model type
  • Support both local .gguf files and Hugging Face repo_id + filename
  • Implement streaming and non-streaming chat methods
  • Add basic message conversion compatible with qwen_agent format
  • Include common llama.cpp parameters (n_ctx, n_gpu_layers, n_threads, etc.)
  • Follow patterns from existing Transformers backend
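
For reviewers, here is a rough sketch of how the pieces listed above might fit together. It is not the code in this PR. The helper names (`to_llama_messages`, `build_load_kwargs`, `load_model`, `chat`) and the config keys are made up for illustration, and the message shape (dicts with `role` and a `content` that is either a string or a list of items with a `text` key) is an assumption about the qwen_agent format. The only llama-cpp-python calls used are `Llama(...)`, `Llama.from_pretrained(...)`, and `create_chat_completion(...)`.

```python
from typing import Any, Dict, Iterator, List, Tuple


def to_llama_messages(messages: List[Dict[str, Any]]) -> List[Dict[str, str]]:
    """Flatten qwen_agent-style messages (assumed shape) into role/content dicts."""
    converted = []
    for msg in messages:
        content = msg.get("content", "")
        if isinstance(content, list):
            # Assumption: list content holds dict items; keep only the text parts.
            content = "".join(
                item.get("text", "") for item in content if isinstance(item, dict)
            )
        converted.append({"role": msg["role"], "content": content or ""})
    return converted


def build_load_kwargs(cfg: Dict[str, Any]) -> Tuple[str, Dict[str, Any]]:
    """Pick the loading route: a local .gguf path, or a Hugging Face repo_id + filename."""
    # Hypothetical config keys; only a few common llama.cpp parameters are forwarded.
    common = {k: cfg[k] for k in ("n_ctx", "n_gpu_layers", "n_threads") if k in cfg}
    if "model_path" in cfg:
        return "local", {"model_path": cfg["model_path"], **common}
    if "repo_id" in cfg and "filename" in cfg:
        return "hub", {"repo_id": cfg["repo_id"], "filename": cfg["filename"], **common}
    raise ValueError("cfg needs either 'model_path' or both 'repo_id' and 'filename'")


def load_model(cfg: Dict[str, Any]):
    """Create a Llama instance; the import is deferred so the module loads without it."""
    from llama_cpp import Llama  # requires llama-cpp-python

    source, kwargs = build_load_kwargs(cfg)
    if source == "local":
        return Llama(**kwargs)
    # from_pretrained downloads the file via huggingface_hub before loading it.
    return Llama.from_pretrained(**kwargs)


def chat(llm: Any, messages: List[Dict[str, Any]], stream: bool = False) -> Iterator[str]:
    """Yield response text: one full string when stream=False, else incremental pieces."""
    payload = to_llama_messages(messages)
    if not stream:
        result = llm.create_chat_completion(messages=payload)
        yield result["choices"][0]["message"]["content"]
        return
    for chunk in llm.create_chat_completion(messages=payload, stream=True):
        delta = chunk["choices"][0]["delta"]
        # The first streamed chunk usually carries only the role, so skip empty deltas.
        if delta.get("content"):
            yield delta["content"]
```

Usage would look roughly like `llm = load_model({"model_path": "model.gguf", "n_ctx": 4096})` followed by `for piece in chat(llm, messages, stream=True): ...`, where `model.gguf` is a placeholder path.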
