Commit beb9393
committed
Add Jlama - Java-based LLM inference engine to Inference Engines section
- Jlama: Modern LLM inference engine for Java with 1.2k+ stars
- Apache 2.0 licensed, actively maintained (last release Jan 2026)
- Supports multiple models (Gemma, Llama, Mistral, Qwen, IBM Granite, GPT-2, BERT)
- Features: PagedAttention, MoE, tool calling, embeddings, quantization (Q8/Q4)
- Includes CLI and REST API for easy integration1 parent b77337c commit beb9393
1 file changed
Lines changed: 1 addition & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
348 | 348 | | |
349 | 349 | | |
350 | 350 | | |
| 351 | + | |
351 | 352 | | |
352 | 353 | | |
353 | 354 | | |
| |||
0 commit comments