|
3 | 3 | url: "github:mudler/LocalAI/gallery/virtual.yaml@master" |
4 | 4 | urls: |
5 | 5 | - https://huggingface.co/Jackrong/Qwopus3.6-27B-Coder-Compat-MTP-GGUF |
6 | | - description: | |
7 | | - 🪐 Qwopus-3.6-27B-Coder |
8 | | - Coder SFT Release |
9 | | - |
10 | | - Agentic Coding & Tool-Use Reasoning Model Fine-Tuned on Qwopus3.6-27B-v2 |
11 | | - |
12 | | - 🧬 Trace Inversion & Negentropy |
13 | | - 🧠 27B Dense Model |
14 | | - ⚡ Agentic Coding |
15 | | - 🛠️ Tool Calling & Agent |
16 | | - 🏆 SWE-bench Verified: 67.0% (off-thinking) |
17 | | - |
18 | | - 💡 What is Qwopus-3.6-27B-Coder? |
19 | | - 🪐 Qwopus-3.6-27B-Coder is a reasoning-enhanced agentic coding model built on top of Qwopus3.6-27B-v2. It inherits the powerful reasoning foundation of the v2 base — which achieved 87.43% MMLU-Pro and 75.25% SWE-bench Verified — and further specializes it for agentic code generation, structured tool calling, debugging, and instruction-following in developer workflows. The model is designed to excel at repository-level coding tasks, multi-turn tool orchestration, and complex logical reasoning under realistic agent environments. |
20 | | - |
21 | | - 🧩 Agentic Coding |
22 | | - Optimized for repository-level coding, debugging, patch generation, and structured multi-step development workflows. |
23 | | - |
24 | | - 🛠️ Tool Calling |
25 | | - Learns from real agent trajectories with tool definitions, tool calls, and environment feedback for robust multi-turn execution. |
26 | | - |
27 | | - ... |
| 6 | + description: "\U0001FA90 Qwopus-3.6-27B-Coder\nCoder SFT Release\n\nAgentic Coding & Tool-Use Reasoning Model Fine-Tuned on Qwopus3.6-27B-v2\n\n\U0001F9EC Trace Inversion & Negentropy\n\U0001F9E0 27B Dense Model\n⚡ Agentic Coding\n\U0001F6E0️ Tool Calling & Agent\n\U0001F3C6 SWE-bench Verified: 67.0% (off-thinking)\n\n\U0001F4A1 What is Qwopus-3.6-27B-Coder?\n\U0001FA90 Qwopus-3.6-27B-Coder is a reasoning-enhanced agentic coding model built on top of Qwopus3.6-27B-v2. It inherits the powerful reasoning foundation of the v2 base — which achieved 87.43% MMLU-Pro and 75.25% SWE-bench Verified — and further specializes it for agentic code generation, structured tool calling, debugging, and instruction-following in developer workflows. The model is designed to excel at repository-level coding tasks, multi-turn tool orchestration, and complex logical reasoning under realistic agent environments.\n\n\U0001F9E9 Agentic Coding\nOptimized for repository-level coding, debugging, patch generation, and structured multi-step development workflows.\n\n\U0001F6E0️ Tool Calling\nLearns from real agent trajectories with tool definitions, tool calls, and environment feedback for robust multi-turn execution.\n\n...\n" |
28 | 7 | license: "apache-2.0" |
29 | 8 | tags: |
30 | 9 | - llm |
|
241 | 220 | url: "github:mudler/LocalAI/gallery/virtual.yaml@master" |
242 | 221 | urls: |
243 | 222 | - https://huggingface.co/unsloth/GLM-5.2-GGUF |
244 | | - description: | |
245 | | - # GLM-5.2 |
246 | | - |
247 | | - 👋 Join our WeChat or Discord community. |
248 | | - |
249 | | - 📖 Check out the GLM-5.2 blog and GLM-5 Technical report. |
250 | | - |
251 | | - 📍 Use GLM-5.2 API services on Z.ai API Platform. |
252 | | - |
253 | | - 🔜 Try GLM-5.2 here. |
254 | | - |
255 | | - [Paper] |
256 | | - [GitHub] |
257 | | - |
258 | | - ## Introduction |
259 | | - |
260 | | - We're introducing GLM-5.2, our latest flagship model for long-horizon tasks. It marks a substantial leap in long-horizon task capability over its predecessor GLM-5.1 and, for the first time, delivers that capability on a **solid 1M-token context**. GLM-5.2's new capabilities include: |
261 | | - - **Solid 1M Context:** A solid 1M-token context that stably sustains long-horizon work |
262 | | - - **Advanced Coding with Flexible Effort**: Stronger coding capabilities with multiple thinking effort levels to balance performance and latency |
263 | | - - **Improved Architecture**: We propose IndexShare, which reuses the same indexer across every four sparse attention layers, reducing per-token FLOPs by 2.9× at a 1M context length. We also improve GLM-5.2’s MTP layer for speculative decoding, increasing the acceptance length by up to 20% |
264 | | - - **Pure Open**: An MIT open-source license — no regional limits, technical access without borders |
265 | | - |
266 | | - ## Benchmark |
267 | | - |
268 | | - ## Serve GLM-5.2 Locally |
269 | | - |
270 | | - ... |
| 223 | + description: "# GLM-5.2\n\n\U0001F44B Join our WeChat or Discord community.\n\n\U0001F4D6 Check out the GLM-5.2 blog and GLM-5 Technical report.\n\n\U0001F4CD Use GLM-5.2 API services on Z.ai API Platform.\n\n\U0001F51C Try GLM-5.2 here.\n\n[Paper]\n[GitHub]\n\n## Introduction\n\nWe're introducing GLM-5.2, our latest flagship model for long-horizon tasks. It marks a substantial leap in long-horizon task capability over its predecessor GLM-5.1 and, for the first time, delivers that capability on a **solid 1M-token context**. GLM-5.2's new capabilities include:\n - **Solid 1M Context:** A solid 1M-token context that stably sustains long-horizon work\n - **Advanced Coding with Flexible Effort**: Stronger coding capabilities with multiple thinking effort levels to balance performance and latency\n - **Improved Architecture**: We propose IndexShare, which reuses the same indexer across every four sparse attention layers, reducing per-token FLOPs by 2.9× at a 1M context length. We also improve GLM-5.2’s MTP layer for speculative decoding, increasing the acceptance length by up to 20%\n - **Pure Open**: An MIT open-source license — no regional limits, technical access without borders\n\n## Benchmark\n\n## Serve GLM-5.2 Locally\n\n...\n" |
271 | 224 | license: "mit" |
272 | 225 | tags: |
273 | 226 | - llm |
|
390 | 343 | url: "github:mudler/LocalAI/gallery/virtual.yaml@master" |
391 | 344 | urls: |
392 | 345 | - https://huggingface.co/michaelw9999/Qwopus3.6-27B-v2-MTP-NVFP4-GGUF |
393 | | - description: | |
394 | | - 🪐 Qwopus3.6-27B-v2-MTP |
395 | | - MTP Release |
396 | | - |
397 | | - Multi-Token Prediction reasoning model fine-tuned from Qwen3.6-27B |
398 | | - |
399 | | - 🧬 Trace Inversion & Negentropy |
400 | | - 🧠 27B Parameters |
401 | | - ⚡ Speculative Decoding |
402 | | - 🛠️ Coding / DevOps / Math |
403 | | - |
404 | | - 💡 What is Qwopus3.6-27B-v2-MTP? |
405 | | - 🪐 Qwopus3.6-27B-v2-MTP is a speed-oriented reasoning release built on top of Qwen3.6-27B. It keeps the Qwopus line's focus on reconstructed reasoning traces, coding discipline, DevOps procedures, and mathematical derivations, while adding Multi-Token Prediction for faster generation. The goal is simple: preserve the depth and structure of a 27B reasoning model while making real interactive use noticeably faster. |
406 | | - |
407 | | - ⚡ MTP DecodingAuxiliary future-token prediction improves throughput on long reasoning, code, math, and strict-format prompts. |
408 | | - 🧩 Structured ReasoningInherits the Qwopus training recipe built around reconstructed step-by-step reasoning trajectories. |
409 | | - 🧪 GB10 TestedValidated on a 30-question local benchmark across Logic, Coding, DevOps, Math, and Edge tasks. |
410 | | - 🚀 Practical SpeedDesigned for workflows where strong answers matter, but waiting several extra minutes per task does not. |
411 | | - |
412 | | - ... |
| 346 | + description: "\U0001FA90 Qwopus3.6-27B-v2-MTP\nMTP Release\n\nMulti-Token Prediction reasoning model fine-tuned from Qwen3.6-27B\n\n\U0001F9EC Trace Inversion & Negentropy\n\U0001F9E0 27B Parameters\n⚡ Speculative Decoding\n\U0001F6E0️ Coding / DevOps / Math\n\n\U0001F4A1 What is Qwopus3.6-27B-v2-MTP?\n\U0001FA90 Qwopus3.6-27B-v2-MTP is a speed-oriented reasoning release built on top of Qwen3.6-27B. It keeps the Qwopus line's focus on reconstructed reasoning traces, coding discipline, DevOps procedures, and mathematical derivations, while adding Multi-Token Prediction for faster generation. The goal is simple: preserve the depth and structure of a 27B reasoning model while making real interactive use noticeably faster.\n\n⚡ MTP DecodingAuxiliary future-token prediction improves throughput on long reasoning, code, math, and strict-format prompts.\n\U0001F9E9 Structured ReasoningInherits the Qwopus training recipe built around reconstructed step-by-step reasoning trajectories.\n\U0001F9EA GB10 TestedValidated on a 30-question local benchmark across Logic, Coding, DevOps, Math, and Edge tasks.\n\U0001F680 Practical SpeedDesigned for workflows where strong answers matter, but waiting several extra minutes per task does not.\n\n...\n" |
413 | 347 | tags: |
414 | 348 | - llm |
415 | 349 | - gguf |
|
435 | 369 | url: "github:mudler/LocalAI/gallery/virtual.yaml@master" |
436 | 370 | urls: |
437 | 371 | - https://huggingface.co/michaelw9999/Qwopus3.6-27B-Coder-MTP-NVFP4-GGUF |
438 | | - description: | |
439 | | - 🪐 Qwopus-3.6-27B-Coder |
440 | | - Coder SFT Release |
441 | | - |
442 | | - Agentic Coding & Tool-Use Reasoning Model Fine-Tuned on Qwopus3.6-27B-v2 |
443 | | - |
444 | | - 🧬 Trace Inversion & Negentropy |
445 | | - 🧠 27B Dense Model |
446 | | - ⚡ Agentic Coding |
447 | | - 🛠️ Tool Calling & Agent |
448 | | - 🏆 SWE-bench Verified: 67.0% (off-thinking) |
449 | | - |
450 | | - 💡 What is Qwopus-3.6-27B-Coder? |
451 | | - 🪐 Qwopus-3.6-27B-Coder is a reasoning-enhanced agentic coding model built on top of Qwopus3.6-27B-v2. It inherits the powerful reasoning foundation of the v2 base — which achieved 87.43% MMLU-Pro (300ex) and 75.25% SWE-bench Verified — and further specializes it for agentic code generation, structured tool calling, debugging, and instruction-following in developer workflows. The model is designed to excel at repository-level coding tasks, multi-turn tool orchestration, and complex logical reasoning under realistic agent environments. |
452 | | - |
453 | | - 🧩 Agentic Coding |
454 | | - Optimized for repository-level coding, debugging, patch generation, and structured multi-step development workflows. |
455 | | - |
456 | | - 🛠️ Tool Calling |
457 | | - Learns from real agent trajectories with tool definitions, tool calls, and environment feedback for robust multi-turn execution. |
458 | | - |
459 | | - ... |
| 372 | + description: "\U0001FA90 Qwopus-3.6-27B-Coder\nCoder SFT Release\n\nAgentic Coding & Tool-Use Reasoning Model Fine-Tuned on Qwopus3.6-27B-v2\n\n\U0001F9EC Trace Inversion & Negentropy\n\U0001F9E0 27B Dense Model\n⚡ Agentic Coding\n\U0001F6E0️ Tool Calling & Agent\n\U0001F3C6 SWE-bench Verified: 67.0% (off-thinking)\n\n\U0001F4A1 What is Qwopus-3.6-27B-Coder?\n\U0001FA90 Qwopus-3.6-27B-Coder is a reasoning-enhanced agentic coding model built on top of Qwopus3.6-27B-v2. It inherits the powerful reasoning foundation of the v2 base — which achieved 87.43% MMLU-Pro (300ex) and 75.25% SWE-bench Verified — and further specializes it for agentic code generation, structured tool calling, debugging, and instruction-following in developer workflows. The model is designed to excel at repository-level coding tasks, multi-turn tool orchestration, and complex logical reasoning under realistic agent environments.\n\n\U0001F9E9 Agentic Coding\nOptimized for repository-level coding, debugging, patch generation, and structured multi-step development workflows.\n\n\U0001F6E0️ Tool Calling\nLearns from real agent trajectories with tool definitions, tool calls, and environment feedback for robust multi-turn execution.\n\n...\n" |
460 | 373 | tags: |
461 | 374 | - llm |
462 | 375 | - gguf |
|
1676 | 1589 | use_tokenizer_template: true |
1677 | 1590 | files: |
1678 | 1591 | - filename: llama-cpp/models/Qwopus3.6-27B-v2-MTP-GGUF/Qwopus3.6-27B-v2-MTP-Q4_K_M.gguf |
1679 | | - sha256: 818d68223be4d8518dac0b3b5604dde633cbbcbae1f491d842a3e26711c6606d |
1680 | 1592 | uri: https://huggingface.co/Jackrong/Qwopus3.6-27B-v2-MTP-GGUF/resolve/main/Qwopus3.6-27B-v2-MTP-Q4_K_M.gguf |
| 1593 | + sha256: 31cf5fc2406a0c7aaebcc26d440bf0df94e215d0589d5205bf319649c052b50a |
1681 | 1594 | - name: "qwen3.6-40b-claude-4.6-opus-deckard-heretic-uncensored-thinking-neo-code-di-imatrix-max" |
1682 | 1595 | url: "github:mudler/LocalAI/gallery/virtual.yaml@master" |
1683 | 1596 | urls: |
|
0 commit comments