|
1 | 1 | --- |
| 2 | +- name: "qwopus3.5-9b-coder" |
| 3 | + url: "github:mudler/LocalAI/gallery/virtual.yaml@master" |
| 4 | + urls: |
| 5 | + - https://huggingface.co/Jackrong/Qwopus3.5-9B-Coder-GGUF |
| 6 | + description: | |
| 7 | + # 🌟 Qwopus3.5-9B-v3.5 |
| 8 | + |
| 9 | + ## 💡 Model Overview & v3.5 Design |
| 10 | + |
| 11 | + Qwopus3.5-9B-v3.5 is a **data-scaled continuation** of the Qwopus3.5-9B-v3 model. |
| 12 | + |
| 13 | + The training data in v3.5 is expanded to cover a broader range of domains, including mathematics, programming, puzzle-solving, multilingual dialogue, instruction-following, multi-turn interactions, and STEM-related tasks. |
| 14 | + |
| 15 | + Qwopus3.5-9B-v3.5 is a reasoning-enhanced model based on **Qwen3.5-9B**, designed for: |
| 16 | + |
| 17 | + - 🧩 Structured reasoning |
| 18 | + - 🔧 Tool-augmented workflows |
| 19 | + - 🔁 Multi-step agentic tasks |
| 20 | + - ⚡ Token-efficient inference |
| 21 | + |
| 22 | + Compared with Qwopus3.5-9B-v3, **3.5 version does not introduce a new architecture, RL stage, or template redesign**. |
| 23 | + |
| 24 | + This version is trained with approximately **2× more SFT data**. |
| 25 | + |
| 26 | + ## 🎯 Motivation & Generalization Insight |
| 27 | + |
| 28 | + The motivation behind v3.5 comes from a simple observation: |
| 29 | + |
| 30 | + > This work is motivated by the hypothesis that scaling high-quality SFT data may further enhance the generalization ability of large language models. |
| 31 | + |
| 32 | + In earlier Qwopus3.5 experiments, structured reasoning was observed to improve both **accuracy and efficiency**: |
| 33 | + |
| 34 | + ... |
| 35 | + license: "apache-2.0" |
| 36 | + tags: |
| 37 | + - llm |
| 38 | + - gguf |
| 39 | + - vision |
| 40 | + - multimodal |
| 41 | + - reasoning |
| 42 | + icon: https://cdn-uploads.huggingface.co/production/uploads/66309bd090589b7c65950665/9EnS13MSxNU3snpAgEiLq.jpeg |
| 43 | + overrides: |
| 44 | + backend: llama-cpp |
| 45 | + function: |
| 46 | + automatic_tool_parsing_fallback: true |
| 47 | + grammar: |
| 48 | + disable: true |
| 49 | + known_usecases: |
| 50 | + - chat |
| 51 | + mmproj: llama-cpp/mmproj/Qwopus3.5-9B-Coder-GGUF/mmproj-F32.gguf |
| 52 | + options: |
| 53 | + - use_jinja:true |
| 54 | + parameters: |
| 55 | + model: llama-cpp/models/Qwopus3.5-9B-Coder-GGUF/Qwopus3.5-9B-coder-Exp-Q4_K_M.gguf |
| 56 | + template: |
| 57 | + use_tokenizer_template: true |
| 58 | + files: |
| 59 | + - filename: llama-cpp/models/Qwopus3.5-9B-Coder-GGUF/Qwopus3.5-9B-coder-Exp-Q4_K_M.gguf |
| 60 | + sha256: 4e8f836c4afe01e9bf2a9931434ddd15e2a243cd8ec82a0aa3bc6573d4564051 |
| 61 | + uri: https://huggingface.co/Jackrong/Qwopus3.5-9B-Coder-GGUF/resolve/main/Qwopus3.5-9B-coder-Exp-Q4_K_M.gguf |
| 62 | + - filename: llama-cpp/mmproj/Qwopus3.5-9B-Coder-GGUF/mmproj-F32.gguf |
| 63 | + sha256: 5c769161b31697b6a2d83d8a806f37ee8ee7104bca15313c608dc53359fa0ef2 |
| 64 | + uri: https://huggingface.co/Jackrong/Qwopus3.5-9B-Coder-GGUF/resolve/main/mmproj-F32.gguf |
2 | 65 | - name: "gemma-4-12b-it-qat-q4_0" |
3 | 66 | url: "github:mudler/LocalAI/gallery/virtual.yaml@master" |
4 | 67 | urls: |
|
0 commit comments