Skip to content

Commit 015835d

Browse files
authored
models(gallery): add phi-3 vision (#3890)
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent 313ea2c commit 015835d

File tree

2 files changed

+36
-0
lines changed

2 files changed

+36
-0
lines changed

gallery/index.yaml

+13
Original file line numberDiff line numberDiff line change
@@ -6225,6 +6225,19 @@
62256225
- filename: Phi-3.5-mini-TitanFusion-0.2.Q4_K_M.gguf
62266226
sha256: 9579305712f2bca246914639c4873acdc1e7bc64ac2c7db0230df4f0ca0ef234
62276227
uri: huggingface://mradermacher/Phi-3.5-mini-TitanFusion-0.2-GGUF/Phi-3.5-mini-TitanFusion-0.2.Q4_K_M.gguf
6228+
- !!merge <<: *phi-3
6229+
name: "phi-3-vision:vllm"
6230+
url: "github:mudler/LocalAI/gallery/phi-3-vision.yaml@master"
6231+
description: |
6232+
Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which include - synthetic data and filtered publicly available websites - with a focus on very high-quality, reasoning dense data both on text and vision. The model belongs to the Phi-3 model family, and the multimodal version comes with 128K context length (in tokens) it can support. The model underwent a rigorous enhancement process, incorporating both supervised fine-tuning and direct preference optimization to ensure precise instruction adherence and robust safety measures.
6233+
- !!merge <<: *phi-3
6234+
name: "phi-3.5-vision:vllm"
6235+
url: "github:mudler/LocalAI/gallery/phi-3-vision.yaml@master"
6236+
override:
6237+
parameters:
6238+
model: microsoft/Phi-3.5-vision-instruct
6239+
description: |
6240+
Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which include - synthetic data and filtered publicly available websites - with a focus on very high-quality, reasoning dense data both on text and vision. The model belongs to the Phi-3 model family, and the multimodal version comes with 128K context length (in tokens) it can support. The model underwent a rigorous enhancement process, incorporating both supervised fine-tuning and direct preference optimization to ensure precise instruction adherence and robust safety measures.
62286241
- &hermes-2-pro-mistral
62296242
### START Hermes
62306243
url: "github:mudler/LocalAI/gallery/hermes-2-pro-mistral.yaml@master"

gallery/phi-3-vision.yaml

+23
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,23 @@
1+
---
2+
name: "phi3-vision"
3+
4+
config_file: |
5+
name: phi3-vision
6+
backend: vllm
7+
parameters:
8+
model: microsoft/Phi-3-vision-128k-instruct
9+
trust_remote_code: true
10+
max_model_len: 32768
11+
template:
12+
chat_message: |-
13+
<|{{ .RoleName }}|>
14+
{{.Content}}<|end|>
15+
chat: >-
16+
{{.Input}}
17+
18+
<|assistant|>
19+
20+
completion: |
21+
{{.Input}}
22+
use_tokenizer_template: false
23+
image: "<|image_{{ add1 .ID }}|>\n{{.Text}}"

0 commit comments

Comments
 (0)