-
|
Has anyone gotten the google/gemma-3-270m model working with the candle transformers? I tried implementing the model https://github.com/vishpat/gemma-slurm/tree/c33fb9034f284558f222dc8df31e65a1067f463f, but I get a repeating value for the given test prompt. |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 4 replies
-
|
Got it working https://github.com/vishpat/gemma-slurm/tree/0.1 |
Beta Was this translation helpful? Give feedback.
-
|
thanks @vishpat . this is working now . The main fix here was you need to manually add sliding_window_pattern properly in config.json file |
Beta Was this translation helpful? Give feedback.
-
|
Hey @vishpat not sure if useful, but we implemented Gemma3 in Hope that's useful! 🤗 |
Beta Was this translation helpful? Give feedback.
Had to add the following to the model config.json