This is amazing work.
I want to quantize the models so I can run them on Ollama; however, that requires the tokenizer. Is it the same tokenizer as the one in https://huggingface.co/meta-llama/Llama-3.1-8B ?
https://huggingface.co/meta-llama/Llama-3.1-8B/blob/main/original/tokenizer.model
I see that vLLM has its own code to get a tokenizer.
Do you know where I can get the right tokenizer.model file and config to use?
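In case it helps, here is a minimal sketch of how I would check whether a candidate tokenizer.model is byte-for-byte identical to the Llama 3.1 one, assuming both files have already been downloaded locally. The file paths in the usage comment are hypothetical placeholders, and matching hashes would only show the files are identical, not that this is the tokenizer the models actually expect.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read fixed-size chunks until read() returns empty bytes.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_file(path_a, path_b):
    """True if both files have identical contents (compared by SHA-256)."""
    return sha256_of(path_a) == sha256_of(path_b)


# Usage (hypothetical local paths, not taken from either repository):
# same_file("llama-3.1-8b/original/tokenizer.model", "this-repo/tokenizer.model")
```

If the hashes differ, the files differ, but the vocabularies could still overlap heavily, so a mismatch alone would not tell me which file is the right one to use.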