You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
### Ticket
[Add huggingface org/model names of tt-transformers supported models
#23858](#23858)
### Problem description
The document did not list the supported models handle to set up HF_MODEL
for automatic weight download.
When setting up HF_MODEL, users would have to access huggingface to
search for the proper org/name handle.
### What's changed
- Added a chart with the verified models names, hardware and org\name as
in huggingface for easier setup and improved user experience.
### Checklist
- [x] [All post
commit](https://github.com/tenstorrent/tt-metal/actions/runs/15789101701)
CI passes
- [x] New/Existing tests provide coverage for changes
Co-authored-by: Mark O'Connor <moconnor@tenstorrent.com>
Copy file name to clipboardExpand all lines: models/tt_transformers/README.md
+23-16Lines changed: 23 additions & 16 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,19 +1,21 @@
1
1
# TT-Transformers
2
2
3
-
This code can run large language models that are similar to the Llama3 family and other similar models such as Qwen2.5, Mistral and DeepSeek-R1-Distill variants. Tensor-parallelism is automatically used to parallelize workloads across all available chips.
3
+
This code can run large language models such as the Llama3 family, Qwen2.5, Mistral, DeepSeek-R1-Distill variants and similar. Tensor-parallelism automatically distributes workloads across all available chips.
4
4
5
5
The current version is verified to work with the following models:
To run a demo, choose one of the methods below for downloading the model weights:
30
32
31
-
The easiest way to do this is to set the `HF_MODEL` environment variable to the Huggingface org/name of the model you want to run:
33
+
### 1. Automatic download
32
34
35
+
Set the `HF_MODEL` environment variable to the Huggingface org/name of the model you want to run, This will automatically download the weights into your HuggingFace cache directory and run the model directly.
36
+
37
+
Check the models chart on the top of the page and substitue the <org/name> on the following command:
This will automatically download the weights into your HuggingFace cache directory and run the model directly. If you wish, you can manually download the weights either from Huggingface or from Meta as described by the two following sections:
42
+
### 2. Manual download
43
+
44
+
If you wish, you can manually download the weights either from Huggingface or from Meta as described by the two following sections:
0 commit comments