You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
dataset instructions changed from hugging face to kaggle
dataset instructions changed from hugging face to kaggle
Signed-off-by: Aryan Nanda <nandaaryan823@gmail.com>
changes in readme of cloud-edge-collaborative-inference done to use kaggle instead of huggingface
Signed-off-by: Aryan <nandaaryan823@gmail.com>
Copy file name to clipboardExpand all lines: examples/cloud-edge-collaborative-inference-for-llm/README.md
+26-26Lines changed: 26 additions & 26 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -99,35 +99,47 @@ Before using this example, you need to have the device ready:
99
99
100
100
The Docker-based setup assumes you have Docker installed on your system and are using an Ubuntu-based Linux distribution.
101
101
102
-
*If you don't have Docker installed, follow the Docker Engine installation guide [here](https://docs.docker.com/engine/install/ubuntu/).*
102
+
**Note**:
103
+
- If you don't have Docker installed, follow the Docker Engine installation guide [here](https://docs.docker.com/engine/install/ubuntu/).
104
+
- To enable Docker to download datasets from Kaggle within your docker container, you need to configure the Kaggle CLI authentication token. Please follow the [official Kaggle API documentation](https://www.kaggle.com/docs/api#:~:text=is%20%24PYTHON_HOME/Scripts.-,Authentication,-In%20order%20to) to download your `kaggle.json` token. Once downloaded, move the file to the `~/ianvs/examples/cloud-edge-collaborative-inference-for-llm/` directory after doing step 1(cloning the ianvs repo):
103
105
104
-
1. From the root directory of Ianvs, build the `cloud-edge-collaborative-inference-for-llm` Docker image:
@@ -171,23 +183,14 @@ If you want to use speculative decoding models like [EAGLE](https://github.com/S
171
183
172
184
##### Dataset Configuration
173
185
174
-
Here, we provide `MMLU-5-shot` dataset and `GPQA-diamond` dataset for testing. The following is the instruction for dataset preparation for `MMLU-5-shot`, `GPQA-diamond` follows the same progress.
175
-
176
-
1. Download `mmlu-5-shot` from [Ianvs-MMLU-5-shot](https://huggingface.co/datasets/FuryMartin/Ianvs-MMLU-5-shot), (or [Ianvs-GPQA-diamond](https://huggingface.co/datasets/FuryMartin/Ianvs-GPQA-diamond)) which is a transformed MMLU-5-shot dataset formatted to fit Ianvs's requirements.
2. Create a `dataset` folder in the root directory of Ianvs and move `mmlu-5-shot` into the `dataset` folder.
186
+
Here, we provide `MMLU-5-shot` dataset and `GPQA-diamond` dataset for testing. The following instruction for dataset preparation for `MMLU-5-shot`, `GPQA-diamond` follows the same progress.
187
187
188
+
1. Download `mmlu-5-shot` in the root directory of ianvs from [Ianvs-MMLU-5-shot](https://www.kaggle.com/datasets/kubeedgeianvs/ianvs-mmlu-5shot), which is a transformed MMLU-5-shot dataset formatted to fit Ianvs's requirements.
189
+
**Note**: To enable Docker to download datasets from Kaggle within your docker container, you need to configure the Kaggle CLI authentication token. Please follow the [official Kaggle API documentation](https://www.kaggle.com/docs/api#:~:text=is%20%24PYTHON_HOME/Scripts.-,Authentication,-In%20order%20to) to download your `kaggle.json` token.
3. Then, check the path of `train_data` and `test_data` in
@@ -342,11 +345,8 @@ The testing process may take much time, depending on the number of test cases an
342
345
343
346
To enable you directly get the results, here we provide a workspace folder with cached results of `Qwen/Qwen2.5-1.5B-Instruct`, `Qwen/Qwen2.5-3B-Instruct`,`Qwen/Qwen2.5-7B-Instruct` and `gpt-4o-mini`.
344
347
345
-
You can download `workspace-mmlu` folder from [Ianvs-MMLU-5-shot](https://huggingface.co/datasets/FuryMartin/Ianvs-MMLU-5-shot) and put it under your `ianvs` folder.
346
-
347
-
```bash
348
-
mv Ianvs-MMLU-5-shot/workspace-mmlu/ .
349
-
```
348
+
You can download `workspace-mmlu` folder from [Ianvs-MMLU-5-shot](https://www.kaggle.com/datasets/kubeedgeianvs/ianvs-mmlu-5shot) and put it under your `ianvs` folder.
349
+
- Since we have already downloaded the `Ianvs-MMLU-5-shot` folder. There is no need to do this again.
0 commit comments