README.md
Lines changed: 7 additions & 33 deletions

    @@ -3,31 +3,16 @@
     CloudAI benchmark framework aims to develop an industry standard benchmark focused on grading Data Center (DC) scale AI systems in the Cloud. The primary motivation is to provide automated benchmarking on various systems.

     ## Get Started
    -**Note**: instructions for installing a custom python version are available [here](#install-custom-python-version).
    -
     **Note**: instructions for setting up access for `enroot` are available [here](#set-up-access-to-the-private-ngc-registry).

    -1. Clone the CloudAI repository to your local machine:
    -```bash
    -git clone git@github.com:NVIDIA/cloudai.git
    -cd cloudai
    -```
    -
    -2. Create a virtual environment:
    -```bash
    -python -m venv venv
    -source venv/bin/activate
    -```
    -
    -3. Next, install the required packages:
    -```bash
    -pip install .
    -```
    +Using `uv` tool allows users to run CloudAI without manually managing required Python versions and dependencies.
    +```bash
    +git clone git@github.com:NVIDIA/cloudai.git
    +cd cloudai
    +uv run cloudai --help
    +```

    -For development please use the following command:
    -```bash
    -pip install -e '.[dev]'
    -```
    +For details and `pip`-based installation, please refer to the [documentation](https://nvidia.github.io/cloudai/#get-started).

     Replace `<api-key>` with your respective credentials. Keep `$oauthtoken` as is.


    -### Install custom python version
    -If your system python version is not supported, you can install a custom version using [uv](https://docs.astral.sh/uv/getting-started/installation/) tool:
    -```bash
    -curl -LsSf https://astral.sh/uv/install.sh | sh
    -source $HOME/.local/bin/env
    -uv venv -p 3.10
    -source .venv/bin/activate
    -# optionally you might need to install pip which is not installed by default:

doc/USER_GUIDE.md
Lines changed: 21 additions & 19 deletions

    @@ -78,7 +78,7 @@ scheduler = "slurm"

     install_path = "./install"
     output_path = "./results"
    -cache_docker_images_locally = "True"
    +cache_docker_images_locally = true
     default_partition = "<YOUR PARTITION NAME>"

     mpi = "pmix"
    @@ -104,15 +104,15 @@ Test Configuration describes a particular test configuration to be run. It is ba
     name = "nccl_test_all_reduce_single_node"
     description = "all_reduce"
     test_template_name = "NcclTest"
    -extra_cmd_args = "--stepfactor 2"

     [cmd_args]
    -"subtest_name" = "all_reduce_perf_mpi"
    -"ngpus" = "1"
    -"minbytes" = "8M"
    -"maxbytes" = "16G"
    -"iters" = "5"
    -"warmup_iters" = "3"
    +subtest_name = "all_reduce_perf_mpi"
    +ngpus = 1
    +minbytes = "8M"
    +maxbytes = "16G"
    +iters = 5
    +warmup_iters = 3
    +stepfactor = 2
     ```
     You can find more examples under `conf/common/test`. In a test schema file, you can adjust arguments as shown above. In the `cmd_args` section, you can provide different values other than the default values for each argument. In `extra_cmd_args`, you can provide additional arguments that will be appended after the NCCL test command. You can specify additional environment variables in the `extra_env_vars` section.
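
The move from quoted to bare values in `[cmd_args]` changes the parsed types, because TOML distinguishes strings from integers. The following is a minimal sketch using Python's standard `tomllib` parser (Python 3.11+, with the `tomli` backport as a fallback); the snippets are trimmed copies of the old and new examples in this hunk. How CloudAI itself loads and validates these files is not shown in the diff, so nothing here should be read as its implementation.

```python
try:
    import tomllib  # standard library since Python 3.11
except ModuleNotFoundError:
    import tomli as tomllib  # backport with the same API (assumed installed)

# Old style from the removed lines: quoted keys, every value a string.
OLD_STYLE = '''
[cmd_args]
"ngpus" = "1"
"iters" = "5"
'''

# New style from the added lines: bare keys, numeric values unquoted.
NEW_STYLE = '''
[cmd_args]
ngpus = 1
iters = 5
minbytes = "8M"
'''

old_args = tomllib.loads(OLD_STYLE)["cmd_args"]
new_args = tomllib.loads(NEW_STYLE)["cmd_args"]

# Quoted and bare keys name the same key; only the value types differ.
print(type(old_args["ngpus"]).__name__)     # str
print(type(new_args["ngpus"]).__name__)     # int
print(type(new_args["minbytes"]).__name__)  # str
```

Size values such as `"8M"` stay quoted because they are not valid TOML numbers.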

    @@ -122,12 +122,14 @@ Test Scenario uses Test description from step 5. Below is the `myconfig/scenario
     name = "nccl-test"

     [[Tests]]
    -id = "Tests.1"
    +id = "allreduce.1"
    +num_nodes = 1
     test_name = "nccl_test_all_reduce_single_node"
     time_limit = "00:20:00"

     [[Tests]]
    -id = "Tests.2"
    +id = "allreduce.2"
    +num_nodes = 1
     test_name = "nccl_test_all_reduce_single_node"
     time_limit = "00:20:00"
     [[Tests.dependencies]]
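
In TOML, each `[[Tests]]` header appends one table to an array, so the scenario parses to a list of test entries. A minimal sketch with Python's `tomllib`, using only the fields visible in this hunk (the `[[Tests.dependencies]]` table is cut off in the diff and is left out). The uniqueness check on `id` is an illustrative assumption, not a documented CloudAI validation step.

```python
try:
    import tomllib  # standard library since Python 3.11
except ModuleNotFoundError:
    import tomli as tomllib  # backport with the same API (assumed installed)

SCENARIO_TOML = '''
name = "nccl-test"

[[Tests]]
id = "allreduce.1"
num_nodes = 1
test_name = "nccl_test_all_reduce_single_node"
time_limit = "00:20:00"

[[Tests]]
id = "allreduce.2"
num_nodes = 1
test_name = "nccl_test_all_reduce_single_node"
time_limit = "00:20:00"
'''

scenario = tomllib.loads(SCENARIO_TOML)
tests = scenario["Tests"]  # array of tables -> list of dicts

ids = [test["id"] for test in tests]
print(ids)  # ['allreduce.1', 'allreduce.2']

# Hypothetical sanity check: ids should not repeat within one scenario.
ids_are_unique = len(ids) == len(set(ids))
print(ids_are_unique)  # True
```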

    @@ -178,7 +180,7 @@ cloudai generate-report \
     # Describing a System in the System Schema
     In this section, we introduce the concept of the system schema, explain the meaning of each field, and describe how the fields should be used. The system schema is a TOML file that allows users to define a system's configuration.
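
The `cache_docker_images_locally` change earlier in this file (`"True"` to `true`) shows why value types matter in a system schema. A minimal sketch with Python's `tomllib`, using fields copied from that hunk; the quoted `"False"` value is a hypothetical input added to illustrate the pitfall, and how CloudAI interprets quoted values is not stated in the diff.

```python
try:
    import tomllib  # standard library since Python 3.11
except ModuleNotFoundError:
    import tomli as tomllib  # backport with the same API (assumed installed)

SYSTEM_TOML = '''
scheduler = "slurm"
install_path = "./install"
output_path = "./results"
cache_docker_images_locally = true
mpi = "pmix"
'''

system = tomllib.loads(SYSTEM_TOML)
print(system["cache_docker_images_locally"])  # True (a real bool)

# Hypothetical quoted value: it parses as a string, and any non-empty
# string is truthy in Python, so a naive truth test reads it as enabled.
quoted = tomllib.loads('cache_docker_images_locally = "False"')
print(bool(quoted["cache_docker_images_locally"]))  # True
```

Writing the flag as a bare `true` or `false` removes that ambiguity.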

doc/index.md
Lines changed: 24 additions & 33 deletions

    @@ -3,31 +3,35 @@
     CloudAI benchmark framework aims to develop an industry standard benchmark focused on grading Data Center (DC) scale AI systems in the Cloud. The primary motivation is to provide automated benchmarking on various systems.

     ## Get Started
    -**Note**: instructions for installing a custom python version are available [here](install-custom-python-version).
    -
     **Note**: instructions for setting up access for `enroot` are available [here](set-up-access-to-the-private-ngc-registry).

    -1. Clone the CloudAI repository to your local machine:
    -```bash
    -git clone git@github.com:NVIDIA/cloudai.git
    -cd cloudai
    -```
    +```bash
    +git clone git@github.com:NVIDIA/cloudai.git
    +cd cloudai
    +uv run cloudai --help
    +```

    -2. Create a virtual environment:
    -```bash
    -python -m venv venv
    -source venv/bin/activate
    -```
    +### `pip`-based installation
    +See required Python version in the `.python-version` file, please ensure you have it installed (see how a custom python version [can be installed](#install-custom-python-version)). Follow these steps:
    +```bash
    +git clone git@github.com:NVIDIA/cloudai.git
    +cd cloudai
    +python -m venv venv
    +source venv/bin/activate
    +pip install -e .
    +```

    -3. Next, install the required packages:
    -```bash
    -pip install .
    -```
    +(install-custom-python-version)=
    +### Install custom python version
    +If your system python version is not supported, you can install a custom version using [uv](https://docs.astral.sh/uv/getting-started/installation/) tool:
    +```bash
    +curl -LsSf https://astral.sh/uv/install.sh | sh
    +source $HOME/.local/bin/env
    +uv venv --seed # this will pick up the python version from .python-version file

     Replace `<api-key>` with your respective credentials. Keep `$oauthtoken` as is.

    -
    -(install-custom-python-version)=
    -### Install custom python version
    -If your system python version is not supported, you can install a custom version using [uv](https://docs.astral.sh/uv/getting-started/installation/) tool:
    -```bash
    -curl -LsSf https://astral.sh/uv/install.sh | sh
    -source $HOME/.local/bin/env
    -uv venv -p 3.10
    -source .venv/bin/activate
    -# optionally you might need to install pip which is not installed by default: