You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+45-33Lines changed: 45 additions & 33 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,14 +1,38 @@
1
1
# CloudAI Benchmark Framework
2
2
3
-
## Project Description
4
3
CloudAI benchmark framework aims to develop an industry standard benchmark focused on grading Data Center (DC) scale AI systems in the Cloud. The primary motivation is to provide automated benchmarking on various systems.
5
4
5
+
## Get Started
6
+
**Note**: instructions for installing a custom python version are available [here](#install-custom-python-version).
7
+
8
+
**Note**: instructions for setting up access for `enroot` are available [here](#set-up-access-to-the-private-ngc-registry).
9
+
10
+
1. Clone the CloudAI repository to your local machine:
11
+
```bash
12
+
git clone git@github.com:NVIDIA/cloudai.git
13
+
cd cloudai
14
+
```
15
+
16
+
2. Create a virtual environment:
17
+
```bash
18
+
python -m venv venv
19
+
source venv/bin/activate
20
+
```
21
+
22
+
3. Next, install the required packages:
23
+
```bash
24
+
pip install .
25
+
```
26
+
27
+
For development please use the following command:
28
+
```bash
29
+
pip install -e '.[dev]'
30
+
```
31
+
6
32
## Key Concepts
7
-
### Schemas
8
33
CloudAI operates on four main schemas:
9
34
10
35
- **System Schema**: Describes the system, including the scheduler type, node list, and global environment variables.
11
-
-**Test Template Schema**: A template for tests that includes all required command-line arguments and environment variables. This schema allows users to separate test template implementations from systems.
12
36
- **Test Schema**: An instance of a test template with custom arguments and environment variables.
13
37
- **Test Scenario Schema**: A set of tests with dependencies and additional descriptions about the test scenario.
14
38
@@ -30,8 +54,8 @@ These schemas enable CloudAI to be flexible and compatible with different system
30
54
|SlurmContainer|✅|❌|❌|❌|
31
55
|MegatronRun (experimental)|✅|❌|❌|❌|
32
56
33
-
34
-
## Set Up Access to the Private NGC Registry
57
+
## Details
58
+
### Set Up Access to the Private NGC Registry
35
59
First, ensure you have access to the Docker repository. Follow the following steps:
36
60
37
61
1. **Sign In**: Go to [NVIDIA NGC](https://ngc.nvidia.com/signin) and sign in with your credentials.
Replace `<api-key>` with your respective credentials. Keep `$oauthtoken` as is.
50
74
51
75
52
-
## Get Started
53
-
1. Clone the CloudAI repository to your local machine:
54
-
```bash
55
-
git clone git@github.com:NVIDIA/cloudai.git
56
-
cd cloudai
57
-
```
58
-
59
-
2. Create a virtual environment:
60
-
```bash
61
-
python -m venv venv
62
-
source venv/bin/activate
63
-
```
64
-
65
-
3. Next, install the required packages:
66
-
```bash
67
-
pip install .
68
-
```
76
+
### Install custom python version
77
+
If your system python version is not supported, you can install a custom version using [uv](https://docs.astral.sh/uv/getting-started/installation/) tool:
78
+
```bash
79
+
curl -LsSf https://astral.sh/uv/install.sh | sh
80
+
source $HOME/.local/bin/env
81
+
uv venv -p 3.10
82
+
source .venv/bin/activate
83
+
# optionally you might need to install pip which is not installed by default:
84
+
uv pip install -U pip
85
+
```
69
86
70
-
For development please use the following command:
71
-
```bash
72
-
pip install -e '.[dev]'
73
-
```
87
+
## CloudAI Modes Usage Examples
74
88
75
89
CloudAI supports five modes:
76
90
-[install](#install) - Use the install mode to install all test templates in the specified installation path
@@ -79,9 +93,7 @@ CloudAI supports five modes:
79
93
-[generate-report](#generate-report) - Use the generate-report mode to generate reports under the test directories alongside the raw data
80
94
-[uninstall](#uninstall) - Use the uninstall mode to remove installed test templates
81
95
82
-
### CloudAI Modes Usage Examples
83
-
84
-
#### install
96
+
### install
85
97
86
98
To install test prerequisites, run CloudAI CLI in install mode.
0 commit comments