You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: USER_GUIDE.md
+8Lines changed: 8 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -438,6 +438,14 @@ CloudAI runs all slurm jobs using containers. To simplify file system related ta
438
438
These mounts are not verified for validity and do not override default mounts.
439
439
1. Test specific mounts can be mounted in-code.
440
440
441
+
### Head node without shared storage available on compute nodes
442
+
When compute nodes don't share file system with head node, `--enable-cache-without-check` for `run` and `dry-run` skips real check for cache existence, but still builds all paths correctly. The flow is like this:
443
+
444
+
1. _[on the head node]_ run `cloudai install`
445
+
1. _[on the head node]_ copy cache to compute nodes
446
+
1. Modify system.toml to set compute nodes' installation root
447
+
1. Run `cloudai run --enable-cache-without-check ...`
448
+
441
449
#### Dev details
442
450
`SlurmCommandGenStrategy` defines abstract method `_container_mounts(tr: TestRun)` that must be implemented by every subclass. This method is used in `SlurmCommandGenStrategy.container_mounts(tr: TestRun)` (defined as `@final`) where mounts like `/cloudai_run_results` (default mount), `TestDefinition.extra_container_mounts` (from Test TOML) and test specific mounts (defined in-code) are added.
0 commit comments