We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 94fa1e0 commit 9465e2dCopy full SHA for 9465e2d
1 file changed
src/grouping_trainer/launch.py
@@ -3,8 +3,8 @@
3
4
The instance's startup script does bin/_startup.sh to set up the python env and then `eval`s an inputted command.
5
6
-Training jobs don't need to start in time, so by default they're launched async via flex-start w/ a max wait time of 1
7
-hour. Also saves some money.
+Training jobs don't need to start in time, so by default they're launched async via flex-start w/ a max wait time of 2
+hours. Also saves some money.
8
9
Eval jobs (on cheap L4 GPUs) are launched by sync-looping through zones ourselves b/c eval ideally does start in time,
10
e.g., training shouldn't start w/o an eval poller. L4s are cheap-enough that the flex-start discount isn't worth the
0 commit comments