How does NVFLARE manages resources and concurrent jobs? #3883
Replies: 1 comment
-
|
Does the client instance (or physical server) have multiple GPUs?
Create both "client1-1" and "client1-2" containers on the client instances. Then, when running a job, configure the participant list so that only one container per instance participates, such as "client1-1", "2-1", and "3-1".
|
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
In our legacy FL system set-up, the need to allow multiple jobs for a specific network of one server and multiple clients, with assurance that each job running did not error out for lack of pre-allocated resources, required the a set-up like the following:
That is there were multiple nets with replicated server, client and admin containers, the clients having dedicated GPUs. We wondered if NVFLARE's current infrastructure would support a single set of admin, server and client containers where, if a client has a job running using N GPUs on start, there is a guarantee that the compute availability will stay constant throughout the job regardless of other jobs starting after.
Beta Was this translation helpful? Give feedback.
All reactions