WIP: Add `JobManager` #4287

BalmungSan · 2025-03-04T01:45:22Z

Implements the JobManager idea proposed in #1345

Roadmap

Define the API.
Implement it.
Add tests.
Add docs.

BalmungSan · 2025-03-04T01:47:24Z

std/shared/src/main/scala/cats/effect/std/JobManager.scala

+   * Creates and launches the given `Job` in the background. If another Job with the same id was
+   * already running, it will be cancelled before starting this one.
+   */
+  def startJob(id: Id, job: Resource[F, JobManager.Job[F, S]]): F[Unit]


On one had, I like the idea of users being able to use their own Ids.
On the other, I think most users would benefit from a default using automatically generated UUIDs.

Should we provide such default in some way?

BalmungSan · 2025-03-04T01:48:36Z

std/shared/src/main/scala/cats/effect/std/JobManager.scala

+  /**
+   * Gets the status of the `Job` associated with the given `id`. If `id` doesn't exists or the
+   * `Job` already finished then the returned value will be a `None`.
+   */
+  def getJobStatus(id: Id): F[Option[S]]


I somewhat dislike the idea that both bad id and already finished return in a None.
But, otherwise, the map will grow indefinitely.

Any ideas?

By fully controlling IDs, we could have startJob return a fresh ID, so "bad ID" would never happen. (But then we'd lose the ability of users to have their own IDs...)

BalmungSan · 2025-03-04T01:48:58Z

std/shared/src/main/scala/cats/effect/std/JobManager.scala

+  /**
+   * Signals cancellation of the `Job` associated with the given `id`, and waits for its
+   * completion.
+   */
+  def cancelJob(id: Id): F[Unit]


Should we provide a cacelAndForget variant where we don't wait on cancellation?

Would that be essentially cancelJob(...).start? If yes, I don't think we should add it (everyone can .start for themselves).

Conceptually yes, but also, since we already have a Dispatcher in place, it could be used to perform that rather than raw start.

Okay, so I think my point is: if it's somehow better (performance, safety, whatever) than just .start-ing, then yeah, maybe add it. If it's not better, then definitely not.

AFAIK, it is safer since its life cycle is attached to the Supervisor.
But I am not sure what canceling a cancel does.

It does "nothing", as a cancel is uncancelable.

Then I guess as you said, it provides no value and users who don't want to wait on the cancellation can just start.

BalmungSan · 2025-03-04T01:49:38Z

std/shared/src/main/scala/cats/effect/std/JobManager.scala

+  trait Job[F[_], S] {
+
+    /**
+     * Starts the logic of this `Job`.
+     */
+    def run: F[Unit]
+
+    /**
+     * Gets the status of this `Job`.
+     */
+    def getStatus: F[S]
+  }


Users would then implement this trait for their own Jobs.

BalmungSan · 2025-03-04T01:50:00Z

std/shared/src/main/scala/cats/effect/std/JobManager.scala

+          }
+        }
+
+        supervisor.supervise(runJob).void


We don't wait for the Job to start before returning to the user.
But, that means there is a brief delay between starting the Job and users being able to query its status.

I don't think that's okay. I think the point of std is to solve tricky race conditions like this for users.

Fair point.
What do you think would be the best way to solve that? A Deferred a CountdownLatch? Other thing?
Or maybe, simply run the runJob logic there rather that sending it to the Supervisor?

Yeah, I'm a little confused by why it's sent to the Supervisor... (I'm sure there is a reason, I just don't know it).

The main thing is that if the same id was already found we cancel it, which is costly.
But, I think I could just send that to the Supervisor as well.

Oh, looking at the code, I realized that we also need to run the job setup (Resource.acquire) before being able to run the job itself.
So that was also part of what was being done in the background, but it also means that the short delay could be bigger.

Thus, I decided to use a Deferred to wait until the job has been properly registered before returning. But that may take a while.
Another idea that I just had would be to have an Initializing status that could be used meanwhile. However, that would complicate the logic quite a bit.

BalmungSan · 2025-03-04T01:57:19Z

std/shared/src/main/scala/cats/effect/std/JobManager.scala

+                  cancel = fiber.cancel
+                ).some
+              )
+              .flatMap(_.traverse_(_.cancel)) >>


In case the same id was already used, we cancel the previous Job.

BalmungSan added 2 commits March 3, 2025 20:27

Add JobManager API

f336fd5

Implement JobManager

1e7138f

BalmungSan force-pushed the add-job-manager branch from 21ea992 to 1e7138f Compare March 4, 2025 01:56

BalmungSan commented Mar 4, 2025

View reviewed changes

startJob waits until the job has been successfully registered

c1bfc85

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Add `JobManager` #4287

WIP: Add `JobManager` #4287

BalmungSan commented Mar 4, 2025

BalmungSan Mar 4, 2025

BalmungSan Mar 4, 2025

durban Mar 23, 2025

BalmungSan Mar 4, 2025

durban Mar 23, 2025

BalmungSan Mar 23, 2025

durban Mar 23, 2025

BalmungSan Mar 23, 2025

durban Mar 23, 2025

BalmungSan Mar 23, 2025

BalmungSan Mar 4, 2025

BalmungSan Mar 4, 2025

durban Mar 23, 2025

BalmungSan Mar 23, 2025 •

edited

Loading

durban Mar 23, 2025

BalmungSan Mar 23, 2025 •

edited

Loading

BalmungSan Mar 23, 2025

BalmungSan Mar 4, 2025

WIP: Add JobManager #4287

Are you sure you want to change the base?

WIP: Add JobManager #4287

Conversation

BalmungSan commented Mar 4, 2025

Roadmap

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

BalmungSan Mar 23, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

BalmungSan Mar 23, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

WIP: Add `JobManager` #4287

WIP: Add `JobManager` #4287

BalmungSan Mar 23, 2025 •

edited

Loading

BalmungSan Mar 23, 2025 •

edited

Loading