
Fix #3076 parEvalMap resource scoping #3512

Closed

Conversation


@reardonj commented Dec 26, 2024

Updates parEvalMap*, broadcastThrough, and prefetch to extend the resource scope past the channel/topic used to implement concurrency for these operators.

In all cases the solution is the same:

  1. use underlying.uncons.flatMap to get into a Pull context with access to the source stream
  2. set up the new foreground stream with Pull.extendScopeTo to pull the scope across to the new stream

I don't see an obvious way to extract this into something general. The extra finalization in the background stream for parEvalMapUnorderedUnbounded means I can't just make a concurrently variant that takes the background stream, since the foreground needs to consume the transformed stream while taking only the original stream's scope.

Edit: I have abstracted this solution into a new extendScopeThrough method, which works like through except that it propagates the scope to the new stream. It appears we should be doing this for any stream combinator where the resulting stream is not directly derived from the current stream (e.g. any combinator that uses an internal buffer, channel, topic, or the like).

I also had to fix the cancellation safety of extendScopeTo (see #3474). The StreamSuite "resource safety test 4" started failing after my changes, presumably because the scope started propagating far enough to actually get hit by cancellation.

At least conflateChunks is broken in the same way, but I'd like to validate that this solution is correct before chasing down all the other places the fix should be applied.
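For readers without the context of #3076, here is a toy model of the idea behind the fix. This is plain Scala with made-up names (ToyScope, lease, close), not fs2's actual Scope implementation: extending the scope amounts to taking a lease (a reference) on it, so its finalizers only run after both the scope itself and every lease have been released. Without the lease, the source's finalizers would run as soon as the channel's internal consumer finished, possibly before the foreground had seen the values.

```scala
// Toy model of scope leasing -- NOT fs2's real implementation.
// A lease bumps a reference count, deferring finalization until
// both the scope itself and every lease have been released.
import scala.collection.mutable.ListBuffer

object ToyScopeDemo {
  final class ToyScope(log: ListBuffer[String]) {
    private var refs = 1 // the scope itself holds one reference
    private val finalizers = ListBuffer.empty[() => Unit]

    def acquire(name: String): Unit = {
      log += s"acquire $name"
      finalizers += (() => log += s"release $name")
    }

    // Taking a lease defers finalization; returns the release action.
    def lease(): () => Unit = { refs += 1; () => releaseRef() }

    def close(): Unit = releaseRef() // drops the scope's own reference

    private def releaseRef(): Unit = {
      refs -= 1
      if (refs == 0) finalizers.reverseIterator.foreach(_())
    }
  }

  def run(): List[String] = {
    val log = ListBuffer.empty[String]
    val scope = new ToyScope(log)
    scope.acquire("tempFile")

    val release = scope.lease() // foreground extends the scope
    scope.close()               // source/channel side finishes first
    log += "foreground still reading tempFile"
    release()                   // only now does the finalizer run
    log.toList
  }

  def main(args: Array[String]): Unit = run().foreach(println)
}
```

Running this logs `acquire tempFile`, then `foreground still reading tempFile`, and only then `release tempFile`: closing the scope alone does not finalize while the lease is outstanding.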

Updates parEvalMap* and broadcastThrough to extend the resource scope past the channel/topic used to implement concurrency for these operators.
@reardonj force-pushed the issue-3076-parEvalMap-cleanup-fix branch from 2b40725 to 810af8a on Dec 26, 2024 05:25
Make extendScopeTo cancellation safe (see typelevel#3474)
@reardonj force-pushed the issue-3076-parEvalMap-cleanup-fix branch from e229f82 to cc55983 on Dec 26, 2024 15:39
def extendScopeThrough[F2[x] >: F[x], O2](
    f: Stream[F, O] => Stream[F2, O2]
)(implicit F: MonadError[F2, Throwable]): Stream[F2, O2] =
  this.pull.peek
Contributor

Is it guaranteed to be safe (in the context of scopes) to use .peek before Pull.extendScopeTo?

Author

I'm assuming scopes work properly for simple streams. 🤞 Do you have a specific scenario in mind?

The fundamental issue is that I need to get hold of a scope before adding more finalizers to the source, to avoid deadlocking in parEvalMap, and I don't see a way to do that without uncons. peek is just a convenience helper.

Contributor

> Do you have a specific scenario in mind?

No, not really, it's just me being paranoid :)

I don't see a way to do it without uncons either, I was thinking about swapping the order of those Pulls:

def extendScopeThrough[F2[x] >: F[x], O2](
      f: Stream[F, O] => Stream[F2, O2]
  )(implicit F: MonadError[F2, Throwable]): Stream[F2, O2] =
    Pull
      .extendScopeTo(this.covary[F2])
      .flatMap { stream =>
        stream.pull.peek.flatMap {
          case Some((_, tl)) => f(tl).underlying // <---------------
          case None          => f(Stream.empty).underlying
        }
      }
      .stream

but the problem is tl is not a Stream[F, O] (it's a Stream[F2, O]), and .covary[F2] at the beginning has to be there because we only have MonadError for F2.

Author

Just make it f: Stream[F2, O] => Stream[F2, O2]. Requires type annotations at the call sites, but otherwise, this version also passes the tests.

Unless we can find an observable difference I'd rather keep the current version to avoid the extra annotations.

Contributor

I agree :)

@armanbilge added this to the v3.12.0 milestone Dec 28, 2024
@armanbilge marked this pull request as draft Dec 28, 2024 16:03
@reardonj (Author)

This solution is inadequate. It only handles scopes open at the first pull.

@reardonj closed this Dec 30, 2024
@yurique (Contributor) commented Dec 30, 2024

@reardonj

Wouldn't the Scope.lease (that is used in Pull.extendScopeTo) handle the subsequent (child) scopes?

> Leases the resources of this scope until the returned lease is cancelled.
> Note that this leases all resources in this scope, resources in all parent scopes (up to root)
> and resources of all child scopes.

(unless, of course, I'm misunderstanding what the above means)

@yurique (Contributor) commented Dec 30, 2024

I tried some ridiculous things like the following:

  fs2.Stream.unit
    .covary[IO]
    .flatMap { _ =>
      fs2.Stream.bracket {
        IO.println("making resource 1").as("TempFile")
      } { res =>
        IO.println(s"!! releasing resource 1: $res")
      }
    }
    .flatMap { _ =>
      fs2.Stream.bracket {
        IO.println("making resource 2").as("TempFile")
      } { res =>
        IO.println(s"!! releasing resource 2: $res")
      }
    }
    .parEvalMap(2) { _ =>
      IO.println("eval")
    }
    .flatMap { _ =>
      fs2.Stream.bracket {
        IO.println("making resource 3").as("TempFile")
      } { res =>
        IO.println(s"!! releasing resource 3: $res")
      }
    }
    .parEvalMap(2) { _ =>
      IO.println("eval 2")
    }
    .flatMap { _ =>
      fs2.Stream.bracket {
        IO.println("making resource 4").as("TempFile")
      } { res =>
        IO.println(s"!! releasing resource 4: $res")
      }
    }
    .parEvalMap(2) { _ =>
      IO.println("eval 3")
    }
    .compile
    .drain
    .unsafeRunSync()

and it works like clockwork:

making resource 1
making resource 2
eval
making resource 3
eval 2
making resource 4
eval 3
!! releasing resource 4: TempFile
!! releasing resource 3: TempFile
!! releasing resource 2: TempFile
!! releasing resource 1: TempFile

@reardonj (Author)

@yurique, try a stream that creates multiple scopes. I just pushed the test that breaks, based on the Network[IO].server conversation:

      Stream(1, 2, 3, 4, 5, 6)
        .flatMap(i => Stream.bracket(Deferred[IO, Int])(_.complete(i).void)) // 1
        .parEvalMap(2)(d => IO.sleep(1.second) >> d.complete(0))
        .evalMap(completed => IO.raiseWhen(!completed)(new RuntimeException("already completed")))
        .timeout(5.seconds)
        .compile
        .last
        .assertEquals(Some(()))

As I understand it, each element passing through // 1 opens another scope, but only the first scope is actually propagated.
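The point can be sketched with another toy model (again plain Scala with illustrative names, not fs2 internals): if the combinator leases only the scope that is open at the first pull, a scope opened by a later element's bracket has no outstanding lease, so closing it finalizes immediately, even though the foreground has not consumed that element yet.

```scala
// Toy timeline -- NOT fs2 internals -- of why a lease taken at the
// first pull is not enough: it covers the scope open at that moment,
// but scopes opened by later elements are never leased.
import scala.collection.mutable.ListBuffer

object LeaseAtFirstPullDemo {
  final class Scope(val name: String, log: ListBuffer[String]) {
    private var refs = 1
    def lease(): () => Unit = { refs += 1; () => release() }
    def close(): Unit = release()
    private def release(): Unit = {
      refs -= 1
      if (refs == 0) log += s"finalize $name"
    }
  }

  def run(): List[String] = {
    val log = ListBuffer.empty[String]

    // First pull: element 1's scope is open, and the combinator leases it.
    val s1 = new Scope("scope-1", log)
    val releaseLease = s1.lease()

    // The source closes scope 1 and moves on; the lease defers it.
    s1.close()

    // Element 2 opens a new scope AFTER the lease was taken. Nothing
    // is leasing it, so closing it finalizes immediately -- before the
    // foreground has consumed element 2.
    val s2 = new Scope("scope-2", log)
    s2.close()

    log += "foreground consumes elements"
    releaseLease() // scope 1 finalizes only now
    log.toList
  }

  def main(args: Array[String]): Unit = run().foreach(println)
}
```

The log comes out as `finalize scope-2`, `foreground consumes elements`, `finalize scope-1`: only the scope leased at the first pull outlives the foreground's consumption.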

@armanbilge removed this from the v3.12.0 milestone Dec 30, 2024