Skip to content

Move task spawning to driver methods #4

Open
@yoshuawuyts

Description

@yoshuawuyts

Currently parallel-stream will start doing work the second it's initialized. This is much the same as task::spawn. The upside of this is that we got it to work, and it's what people want in the overwhelming amount of cases.

The downsides are that we're both spawning more tasks than needed, which interferes with the ability of the compiler to inline futures, which in turn will impact performance. Also there's no backpressure.

The solution to this seems to be to move "task spawning" to the edge methods: next, for_each, collect, sum that on the one hand spawn tasks as fast as possible, while on the other hand allow outputting them one-by-one.

There's probably some nuance here; for example next is by nature sequential so task spawning might not even make sense. But overall it seems that if we can invert the logic slightly this could lead to some neat results.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions