Auto-balancing Playwright test shards #13384

aslushnikov · 2026-05-23T23:16:14Z

aslushnikov
May 23, 2026

Hey folks,

I used to work on Playwright, and I noticed that Langflow uses Playwright too, including sharding in CI.

These days I’m building Flakiness.io: a GitHub-native test analytics dashboard that tracks flaky tests and detects regressions in pull requests.

We’re prototyping Playwright shard balancing, where Flakiness.io uses historical test timings to produce better-balanced shards. Since Langflow has a real-world Playwright setup, I’d be very happy if you’re open to testing it on your workload.

OSS projects can use Flakiness.io for free. I’d be happy to help set it up and see if you find it useful.

AntonioABLima · 2026-05-25T12:43:10Z

AntonioABLima
May 25, 2026
Collaborator

Hey @aslushnikov , thanks for the message! Flakiness.io sounds like a great tool, especially for a real-world Playwright setup like ours.

I've pinged the folks on our team who handle our E2E tests (@daniellicnerski1 and @Victor-w-Madeira) to check this out. We'd love to see how it can help with our shard balancing. Thanks again for offering this to the OSS community!

0 replies

AntonioABLima · 2026-05-25T12:46:28Z

AntonioABLima
May 25, 2026
Collaborator

Super beautiful website, btw!

0 replies

daniellicnerski1 · 2026-05-25T12:51:00Z

daniellicnerski1
May 25, 2026
Collaborator

Hey, @aslushnikov

Thanks for reaching out!

I'm Daniel, QA engineer at OrionTech, I currently own the E2E regression test suite for Langflow. The suite actually lives in a separate repo from Langflow itself: it's an independent Playwright + TypeScript project that runs against any Langflow
instance via URL, covering core flows, components, MCP, auth, playground, and so on, with nightly and weekly CI schedules.

Happy to try Flakiness.io on it. It's a real-world Playwright workload with LLM-backed flows (so we do see real flakiness), and we've been collecting historical run data in JSONL logs already, should be a useful target. Let me know what setup steps
you'd like me to take and I'll get it wired up.

0 replies

aslushnikov · 2026-05-25T15:56:29Z

aslushnikov
May 25, 2026
Author

Hey @daniellicnerski1!

The suite actually lives in a separate repo from Langflow itself: it's an independent Playwright + TypeScript project that runs against any Langflow instance via URL

With this poly-repo setup, Flakiness.io test analytics & regression analysis is pretty useless today: we attribute test results with the source code revision, thinking that commit id fully defines both the tests and the system-under-test. Maybe we can come up with some workarounds, depending on how exactly you set it up. Is the testing repository private? I can't find it, somehow.

On the plus side, shard auto-balancing should work just fine: it only uses historical timing data. For it to work, you just need to create an org + a project on Flakiness.io, and install the @flakiness/playwright reporter to upload the testing data. In a few weeks, once we release the experimental sharding, I'll come back to send a PR and enable the experiment for you. By this time, the project should already accumulate timing data, so we should be able to see how well it works.

0 replies

daniellicnerski1 · 2026-05-28T12:29:41Z

daniellicnerski1
May 28, 2026
Collaborator

Hi @aslushnikov
The repository is currently private, but I'm looking into the possibility of making it public, and I'll send it your way once I do.

Either way, I'm going to test it out in the repo to use some of the features. Definitely keep me updated on new features; I found it really interesting! Also, I'll talk to the team in charge of some of Langflow's current automations and tell them about the tool.

Let's stay in touch, thanks!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auto-balancing Playwright test shards #13384

Uh oh!

{{title}}

Uh oh!

Replies: 5 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Auto-balancing Playwright test shards #13384

Uh oh!

aslushnikov May 23, 2026

Replies: 5 comments

Uh oh!

AntonioABLima May 25, 2026 Collaborator

Uh oh!

AntonioABLima May 25, 2026 Collaborator

Uh oh!

daniellicnerski1 May 25, 2026 Collaborator

Uh oh!

Uh oh!

aslushnikov May 25, 2026 Author

Uh oh!

daniellicnerski1 May 28, 2026 Collaborator

aslushnikov
May 23, 2026

AntonioABLima
May 25, 2026
Collaborator

AntonioABLima
May 25, 2026
Collaborator

daniellicnerski1
May 25, 2026
Collaborator

aslushnikov
May 25, 2026
Author

daniellicnerski1
May 28, 2026
Collaborator