.github/workflows: Naive unit test sharding #12780

timflannagan · 2025-10-30T04:04:35Z

Description

Attempt to shard the unit test workflow. Right now we're hovering around ~10-12m for the unit test workflow which is very heavy for this type of suite. Implement some naive sharding that mirrors how the e2e suite is currently sharded. Ideally, dynamic sharding based on historical runtime is the medium/long term solution here.

Change Type

/kind cleanup

Changelog

NONE

Additional Notes

Copilot

Pull Request Overview

This PR refactors the GitHub Actions unit test workflow to introduce test sharding for improved parallelization and performance. The workflow now splits tests into "fast" and "slow" shards that run concurrently, with an aggregation step to ensure all tests pass.

Key changes:

Introduced matrix-based test sharding (fast/slow shards) to parallelize test execution
Replaced make unit-with-coverage with direct gotestsum invocations for granular control
Added coverage artifact uploading and aggregation step to track test results from both shards

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

.github/workflows/unit.yaml

timflannagan · 2025-10-30T04:39:59Z

~50% reduction. The fast matrix job is still the longest job though. Proper implementation requires dynamic sharding based on historical runtime. Existing problem though, same thing the e2e suite sharding approach lacks.

Signed-off-by: timflannagan <[email protected]>

jenshu · 2025-10-30T14:08:25Z

.github/workflows/unit.yaml

+        uses: actions/setup-go@v5
+        with:
+          go-version-file: go.mod
+          cache: true


i think the default is already true

jenshu · 2025-10-30T14:11:30Z

.github/workflows/unit.yaml

+      - name: Run unit tests (${{ matrix.shard.name }})
+        run: |
+          if [ "${{ matrix.shard.name }}" = "fast" ]; then
+            PACKAGES=$(go list ./... | grep -v -e 'internal/kgateway/translator/gateway$' -e 'internal/kgateway/controller$' -e 'internal/kgateway/agentgatewaysyncer$' -e 'internal/sds/pkg/run$' -e 'internal/kgateway/setup$' | tr '\n' ' ')


nit: seems a little error-prone to have to define this list in 2 places, wonder if we could store them in an env or something

jenshu · 2025-10-30T14:12:47Z

.github/workflows/unit.yaml

+          fi
+      - name: Validate Test Coverage
+        shell: bash
+        run: make validate-test-coverage || true


why is || true needed here?

Copilot AI review requested due to automatic review settings October 30, 2025 04:04

github-actions bot added kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. release-note-none labels Oct 30, 2025

Copilot AI reviewed Oct 30, 2025

View reviewed changes

timflannagan force-pushed the chore/fast-unit branch 4 times, most recently from 51e435d to 7998018 Compare October 30, 2025 04:32

.github/workflows: Naive unit test sharding

91abc6e

Signed-off-by: timflannagan <[email protected]>

timflannagan force-pushed the chore/fast-unit branch from 7998018 to 91abc6e Compare October 30, 2025 04:40

jenshu reviewed Oct 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

.github/workflows: Naive unit test sharding #12780

.github/workflows: Naive unit test sharding #12780

Uh oh!

timflannagan commented Oct 30, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

timflannagan commented Oct 30, 2025

Uh oh!

jenshu Oct 30, 2025

Uh oh!

jenshu Oct 30, 2025

Uh oh!

jenshu Oct 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

.github/workflows: Naive unit test sharding #12780

Are you sure you want to change the base?

.github/workflows: Naive unit test sharding #12780

Uh oh!

Conversation

timflannagan commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Change Type

Changelog

Additional Notes

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

timflannagan commented Oct 30, 2025

Uh oh!

jenshu Oct 30, 2025

Choose a reason for hiding this comment

Uh oh!

jenshu Oct 30, 2025

Choose a reason for hiding this comment

Uh oh!

jenshu Oct 30, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

timflannagan commented Oct 30, 2025 •

edited

Loading