Skip to content

[INFRA]: Run CUB tests in parallel #9550

Description

@gevtushenko

Is this a duplicate?

Overview

Today, CUB tests run sequentially because some of them test large problem sizes requiring all the VRAM. This limits our coverage of concurrency-related issues. @pauleonix found cases where compute sanitizer and sequential test runs are fine, but parallel runs of ctest lead to time-sharing and expose a data race on CUB end.

By running CUB tests in parallel we'll get better coverage and faster CI. Current plan to achieve that is the following:

  1. wait for [FEA]: Extract reusable benchmarking and testing components #9310 to be merged - it'll provide common component that's going to improve compilation time on split targets on step (2)
  2. extract *_large tests into standalone TUs that require entire GPU and assign them appropriate RESOURCE_GROUPS
  3. identify appropriate concurrency level
  4. use concurrency level from (3) as opt-in for CI - some runners are RAM limited (orin etc.). We should avoid running concurrent tests by default to avoid OOM.

Details

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    infraShared CMake, github, etc infrastructure

    Type

    No type
    No fields configured for issues without a type.

    Projects

    Status
    Todo

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions