sig-node e2e tests machine hardware requirements #7339
Open
Description
sig-node owns a set of features related to exposing and using hardware details which require some hardware features to exercise the code. Examples are exclusive CPU allocation (cpumanager), device allocation (device manager), NUMA alignment (topology manager), NUMA alignment considering distances between NUMA zones (topology manager).
Note: some requirement overlap. Easy example: a powerful high end (at time of writing) server CPU can have at the same time multi core count, exposing multiple NUMA nodes, and have split L3, satisfying in one go all cpumanager requirements
Hardware requirements, driven by feature, rationale
- cpumanager (GA): x86_64, arm: machine with at least 4 cores exposed, better like 16 or more. We need more cores to be able to run different set of tests in different scenarios. 4 cores is the minimum to run basic test about the feature. We already got machines with 4 cores
- topology manager (GA): x86_64, arm: machine with 2 or more NUMA nodes. We need to align on NUMA nodes, so we need 2 or more nodes to begin with
- topology manager (KEP Improved multi-numa alignment in Topology Manager enhancements#3545): x86_64, arm: machine with 4 or more NUMA nodes. We need to consider NUMA distances in allocation, so we need 4 or more nodes with different distances between each other
- topology manager (KEP KEP-4622: Add a TopologyManager policy option for MaxAllowableNUMANodes enhancements#4622): arm only?: machine with 9 or more NUMA nodes (!) like grace gpus
- cpumanager (KEP KEP-4800: Split UnCoreCache awareness enhancements#4810): x86_64, arm: a machine with split (non-uniform) L3 cache, like epyc cpus
- cpumanager (KEP Add CPUManager policy option to align CPUs by Socket instead of by NUMA node enhancements#3327): x86_64, arm: a machine with multiple CPU sockets, in order to exercise alignment by socket.
- devicemanager (GA): x86_64, arm: hardware devices controlled by device plugins. Most common usecases are SRIOV cards and GPUs. One device is the bare minimum, we would like 2 or more to be able to use them also in the topology manager tests as deciding factor (see topology manager in this list)
- memorymanager (Beta, GA graduating): x86_64, arm: totally overlaps with topology manager requirements, listed here for the sake of completeness.
this list will be updated after more review of the ongoing sig-node features
Metadata
Assignees
Labels
Type
Projects
Status
Backlog