Skip to content

Conversation

@notoraptor
Copy link
Contributor

@nurbal ! Prêt pour une review !

…s prometheus spécifique aux GPUs sur un noeud donné plus bas qu’un threshold X
group_by_node: Union[bool, Sequence[str]] = ("mila",),
min_jobs_per_group: Optional[Union[int, Dict[str, int]]] = None,
nb_stddev=2,
with_gres_gpu=False,
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ce nouveau parametre n'est pas testés dans test_check_prometheus_scraping_stats, et donc on teste uniquement le cas des jobs CPU.

df = load_job_series(start=start, end=end, clip_time=clip_time)

# Parse minimum_runtime, and select only jobs where
# elapsed time >= minimum runtime and allocated.gres_gpu == 0
Copy link
Collaborator

@nurbal nurbal Nov 18, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

... d'ailleurs avant on ignorait les jobs GPU, on dirait bien ^^

)


def check_prometheus_stats_for_gpu_jobs(
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

... au temps pour moi, je n'étais pas bien réveillé. test_check_prometheus_stats_for_gpu_jobs teste bien les GPU à travers l'appel à check_prometheus_stats_for_gpu_jobs. Pas vraiment un test unitaire, mais c'est ok pour moi :-)

@nurbal nurbal merged commit 6cd17ec into master Nov 19, 2024
7 checks passed
@notoraptor notoraptor deleted the sarc-330 branch November 19, 2024 16:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants