
@sunset666 sunset666 commented Nov 24, 2025

Starting approach to gather statistics.

It queries the search API for all Published and QA derived datasets, then asks for each dataset's full path in order to access its session.log.
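A minimal sketch of the query step described above. The payload shape, field names (`status`, `is_derived`), and statuses-to-lowercase convention are assumptions for illustration, not the actual search-API schema:

```python
def build_dataset_query(statuses=("Published", "QA")):
    """Build a hypothetical search-API payload selecting derived
    datasets in the given statuses (field names are assumed)."""
    return {
        "query": {
            "bool": {
                "must": [
                    # assumed: status values are indexed lowercase
                    {"terms": {"status": [s.lower() for s in statuses]}},
                    # assumed flag distinguishing derived from primary datasets
                    {"term": {"is_derived": True}},
                ]
            }
        }
    }


payload = build_dataset_query()
```

The returned dict would then be POSTed to the search endpoint; each hit's full path gives the location of its session.log.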

It then parses the log, setting start and end points based on completed jobs. If a job's Docker image requires a GPU, the job is counted as a GPU job.
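The parsing logic above can be sketched as follows. The log-line format, the `Started job` / `Completed job` / `requires GPU` markers, and the job-name extraction are all assumptions standing in for the real session.log schema:

```python
import re
from datetime import datetime

# Assumed line shape: "YYYY-MM-DD HH:MM:SS <message>"
LINE_RE = re.compile(r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) (?P<msg>.*)$")


def parse_session_log(lines):
    """Yield (job_name, seconds, is_gpu) for each completed job.

    A job's start line opens an interval, its completion line closes it,
    and a 'requires GPU' line for its image flags it as a GPU job.
    """
    open_jobs = {}  # job name -> [start timestamp, is_gpu flag]
    for line in lines:
        m = LINE_RE.match(line)
        if not m:
            continue
        ts = datetime.strptime(m.group("ts"), "%Y-%m-%d %H:%M:%S")
        msg = m.group("msg")
        if msg.startswith("Started job "):
            open_jobs[msg.removeprefix("Started job ")] = [ts, False]
        elif msg.startswith("Image for job ") and "requires GPU" in msg:
            name = msg.removeprefix("Image for job ").split(" requires")[0]
            if name in open_jobs:
                open_jobs[name][1] = True
        elif msg.startswith("Completed job "):
            name = msg.removeprefix("Completed job ")
            if name in open_jobs:
                start, is_gpu = open_jobs.pop(name)
                yield name, (ts - start).total_seconds(), is_gpu
```

Note that only jobs with a matching completion line are emitted, which mirrors the "completed jobs" restriction described above; anything still in `open_jobs` at the end of the log is ignored.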

Caveats:

  1. Some sub-tasks of a pipeline step that happen on GPU nodes will be counted as CPU time, because the sub-task itself actually runs on the CPU.
  2. It will count duplicated CPU/GPU jobs for failed attempts. If a step or sub-task completed successfully within a failed run and the run is restarted, the parser has no knowledge of this and treats the repeat as an independent job.

Debugged functions.
