Make flexynesis GPU version dependent and add docker user 999 by mira-miracoli · Pull Request #2116 · usegalaxy-eu/infrastructure-playbook

mira-miracoli · 2026-06-08T12:11:29Z

deployed and working ~ somewhat (@nilchia could you try with correct parameters ?)

bgruening · 2026-06-08T12:40:27Z


  toolshed.g2.bx.psu.edu/repos/bgruening/flexynesis/flexynesis/.*:
    rules:
+      - if: helpers.tool_version_eq(tool, '1.1.11+galaxy0')


Should be greater than equal I think, all future version need Docker as well I guess

@nilchia only one specific option needs GPU not all flexynesis jobs needs a GPU correct?

correct. Only "GNN"

nilchia · 2026-06-08T15:50:03Z

@@ -821,7 +825,6 @@ tools:
          retval


Suggested change

- if: helpers.tool_version_gte(tool, '1.1.11+galaxy0')

params:

docker_run_extra_arguments: --user 999

- id: flexynesis_gnn_high_mem

if: |

retval = False

if helpers.tool_version_gte(tool, '1.1.11+galaxy0'):

options = job.get_param_values(app)

if options:

training_type = options.get('training_type', {})

if training_type and isinstance(training_type, dict):

model_select = training_type.get('model_class', {})

if model_select and isinstance(model_select, dict):

retval = model_select.get('model_class') == 'GNN'

retval

gpu: 1

sth like this?

@anuprulez also suggested to specify the GPU like the one here:
https://github.com/usegalaxy-eu/infrastructure-playbook/blob/master/files/galaxy/tpv/tools.yml#L341

Looks good I think, the parabricks tool needs to exclude V100 GPUs, is there also a GPU model or cuda version limitation for your tool?

The tool itseld does not have limitation. I think it is just the Memory limitation.

I ran it once on galaxy and it failed becase of low memory:

torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 3.52 GiB. GPU 0 has a total capacity of 14.56 GiB of which 2.92 GiB is free. Including non-PyTorch memory, this process has 11.64 GiB memory in use. Of the allocated memory 7.82 GiB is allocated by PyTorch, and 3.69 GiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True to avoid fragmentation. See documentation for Memory Management (https://pytorch.org/docs/stable/notes/cuda.html#environment-variables)

toolshed.g2.bx.psu.edu/repos/bgruening/flexynesis/flexynesis/.*: rules: - if: helpers.tool_version_gte(tool, '1.1.11+galaxy0') params: docker_run_extra_arguments: --user 999 - id: flexynesis_gnn_high_mem if: | retval = False if helpers.tool_version_gte(tool, '1.1.11+galaxy0'): options = job.get_param_values(app) if options: training_type = options.get('training_type', {}) if training_type and isinstance(training_type, dict): model_select = training_type.get('model_class', {}) if model_select and isinstance(model_select, dict): retval = model_select.get('model_class') == 'GNN' retval gpu: 1 cores: 20 mem: 100 context: exclude_gpu_models: ["Tesla T4"] # T4 GPUs have only 16 GB of memory, which is not enough for the GNN model

So excluding T4 should fix it.

I tested your rule, but I am not sure if I chose the right parameters, however, my job did not trigger it:

The parameters are correct.

I think the problem is at gpu:1
I think it should be gpus:1

I am also not sure if include_gpu_models is also needed?

make flexynesis GPU version dependent and add docker user 999

afab8c3

mira-miracoli requested review from gsaudade99 and nilchia June 8, 2026 12:11

bgruening reviewed Jun 8, 2026

View reviewed changes

gte instead of eq flexynesis

6ac60a5

nilchia requested changes Jun 8, 2026

View reviewed changes

draft for flexynesis gpu rule and condor_container_gpu model rule

b047a42

mira-miracoli force-pushed the flexynesis-user branch from 0709aea to b047a42 Compare June 10, 2026 12:13

bgruening reviewed Jun 10, 2026

View reviewed changes

Comment thread files/galaxy/tpv/tools.yml Outdated

Apply suggestion from @bgruening

96140be

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make flexynesis GPU version dependent and add docker user 999#2116

Make flexynesis GPU version dependent and add docker user 999#2116
mira-miracoli wants to merge 4 commits into
usegalaxy-eu:masterfrom
mira-miracoli:flexynesis-user

mira-miracoli commented Jun 8, 2026 •

edited

Loading

Uh oh!

bgruening Jun 8, 2026

Uh oh!

nilchia Jun 8, 2026

Uh oh!

nilchia Jun 8, 2026

Uh oh!

nilchia Jun 8, 2026

Uh oh!

nilchia Jun 8, 2026

Uh oh!

mira-miracoli Jun 9, 2026

Uh oh!

nilchia Jun 9, 2026

Uh oh!

nilchia Jun 10, 2026

Uh oh!

mira-miracoli Jun 10, 2026

Uh oh!

nilchia Jun 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

+      - if: helpers.tool_version_gte(tool, '1.1.11+galaxy0')
+        params:
+          docker_run_extra_arguments: --user 999
+      - id: flexynesis_gnn_high_mem
+        if: |
+          retval = False
+          if helpers.tool_version_gte(tool, '1.1.11+galaxy0'):
+            options = job.get_param_values(app)
+            if options:
+              training_type = options.get('training_type', {})
+              if training_type and isinstance(training_type, dict):
+                model_select = training_type.get('model_class', {})
+                if model_select and isinstance(model_select, dict):
+                  retval = model_select.get('model_class') == 'GNN'
+          retval
+        gpu: 1

Conversation

mira-miracoli commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mira-miracoli commented Jun 8, 2026 •

edited

Loading