update whisperx to 3.8.1 by bgruening · Pull Request #1776 · bgruening/galaxytools

bgruening · 2026-02-14T23:07:44Z

No description provided.

bgruening · 2026-02-15T21:28:38Z

@arash77 any idea why the test fails?

Or better: how could the test pass in your first version? :)

arash77 · 2026-02-16T11:09:49Z

@bgruening I can't remember how the tests passed before! But should we include the models in the Dockerfile so that the CI can access them?

bgruening · 2026-02-16T12:34:26Z

No, they are too large I think. Our models are here: https://github.com/usegalaxy-eu/infrastructure-playbook/blob/master/files/galaxy/tpv/tools.yml#L523

arash77 · 2026-02-16T13:04:09Z

Previously, the test for this tool might not have even run because we didn't pass WHISPERX_MODEL_DIR to it. How can we test the tool in CI if we don't have access to the models? Should we use the expect_failure approach?

bgruening · 2026-02-16T13:13:25Z

I don't know, that was my question. The CI definitely ran ... and it turned green. Could it be that Hugging Face downloaded the model on the fly?

arash77 · 2026-02-16T13:16:19Z

But it needs an hf_token that has accepted the terms and conditions to download the models. Since we are using --model_cache_only True, it shouldn't be downloading them anyway.

arash77 · 2026-02-16T13:19:02Z

Also, after the upgrade, WhisperX now requires another model: pyannote/speaker-diarization-community-1.

bgruening · 2026-02-16T13:25:15Z

Ok, then I have no clue how this ever worked on CI :)

Do you have time to test it locally, and then we merge? Or should we YOLO?

arash77 · 2026-02-16T13:29:13Z

I'm trying to test it, but it requires a GPU! I'm also not sure whether a GPU is available on GitHub Actions, but I see we fall back to CPU here.
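For illustration only, a CPU fallback of this kind could look like the sketch below. This is not the tool's actual wrapper logic; `pick_device` is a hypothetical helper, and the only library call assumed is `torch.cuda.is_available()`.

```python
def pick_device(requested: str = "cuda") -> str:
    """Return "cuda" only if it was requested and is usable, else "cpu"."""
    if requested != "cuda":
        return "cpu"
    try:
        # torch is optional here: without it there is no GPU path at all.
        import torch
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    print("running on:", pick_device())
```

On a runner without a GPU (or without torch installed) this prints `running on: cpu`.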

arash77 · 2026-02-16T14:36:07Z

Based on the WhisperX documentation, some models need to be added; then we can run it on Galaxy to test whether it works.
This one should definitely be done: pyannote/speaker-diarization-3.1 is no longer needed and has been replaced by pyannote/speaker-diarization-community-1, so we have to add this model to the infrastructure.
Not sure about this one, but the NLTK punkt_tab tokenizer seems to be required for alignment.
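A quick way to see what is still missing on the infrastructure could be a small check like this. It is only a sketch: it assumes the models under WHISPERX_MODEL_DIR are stored in the Hugging Face hub cache layout (a repo `org/name` lives in a folder `models--org--name`), which may not match how the directory is actually organised. `cache_folder_name` and `missing_models` are hypothetical helper names.

```python
import os
from pathlib import Path

# Model named in this thread as newly required after the upgrade.
REQUIRED_REPOS = ["pyannote/speaker-diarization-community-1"]


def cache_folder_name(repo_id: str) -> str:
    # Hugging Face hub cache naming: "org/name" -> "models--org--name".
    return "models--" + repo_id.replace("/", "--")


def missing_models(model_dir, repo_ids=REQUIRED_REPOS):
    """Return the repo ids that have no cache folder under model_dir."""
    root = Path(model_dir)
    return [r for r in repo_ids if not (root / cache_folder_name(r)).is_dir()]


if __name__ == "__main__":
    # Falls back to the current directory if the variable is not set.
    model_dir = os.environ.get("WHISPERX_MODEL_DIR", ".")
    print("missing models:", missing_models(model_dir))
```

An empty list would mean every listed model has a cache folder; it does not verify that the snapshot inside is complete.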

arash77 · 2026-02-17T16:32:01Z

I have tested it, and the Docker container is safe to work with. However, the models should be updated as I mentioned before; perhaps using this script. Also, make sure to accept the terms for pyannote/speaker-diarization-community-1 and provide the HF_AUTH_TOKEN to it.
And then add export NLTK_DATA=\${WHISPERX_MODEL_DIR}/nltk_data && to the tool.
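To sanity-check that export before running the tool, something like the following could be used. It is a sketch under assumptions: it mirrors the export in Python and expects punkt_tab in NLTK's usual `tokenizers/punkt_tab` subfolder; the `/data/models` default is purely hypothetical.

```python
import os
from pathlib import Path


def nltk_data_dir(model_dir: str) -> str:
    """Mirror the shell export: NLTK_DATA=<WHISPERX_MODEL_DIR>/nltk_data."""
    return os.path.join(model_dir, "nltk_data")


def punkt_tab_present(nltk_data: str) -> bool:
    # NLTK keeps tokenizers under <nltk_data>/tokenizers/<name>.
    return (Path(nltk_data) / "tokenizers" / "punkt_tab").is_dir()


if __name__ == "__main__":
    # "/data/models" is a placeholder default, not the real path.
    model_dir = os.environ.get("WHISPERX_MODEL_DIR", "/data/models")
    # NLTK_DATA has to be set before nltk is imported to take effect.
    os.environ["NLTK_DATA"] = nltk_data_dir(model_dir)
    print("NLTK_DATA =", os.environ["NLTK_DATA"])
    print("punkt_tab present:", punkt_tab_present(os.environ["NLTK_DATA"]))
```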

update whisperx to 3.8.1

e8fd083

bgruening wants to merge 1 commit into master from whisperx381


2 participants
