Xiaoya update prefect3 by xiaoyachong · Pull Request #201 · mlexchange/mlex_highres_segmentation

xiaoyachong · 2025-11-12T03:20:52Z

This PR focuses on the Prefect upgrade. It is compatible with the upcoming changes in mlex_utils and mlex_prefect_worker and includes the following updates:

1. Add algorithm registry:
The application now reads the .json file and saves the algorithm details to MLflow, and subsequently retrieves them from MLflow when needed.
This enables the Prefect worker to access algorithm information directly from MLflow, rather than relying on parameters passed from the application.
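As a rough illustration of the registry idea (not the actual mlex_utils client), the sketch below loads algorithm entries from a JSON document and serves them by name. The `AlgorithmRegistry` class, the `model_name` key, and the sample fields are assumptions made for illustration; the real implementation stores and retrieves these details through MLflow.

```python
import json


class AlgorithmRegistry:
    """In-memory stand-in for a registry that would be backed by MLflow."""

    def __init__(self):
        self._algorithms = {}

    def register_from_json(self, text):
        # Assumed layout: a JSON list of objects, each with a "model_name" key.
        for entry in json.loads(text):
            self._algorithms[entry["model_name"]] = entry

    @property
    def modelname_list(self):
        # Sorted list of registered names; empty when nothing is registered.
        return sorted(self._algorithms)

    def __getitem__(self, key):
        try:
            return self._algorithms[key]
        except KeyError:
            raise KeyError(f"An algorithm with name '{key}' does not exist.") from None


# Hypothetical sample entry; the real models.json layout may differ.
SAMPLE_JSON = '[{"model_name": "example_model", "image_name": "example/image:latest"}]'

registry = AlgorithmRegistry()
registry.register_from_json(SAMPLE_JSON)
```

With this shape, a worker only needs the model name to look up everything else, which is the point of moving the details out of the request parameters.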

2. Simplify parameters and remove flow_type in segmentation.py:
The structure of parameters sent from the application to the Prefect worker has been simplified as follows:

{
    "model_name": model_name,
    "task_name": "train",
    "params": {
        "io_parameters": io_parameters,
        "model_parameters": model_parameters,
    },
}

Meanwhile, the following credentials have been removed from io_parameters:

data_tiled_api_key
mask_tiled_api_key
seg_tiled_api_key
mlflow_tracking_username
mlflow_tracking_password

These credentials are now stored on the Prefect worker side, eliminating the need to send them from the application to the Prefect worker and improving security and configuration consistency.
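A minimal sketch of how the simplified payload could be assembled while dropping the credential keys listed above. `build_flow_parameters` is a hypothetical helper, and the sample keys `data_tiled_uri` and `num_epochs` are placeholders rather than names taken from the repository.

```python
# Credential keys the PR description lists as removed from io_parameters.
CREDENTIAL_KEYS = frozenset(
    {
        "data_tiled_api_key",
        "mask_tiled_api_key",
        "seg_tiled_api_key",
        "mlflow_tracking_username",
        "mlflow_tracking_password",
    }
)


def build_flow_parameters(model_name, task_name, io_parameters, model_parameters):
    """Hypothetical helper: build the simplified payload without credentials."""
    clean_io = {k: v for k, v in io_parameters.items() if k not in CREDENTIAL_KEYS}
    return {
        "model_name": model_name,
        "task_name": task_name,
        "params": {
            "io_parameters": clean_io,
            "model_parameters": dict(model_parameters),
        },
    }


payload = build_flow_parameters(
    "example_model",
    "train",
    # Placeholder inputs for illustration only.
    {"data_tiled_uri": "http://example/data", "data_tiled_api_key": "secret"},
    {"num_epochs": 3},
)
```

Filtering on the application side is only a safety net in this sketch; per the description, the worker reads the credentials from its own configuration.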

Companion PRs:
tomo: mlexchange/mlex_tomo_framework#16
prefect worker: mlexchange/mlex_prefect_worker#26
mlex_utils: mlexchange/mlex_utils#5
dlsia_proto: mlexchange/mlex_dlsia_segmentation_prototype#38

To see the specific tasks where the Asana app for GitHub is being used, see below:
- https://app.asana.com/0/0/1211925772191119

gitnotebooks · 2025-11-12T03:20:55Z

Review these changes at https://app.gitnotebooks.com/mlexchange/mlex_highres_segmentation/pull/201

taxe10

This is great! I have a couple of comments that should be addressed prior to merging:

Enhanced interface startup when no algorithms have been registered - I accidentally started the app without registering the algorithms first, and the application failed to initialize, so I added the following:

% git diff components/control_bar.py 
diff --git a/components/control_bar.py b/components/control_bar.py
index 80e0b59..c39a056 100644
--- a/components/control_bar.py
+++ b/components/control_bar.py
@@ -594,7 +594,7 @@ def layout():
                                         data=models.modelname_list,
                                         value=(
                                             models.modelname_list[0]
-                                            if models.modelname_list[0]
+                                            if len(models.modelname_list) > 0 and models.modelname_list[0]
                                             else None
                                         ),
                                         placeholder="Select a model...",
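The same check can also be written on truthiness alone, since an empty list is falsy in Python. A minimal sketch, with `default_model` as a hypothetical helper that is not part of the repository:

```python
def default_model(modelname_list):
    """Return the first model name, or None if the list is empty or its first entry is falsy."""
    if modelname_list and modelname_list[0]:
        return modelname_list[0]
    return None
```

Calling `default_model(models.modelname_list)` in the layout would keep the dropdown's `value` expression short and avoid the `IndexError` on an empty list.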

In the future - we could add a "refresh" button near the model selection controls to check if new algorithms have been made available in the application, but I don't think this is needed at this time.

Additionally, I was wondering if we could do a quick model registration at the application startup - something like:

  mlex_segmentation:
    build:
      context: ./mlex_highres_segmentation
      dockerfile: Dockerfile
    command: 'python scripts/save_mlflow_algorithm.py && gunicorn -b 0.0.0.0:8075 --reload app:server'

FYI - I have not been able to test the Prefect 3.x integration locally due to additional comments in the worker's PR

requirements.txt

components/parameter_items.py

xiaoyachong · 2025-11-19T03:37:31Z

Thanks for your revision! I’ve made the changes accordingly in the latest commit. I also changed the command in mlexchange/mlex_tomo_framework#16.

taxe10

These changes worked well at my end.

Just a couple of comments for follow-up in the next PR:

The status check for the prefect worker switches to ready as soon as the parent flow pool becomes available, even if the individual child worker (conda/docker/etc.) is not actually ready. This can mislead users, so we should refine this logic or clarify the status reporting.
For MLflow model registration, we're currently using the Prefect flow run ID. We should consider switching to human-readable names—possibly the job name—but we need to think through a long-term strategy that supports multi-tenant scenarios.

Wiebke

Thanks for the PR. I think this is pretty ready to merge. I left some minor comments.
Prior to merging we should:

first merge the mlex_utils PR and update requirements.txt accordingly,
[optionally] still introduce the guarding against no models being found

Wiebke · 2025-12-03T20:50:02Z

components/control_bar.py

+                                            if len(models.modelname_list) > 0
+                                            and models.modelname_list[0]


I am not sure if this is coming out of the options here, but with no models registered yet, I get this error:

Traceback (most recent call last):
  File "/usr/local/lib/python3.11/site-packages/mlex_utils/mlflow_utils/mlflow_algorithm_client.py", line 267, in __getitem__
    return self.algorithms[key]
           ^^^^^^^^^^^^^^^^^^^^
KeyError: None

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/app/utils/data_utils.py", line 402, in __getitem__
    return self.mlflow_client[key]
           ^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/mlex_utils/mlflow_utils/mlflow_algorithm_client.py", line 269, in __getitem__
    raise KeyError(f"An algorithm with name '{key}' does not exist.")
KeyError: "An algorithm with name 'None' does not exist."

During handling of the above exception, another exception occurred:

KeyError: 'A model with name None does not exist.'

Note that the reason for me starting the application without models registered is that I unintentionally overwrote the command in my docker-compose.override.yaml.
Some error handling for this might be useful though.

One idea could be to update the update_model_parameters callback

https://github.com/xiaoyachong/mlex_highres_segmentation/blob/f742fb97868606da4d7f471b8f1f138fc60832cd/callbacks/control_bar.py#L956-L969

with an additional check:

if not model_name: return html.Div("No model available.")

Sounds good. I’ve added a safeguard in the update_model_parameters callback in control_bar.py to handle cases where no models are found.
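For reference, the guard pattern discussed in this thread can be sketched independently of Dash. `render_model_parameters` below is a hypothetical stand-in for the body of the `update_model_parameters` callback and returns plain strings where the real callback would return components such as `html.Div`; the `registry` argument is assumed to be any mapping that raises `KeyError` for unknown names.

```python
def render_model_parameters(model_name, registry):
    """Hypothetical stand-in for the callback body, returning strings instead of Dash components."""
    # Early return covers None and empty strings, so the lookup never sees them.
    if not model_name:
        return "No model available."
    try:
        algorithm = registry[model_name]
    except KeyError:
        # A name that is set but not registered gets its own message.
        return f"Model '{model_name}' is not registered."
    return f"Parameters for {model_name}: {sorted(algorithm)}"
```

Checking `model_name` before the lookup avoids the chained `KeyError` shown in the traceback above when the dropdown value is `None`.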

Wiebke · 2025-12-03T22:37:05Z

scripts/save_mlflow_algorithm.py

+load_dotenv(dotenv_path="../.env")
+
+# MLflow Configuration from environment variables
+MLFLOW_TRACKING_URI = os.getenv("MLFLOW_TRACKING_URI_OUTSIDE", "http://localhost:5000")
+MLFLOW_TRACKING_USERNAME = os.getenv("MLFLOW_TRACKING_USERNAME", "")
+MLFLOW_TRACKING_PASSWORD = os.getenv("MLFLOW_TRACKING_PASSWORD", "")
+# Algorithm JSON path from environment variable
+ALGORITHM_JSON_PATH = os.getenv("ALGORITHM_JSON_PATH", "../assets/models.json")


In a future PR (in line with archiving mlex_tomo_framework), it might make sense to convert this script to use typer and read from environment variables.

Thanks for the kind reminder!
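One possible shape for the environment-driven part of that follow-up, sketched with the standard library only. It does not use typer; a typer-based CLI could pass the same values through options bound to environment variables. `MlflowSettings` is a hypothetical name, and the defaults mirror the snippet under review.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class MlflowSettings:
    tracking_uri: str
    username: str
    password: str
    algorithm_json_path: str

    @classmethod
    def from_env(cls, environ=None):
        # Read from the given mapping, or from the process environment by default.
        env = os.environ if environ is None else environ
        return cls(
            tracking_uri=env.get("MLFLOW_TRACKING_URI_OUTSIDE", "http://localhost:5000"),
            username=env.get("MLFLOW_TRACKING_USERNAME", ""),
            password=env.get("MLFLOW_TRACKING_PASSWORD", ""),
            algorithm_json_path=env.get("ALGORITHM_JSON_PATH", "../assets/models.json"),
        )


# Passing a mapping explicitly keeps the example independent of the real environment.
settings = MlflowSettings.from_env({"MLFLOW_TRACKING_URI_OUTSIDE": "http://mlflow:5000"})
```

Grouping the values in one object makes it easier to see which variables the script depends on and to test it without touching the process environment.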

xiaoyachong · 2025-12-03T23:45:30Z

Thanks for the kind reminder!

Wiebke

LGTM!

xiaoyachong added 2 commits November 11, 2025 19:10

add algorithm registry

2995203

update params for new prefect worker

3fc2d6f

xiaoyachong added 3 commits November 11, 2025 19:25

use isort and black

a4e4a43

use isort and black

1d50e07

remove credentials from io_params

ed97a32

xiaoyachong mentioned this pull request Nov 13, 2025

Xiaoya update prefect3 mlexchange/mlex_tomo_framework#16

Merged

xiaoyachong added 2 commits November 13, 2025 12:03

update docker compose with tomo

be3ef92

import from mlex_utils

28ec22a

xiaoyachong requested review from Wiebke and taxe10 November 13, 2025 20:28

taxe10 marked this pull request as ready for review November 14, 2025 17:08

xiaoyachong added 3 commits November 15, 2025 18:57

change batch size for tunet3+

87e72cf

update readme

0b76c6b

Merge remote-tracking branch 'upstream/main' into xiaoya-update-prefect3

ba9bb3a

taxe10 requested changes Nov 19, 2025

requirements.txt (outdated, resolved)

components/parameter_items.py (resolved)

xiaoyachong added 3 commits November 18, 2025 18:19

fix mlflow component

38d4baa

fix mlflow component

d927f6f

update cmd

d484557

xiaoyachong added 4 commits November 19, 2025 09:41

update model_dir

ace04b4

update model_dir

c2d6453

load dvc html from mlflow

bbe978b

load dvc html from mlflow

414b1a7

xiaoyachong mentioned this pull request Nov 19, 2025

save dvc html to mlflow mlexchange/mlex_dlsia_segmentation_prototype#39

Closed

xiaoyachong requested a review from taxe10 November 20, 2025 23:00

xiaoyachong mentioned this pull request Nov 21, 2025

Fix result selection #206

Merged

✨ DLSIA refactor

6a65f67

This was referenced Nov 24, 2025

DLSIA refactor #207

Closed

Refactor mlexchange/mlex_dlsia_segmentation_prototype#38

Merged

Merge remote-tracking branch 'upstream/main' into xiaoya-update-prefect3

f742fb9

taxe10 reviewed Dec 3, 2025


Wiebke reviewed Dec 3, 2025


guide against no model available

a538f80

xiaoyachong requested review from Wiebke and taxe10 December 3, 2025 23:45

update mlex_utils

5b1adb4

Wiebke approved these changes Dec 4, 2025


Wiebke merged commit 21c6c87 into mlexchange:main Dec 4, 2025
2 checks passed
