Update scikit-learn to version 1.5 by tomerbv · Pull Request #967 · mlrun/functions

tomerbv · 2026-01-19T15:47:05Z

Updates scikit-learn dependency from 1.3 to ~=1.5 with necessary compatibility fixes.

Changes by function:

auto_trainer

Added monkey-patches for sklearn metrics to handle removed squared parameter (MLRun 1.10.0 compatibility)
Updated tests to use get_or_create_project() instead of new_project()
Added lightgbm models to tests

gen_class_data

Fixed test artifact path for local mode execution

describe

Updated dependencies only

All functions:

Updated requirements.txt: scikit-learn 1.3 → ~1.5
Updated item.yaml and regenerated function.yaml

fixes and patches for new scikit-learn version changes in item.yaml and regenerate function.yaml

fix _get_dataframe to handle list/dict before accessing artifact_url added feature name preservation logic in predict function

mlrun v 1.10

added skip for test_train in test_auto_trainer.py

change XGBRegressor to LGBMRegressor

guy1992l

one last thing

guy1992l · 2026-02-11T23:39:21Z

functions/src/auto_trainer/test_auto_trainer.py

    ("sklearn.linear_model.LinearRegression", "regression"),
    ("sklearn.ensemble.RandomForestClassifier", "classification"),
-    ("xgboost.XGBRegressor", "regression"),
+    ("lightgbm.LGBMRegressor", "regression"),


add these to the test so we can know for sure all is working with the feature names changes:

("xgboost.XGBRegressor", "regression"), ("xgboost.XGBClassifier", "classification"), ("lightgbm.LGBMRegressor", "regression"), # This one you added already. ("lightgbm.LGBMClassifier", "classification")

…assifier models to test

* updated scikit-learn~=1.5 fixes and patches for new scikit-learn version changes in item.yaml and regenerate function.yaml * remove filename * remove numpy import * revert sklearn.metrics monkey patch fix _get_dataframe to handle list/dict before accessing artifact_url added feature name preservation logic in predict function * revert mlrun version * revert get_or_create_project * revert scikit-learn version * scikit-learn==1.5.2 mlrun v 1.10 * scikit-learn==1.4.2 * revert scikit-learn<1.4.0 * scikit-learn~=1.5 * mlrun 1.10 with scikit-learn<1.4.0 * scikit-learn strict v~=1.5.2 added skip for test_train in test_auto_trainer.py * revert sklearn_classifier.py changes change XGBRegressor to LGBMRegressor * added xgboost.XGBRegressor, xgboost.XGBClassifier and lightgbm.LGBMClassifier models to test

* Update scikit-learn to version 1.5 (#967) * updated scikit-learn~=1.5 fixes and patches for new scikit-learn version changes in item.yaml and regenerate function.yaml * remove filename * remove numpy import * revert sklearn.metrics monkey patch fix _get_dataframe to handle list/dict before accessing artifact_url added feature name preservation logic in predict function * revert mlrun version * revert get_or_create_project * revert scikit-learn version * scikit-learn==1.5.2 mlrun v 1.10 * scikit-learn==1.4.2 * revert scikit-learn<1.4.0 * scikit-learn~=1.5 * mlrun 1.10 with scikit-learn<1.4.0 * scikit-learn strict v~=1.5.2 added skip for test_train in test_auto_trainer.py * revert sklearn_classifier.py changes change XGBRegressor to LGBMRegressor * added xgboost.XGBRegressor, xgboost.XGBClassifier and lightgbm.LGBMClassifier models to test * [module][langchain_mlrun] First version (#963) * [Build] Fix html links, Add <function>.html as source in documentation * Update CI temporarily and update index * [XGB-Custom] Fix test artifact key name * [XGB-Serving][XGB-Test][XGB-Trainer] Fix tests - artifact key * [Build] Install python 3.9 when testing (#618) * [Build] Update python version in CI (#620) * [Build] Install python 3.9 when testing * [Build] Update python version in CI * . * Revert "[Build] Update python version in CI (#620)" (#621) This reverts commit 0cd1f15. * Revert "[Build] Install python 3.9 when testing (#618)" (#619) This reverts commit 3301415. * [Build] Build with python 3.9 (#622) * [Build] Build with python 3.9 * . * [onnx utils] update onnx utils packages * [Noise-reduction] Add new function to hub (#765) * [Noise-reduction] Add new function to hub * fix test * added multiprocessing and silence removal to function * delete `load_dask` (#822) * [feature selection] update function yaml * [feature selection] update function yaml * Revert "[onnx utils] update onnx utils packages" This reverts commit 8872798. * [feature selection] update function yaml * [feature selection] update function yaml * Delete unsupported functions from the hub (#824) * delete EOS functions * bring back validate_great_expectations * bring back load_dataset * Update feature_selection/test_feature_selection.py Co-authored-by: Eyal Danieli <eyald@iguazio.com> * Update item.yaml * Align to master branch (#826) * [Category] Fix and add categories to functions (#808) * [Category] Fix and add categories to functions * bump version in structured * test is not valid in huggingface_serving * Fix duplicated footer * Fix duplicated footer * revert python version change as it will be done in another PR * comments * comments * Bump python:3.6 to python:3.9 (#810) * [Describe] Align describe to new pandas version (#812) * [Describe] Align describe to new pandas version * minor test fix * update mlrun version * add dask to requirements * remove dask * update numpy version * debug * debug * debug * remove dask tests * remove debug code * [get_offline_features] Updated to mlrun 1.6.3 (#813) * [Feature-selection] Replace matplotlib with plotly (#815) * Iguazio-cicd user token updated Iguazio-cicd user token updated in repo secrets: https://github.com/mlrun/functions/settings/secrets/actions MARKETPLACE_ACCESS_TOKEN_V3 new token gh...Zmf was set around April * forcing iguazio-cicd auth forcing iguazio-cicd to deal with Author identity unknown * checkout@v3 to v4 and echo * [Mlflow_utils] - mlflow model server (#811) * mlflow server * small fix to test * small fixes to ms and nb * small fixes to mlrun version * update requirements lightgbm * added req * Added xgboost to req --------- Co-authored-by: Avi Asulin <34214569+aviaIguazio@users.noreply.github.com> * [Mlflow] Remove mlflow tag (#825) * remove mlflow tag * remove mlflow tag --------- Co-authored-by: Avi Asulin <34214569+aviaIguazio@users.noreply.github.com> * align feature_selection yaml --------- Co-authored-by: Avi Asulin <34214569+aviaIguazio@users.noreply.github.com> Co-authored-by: Yonatan Shelach <92271540+yonishelach@users.noreply.github.com> Co-authored-by: rokatyy <ekaterina_molchanova@mckinsey.com> Co-authored-by: Katerina Molchanova <35141662+rokatyy@users.noreply.github.com> Co-authored-by: nashpaz123 <44337075+nashpaz123@users.noreply.github.com> Co-authored-by: ZeevRispler <73653682+ZeevRispler@users.noreply.github.com> * set `navigation_with_keys` to False (#829) * remove xgb and churn functions (#830) * [Batch Infer V2] Adjust function to 1.7 (#832) * adjust batch infer v2 * update docs in NB * bring back deprecated params and add warn (#834) * fix PyYAML loading (#837) * [text to audio generator] Replaced bark with openai tts models (#836) * [Text to audio generator] Add speech engine (#838) * [text to audio generator] Replaced bark with openai tts models * [text to audio generator] Fix base url env var * fix version * Add speech engine * after review * [auto-trainer] update test requirements (#839) * [Build] Fix html links, Add <function>.html as source in documentation * Update CI temporarily and update index * [XGB-Custom] Fix test artifact key name * [XGB-Serving][XGB-Test][XGB-Trainer] Fix tests - artifact key * [Build] Install python 3.9 when testing (#618) * [Build] Update python version in CI (#620) * [Build] Install python 3.9 when testing * [Build] Update python version in CI * . * Revert "[Build] Update python version in CI (#620)" (#621) This reverts commit 0cd1f15. * Revert "[Build] Install python 3.9 when testing (#618)" (#619) This reverts commit 3301415. * [Build] Build with python 3.9 (#622) * [Build] Build with python 3.9 * . * Update requirements.txt * [Feature Selection] Fix deprecated `get_offline_features` (#844) * fix feature_selection * fix feature_selection * fix feature_selection nb * update yaml name * fix test * fix test * limit torchaudio for unit test (#845) * Update requirements.txt (#843) * [Open Archive] Fix arbitrary file vulnerability (#847) * fix arbitrary file vulnerability * fix arbitrary file vulnerability * fix test * [open_archive] Add traversal attack test (#849) * add traversal test * add traversal test * add traversal test * first version * run upg to pydantic v2 * added kafka and mlrun-ce code preparation * Eyal review * LangChain MLRun Integration with Kafka Support (CE Mode) (#1) * docstring syntax fixes * Add initial Kafka support for MLRun CE Implement _KafkaMLRunEndPointClient with KafkaProducer Add kafka_broker and kafka_topic to MLRunTracerClientSettings Add Kafka parameters to setup_langchain_monitoring() Update notebook to auto-detect CE/Enterprise mode Add kafka-python, orjson, uuid-utils to requirements.txt * added KAFKA_BROKER and DatastoreProfileTDEngine for registering kafka and tsdb profiles * clear output * use get_kafka_topic for the project name remove "raises:" docstring added kafka flush added s3fs to requirements.txt * adaptive notebook for local execution - AWS_ENDPOINT_URL_S3 env variable in deployment - port forwarding scripts * Replace kafka_broker/kafka_topic with stream_profile_name - Update _KafkaMLRunEndPointClient to use DatastoreProfileKafkaStream - Fetch Kafka config (broker, topic, SASL, SSL) from registered profile - Auto-retrieve stream_profile_name from model monitoring credentials - Update MLRunTracerClientSettings with new stream_profile_name field - Update setup_langchain_monitoring() to use profile-based config - Update notebook to use simplified API * Add get_kafka_stream_profile_name() utility to auto-detect Kafka profile * Move dependencies from requirements.txt to item.yaml * generalize env variables * Deploy monitoring app once instead of redeploy Unify profile variable naming for CE and Enterprise modes * _KafkaMLRunEndPointClient constructor - keep project handling NONE able with parent class handling in * revert removal of raise docstring enforce usage of stream_profile_name * change stream_profile_name to kafka_stream_profile_name revert ValueError message * Revert ValueError message Added mechanism to flush stream upon root run (instead of each monitor call) * ensure self._mlrun_client is not None before flush * revert double deploy * Add configurable kafka_linger_ms for hybrid flush control Adds kafka_linger_ms parameter to control message delivery timing: - Explicit flush mode (linger_ms=0, default): flush after each root run - Kafka-managed mode (linger_ms>0): Kafka controls delivery timing The flush() method now handles the mode internally - it's a no-op when Kafka-managed mode is enabled, keeping the tracer code simple. * fix double deploy in notebook (set the in-cluster MinIO endpoint before deploying) * Simplify Kafka flush behavior and enable batching by default - Always flush at end of root run (removed conditional linger_ms check) - Set default kafka_linger_ms to 500ms for message batching - Simplify KafkaProducer initialization (pop bootstrap_servers instead of lambda) * remove local module import from notebook remove redundant requirements from item.yaml * move env variables setup to top of the notebook * renamed env vars * fix test * gilad review --------- Co-authored-by: yonishelach <yonatanshelach@gmail.com> Co-authored-by: Yoni Shelach <92271540+yonishelach@users.noreply.github.com> Co-authored-by: Avi Asulin <avia@iguazio.com> Co-authored-by: Eyal Danieli <eyald@iguazio.com> Co-authored-by: Avi Asulin <34214569+aviaIguazio@users.noreply.github.com> Co-authored-by: rokatyy <ekaterina_molchanova@mckinsey.com> Co-authored-by: Katerina Molchanova <35141662+rokatyy@users.noreply.github.com> Co-authored-by: nashpaz123 <44337075+nashpaz123@users.noreply.github.com> Co-authored-by: ZeevRispler <73653682+ZeevRispler@users.noreply.github.com> Co-authored-by: daniels290813 <78727943+daniels290813@users.noreply.github.com> Co-authored-by: Tomer Weitzman <81749152+tomerbv@users.noreply.github.com> * [onnx_utils] updated pytorch due to security vulnerability (#968) * Update onnx_utils dependencies and improve test robustness - Upgrade PyTorch (2.6.0 → 2.8.0) and TorchVision (0.21.0 → 0.23.0) for better compatibility and performance - Update MLRun version requirement to 1.10.0 in item.yaml - Bump function version to 1.4.0 Test improvements: - Add environment variable validation (MLRUN_DBPATH, MLRUN_ARTIFACT_PATH) - Add conditional test skipping based on tf2onnx availability - Fix cleanup function to properly remove test artifacts (model.pt, model_modules_map.json, onnx_model.onnx, etc.) - Update deprecated artifact_path parameter to output_path - Add explicit project context to all MLRun function calls - Fix PyTorch test artifact path construction * Add conftest fixture for test environment and update notebook to PyTorch demo - Centralize test setup/cleanup in conftest autouse fixture - Rewrite notebook demo from Keras to a working PyTorch MobileNetV2 example * deleted iguazio credentials * Remove conftest.py and inline fixtures into test_onnx_utils.py Move onnx_project and test_environment fixtures directly into the test file to reduce unnecessary indirection for a single test module. --------- Co-authored-by: Tomer Weitzman <81749152+tomerbv@users.noreply.github.com> Co-authored-by: guy1992l <83535508+guy1992l@users.noreply.github.com> Co-authored-by: yonishelach <yonatanshelach@gmail.com> Co-authored-by: Yoni Shelach <92271540+yonishelach@users.noreply.github.com> Co-authored-by: Avi Asulin <avia@iguazio.com> Co-authored-by: Avi Asulin <34214569+aviaIguazio@users.noreply.github.com> Co-authored-by: rokatyy <ekaterina_molchanova@mckinsey.com> Co-authored-by: Katerina Molchanova <35141662+rokatyy@users.noreply.github.com> Co-authored-by: nashpaz123 <44337075+nashpaz123@users.noreply.github.com> Co-authored-by: ZeevRispler <73653682+ZeevRispler@users.noreply.github.com> Co-authored-by: daniels290813 <78727943+daniels290813@users.noreply.github.com> Co-authored-by: Omer Mimon <81911093+omermaim@users.noreply.github.com>

tomerbv added 14 commits January 19, 2026 16:39

updated scikit-learn~=1.5

3ab150c

fixes and patches for new scikit-learn version changes in item.yaml and regenerate function.yaml

remove filename

613d020

Merge branch 'development' into scikitlearn_version

71e3d00

remove numpy import

6256dcf

revert sklearn.metrics monkey patch

227cc95

fix _get_dataframe to handle list/dict before accessing artifact_url added feature name preservation logic in predict function

revert mlrun version

1c2323c

revert get_or_create_project

ef99df2

revert scikit-learn version

da51395

scikit-learn==1.5.2

c89f1c3

mlrun v 1.10

scikit-learn==1.4.2

63b968c

revert scikit-learn<1.4.0

dc2a766

scikit-learn~=1.5

514f4a4

mlrun 1.10 with scikit-learn<1.4.0

1f76feb

scikit-learn strict v~=1.5.2

ace141e

added skip for test_train in test_auto_trainer.py

gilad-shaham requested a review from guy1992l January 30, 2026 10:58

tomerbv added 3 commits February 9, 2026 13:34

Merge branch 'development' into scikitlearn_version

844f74b

Merge branch 'development' into scikitlearn_version

671d6ac

revert sklearn_classifier.py changes

94ff0d5

change XGBRegressor to LGBMRegressor

guy1992l requested changes Feb 11, 2026

View reviewed changes

added xgboost.XGBRegressor, xgboost.XGBClassifier and lightgbm.LGBMCl…

2078e86

…assifier models to test

Eyal-Danieli merged commit 3d1e85c into mlrun:development Feb 12, 2026
5 checks passed

Eyal-Danieli mentioned this pull request Feb 16, 2026

Cherry Pick from Development #973

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update scikit-learn to version 1.5 #967

Update scikit-learn to version 1.5 #967
Eyal-Danieli merged 18 commits intomlrun:developmentfrom
tomerbv:scikitlearn_version

tomerbv commented Jan 19, 2026 •

edited

Loading

Uh oh!

guy1992l left a comment

Uh oh!

guy1992l Feb 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

tomerbv commented Jan 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

guy1992l left a comment

Choose a reason for hiding this comment

Uh oh!

guy1992l Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tomerbv commented Jan 19, 2026 •

edited

Loading