Skip to content

Releases: kedro-org/kedro

0.18.13

31 Aug 10:35
0293dc1

Choose a tag to compare

Release 0.18.13

Major features and improvements

  • Added support for Python 3.11. This includes tackling challenges like dependency pinning and test adjustments to ensure a smooth experience. Detailed migration tips are provided below for further context.
  • Added new OmegaConfigLoader features:
    • Allowed registering of custom resolvers to OmegaConfigLoader through CONFIG_LOADER_ARGS.
    • Added support for global variables to OmegaConfigLoader.
  • Added kedro catalog resolve CLI command that resolves dataset factories in the catalog with any explicit entries in the project pipeline.
  • Implemented a flat conf/ structure for modular pipelines, and accordingly, updated the kedro pipeline create and kedro catalog create command.
  • Updated new Kedro project template and Kedro starters:
    • Change Kedro starters and new Kedro projects to use OmegaConfigLoader.
    • Converted setup.py in new Kedro project template and Kedro starters to pyproject.toml and moved flake8 configuration
      to dedicated file .flake8.
    • Updated the spaceflights starter to use the new flat conf/ structure.

Bug fixes and other changes

  • Updated OmegaConfigLoader to ignore config from hidden directories like .ipynb_checkpoints.

Documentation changes

  • Revised the data section to restructure beginner and advanced pages about the Data Catalog and datasets.
  • Moved contributor documentation to the GitHub wiki.
  • Updated example of using generator functions in nodes.
  • Added migration guide from the ConfigLoader and the TemplatedConfigLoader to the OmegaConfigLoader. The ConfigLoader and the TemplatedConfigLoader are deprecated and will be removed in the 0.19.0 release.

Migration Tips for Python 3.11:

  • PyTables on Windows: Users on Windows with Python >=3.8 should note we've pinned pytables to 3.8.0 due to compatibility issues.
  • Spark Dependency: We've set an upper version limit for pyspark at <3.4 due to breaking changes in 3.4.
  • Testing with Python 3.10: The latest moto version now supports parallel test execution for Python 3.10, resolving previous issues.

Breaking changes to the API

Upcoming deprecations for Kedro 0.19.0

  • Renamed abstract dataset classes, in accordance with the Kedro lexicon. Dataset classes ending with "DataSet" are deprecated and will be removed in 0.19.0. Note that all of the below classes are also importable from kedro.io; only the module where they are defined is listed as the location.
Type Deprecated Alias Location
AbstractDataset AbstractDataSet kedro.io.core
AbstractVersionedDataset AbstractVersionedDataSet kedro.io.core
  • Using the layer attribute at the top level is deprecated; it will be removed in Kedro version 0.19.0. Please move layer inside the metadata -> kedro-viz attributes.

Community contributions

Thanks to Laíza Milena Scheid Parizotto and Jonathan Cohen.

0.18.12

01 Aug 14:33
7ac654d

Choose a tag to compare

Release 0.18.12

Major features and improvements

  • Added dataset factories feature which uses pattern matching to reduce the number of catalog entries.
  • Activated all built-in resolvers by default for OmegaConfigLoader except for oc.env.
  • Added kedro catalog rank CLI command that ranks dataset factories in the catalog by matching priority.

Bug fixes and other changes

  • Consolidated dependencies and optional dependencies in pyproject.toml.
  • Made validation of unique node outputs much faster.
  • Updated kedro catalog list to show datasets generated with factories.

Documentation changes

  • Recommended ruff as the linter and removed mentions of pylint, isort, flake8.

Community contributions

Thanks to Laíza Milena Scheid Parizotto and Chris Schopp.

Breaking changes to the API

Upcoming deprecations for Kedro 0.19.0

  • ConfigLoader and TemplatedConfigLoader will be deprecated. Please use OmegaConfigLoader instead.

0.18.11

03 Jul 12:54
8065fb0

Choose a tag to compare

Release 0.18.11

Major features and improvements

  • Added databricks-iris as an official starter.

Bug fixes and other changes

  • Reworked micropackaging workflow to use standard Python packaging practices.
  • Make kedro micropkg package accept --verbose.

Documentation changes

  • Significant improvements to the documentation that covers working with Databricks and Kedro, including a new page for workspace-only development, and a guide to choosing the best workflow for your use case.
  • Updated documentation for deploying with Prefect for version 2.0.

0.18.10

08 Jun 17:51
20f7eec

Choose a tag to compare

Major features and improvements

  • Rebrand across all documentation and Kedro assets.
  • Added support for variable interpolation in the catalog with the OmegaConfigLoader.

0.18.9

31 May 16:58
2fa1478

Choose a tag to compare

Major features and improvements

  • kedro run --params now updates interpolated parameters correctly when using OmegaConfigLoader.
  • Added metadata attribute to kedro.io datasets. This is ignored by Kedro, but may be consumed by users or external plugins.
  • Added kedro.logging.RichHandler. This replaces the default rich.logging.RichHandler and is more flexible, user can turn off the rich traceback if needed.

Bug fixes and other changes

  • OmegaConfigLoader will return a dict instead of DictConfig.
  • OmegaConfigLoader does not show a MissingConfigError when the config files exist but are empty.

Documentation changes

  • Added documentation for collaborative experiment tracking within Kedro-Viz.
  • Revised section on deployment to better organise content and reflect how recently docs have been updated.
  • Minor improvements to fix typos and revise docs to align with engineering changes.

Breaking changes to the API

  • kedro package does not produce .egg files anymore, and now relies exclusively on .whl files.

Community contributions

Many thanks to the following Kedroids for contributing PRs to this release:

0.18.8

02 May 12:16
2e70dec

Choose a tag to compare

Major features and improvements

  • Added KEDRO_LOGGING_CONFIG environment variable, which can be used to configure logging from the beginning of the kedro process.
  • Removed logs folder from the kedro new project template. File-based logging will remain but just be level INFO and above and go to project root instead.

Bug fixes and other changes

  • Improvements to Jupyter E2E tests.
  • Added full kedro run CLI command to session store to improve run reproducibility using Kedro-Viz experiment tracking.

Documentation changes

  • Improvements to documentation about configuration.
  • Improvements to Sphinx toolchain including incrementing to use a newer version.
  • Improvements to documentation on visualising Kedro projects on Databricks, and additional documentation about the development workflow for Kedro projects on Databricks.
  • Updated Technical Steering Committee membership documentation.
  • Revised documentation section about linting and formatting and extended to give details of flake8 configuration.
  • Updated table of contents for documentation to reduce scrolling.
  • Expanded FAQ documentation.
  • Added a 404 page to documentation.
  • Added deprecation warnings about the removal of kedro.extras.datasets.

0.18.7

22 Mar 16:11
0591590

Choose a tag to compare

Release 0.18.7

Major features and improvements

  • Added new Kedro CLI kedro jupyter setup to setup Jupyter Kernel for Kedro.
  • kedro package now includes the project configuration in a compressed tar.gz file.
  • Added functionality to the OmegaConfigLoader to load configuration from compressed files of zip or tar format. This feature requires fsspec>=2023.1.0.
  • Significant improvements to on-boarding documentation that covers setup for new Kedro users. Also some major changes to the spaceflights tutorial to make it faster to work through. We think it's a better read. Tell us if it's not.

Bug fixes and other changes

  • Added a guide and tooling for developing Kedro for Databricks.
  • Implement missing dict-like interface for _ProjectPipeline.

0.18.6

06 Mar 11:55
22fc270

Choose a tag to compare

Release 0.18.6

Bug fixes and other changes

  • Fixed bug that didn't allow to read or write datasets with s3a or s3n filepaths
  • Fixed bug with overriding nested parameters using the --params flag
  • Fixed bug that made session store incompatible with Kedro-Viz experiment tracking

Migration guide from Kedro 0.18.5 to 0.18.6

A regression introduced in Kedro version 0.18.5 caused the Kedro-Viz console to fail to show experiment tracking correctly. If you experienced this issue, you will need to:

  • upgrade to Kedro version 0.18.6
  • delete any erroneous session entries created with Kedro 0.18.5 from your session_store.db stored at <project-path>/data/session_store.db.

Thanks to Kedroids tomohiko kato, tsanikgr and maddataanalyst for very detailed reports about the bug.

0.18.5

20 Feb 17:47
393d9d2

Choose a tag to compare

Release 0.18.5

NOTE: This version of Kedro introduced a bug such that the Kedro-Viz console to fail to show experiment tracking correctly. We recommend that you don't use it and prefer instead to use Kedro version 0.18.6.

Major features and improvements

  • Added new OmegaConfigLoader which uses OmegaConf for loading and merging configuration.
  • Added the --conf-source option to kedro run, allowing users to specify a source for project configuration for the run.
  • Added omegaconf syntax as option for --params. Keys and values can now be separated by colons or equals signs.
  • Added support for generator functions as nodes, i.e. using yield instead of return.
    • Enable chunk-wise processing in nodes with generator functions.
    • Save node outputs after every yield before proceeding with next chunk.
  • Fixed incorrect parsing of Azure Data Lake Storage Gen2 URIs used in datasets.
  • Added support for loading credentials from environment variables using OmegaConfigLoader.
  • Added new --namespace flag to kedro run to enable filtering by node namespace.
  • Added a new argument node for all four dataset hooks.
  • Added the kedro run flags --nodes, --tags, and --load-versions to replace --node, --tag, and --load-version.

Bug fixes and other changes

  • Commas surrounded by square brackets (only possible for nodes with default names) will no longer split the arguments to kedro run options which take a list of nodes as inputs (--from-nodes and --to-nodes).
  • Fixed bug where micropkg manifest section in pyproject.toml isn't recognised as allowed configuration.
  • Fixed bug causing load_ipython_extension not to register the %reload_kedro line magic when called in a directory that does not contain a Kedro project.
  • Added anyconfig's ac_context parameter to kedro.config.commons module functions for more flexible ConfigLoader customizations.
  • Change reference to kedro.pipeline.Pipeline object throughout test suite with kedro.modular_pipeline.pipeline factory.
  • Fixed bug causing the after_dataset_saved hook only to be called for one output dataset when multiple are saved in a single node and async saving is in use.
  • Log level for "Credentials not found in your Kedro project config" was changed from WARNING to DEBUG.
  • Added safe extraction of tar files in micropkg pull to fix vulnerability caused by CVE-2007-4559.
  • Documentation improvements
    • Bug fix in table font size
    • Updated API docs links for datasets
    • Improved CLI docs for kedro run
    • Revised documentation for visualisation to build plots and for experiment tracking
    • Added example for loading external credentials to the Hooks documentation

Breaking changes to the API

Community contributions

Many thanks to the following Kedroids for contributing PRs to this release:

Upcoming deprecations for Kedro 0.19.0

  • project_version will be deprecated in pyproject.toml please use kedro_init_version instead.
  • Deprecated kedro run flags --node, --tag, and --load-version in favour of --nodes, --tags, and --load-versions.

0.18.4

05 Dec 16:36
a6e91be

Choose a tag to compare

Major features and improvements

  • Make Kedro instantiate datasets from kedro_datasets with higher priority than kedro.extras.datasets. kedro_datasets is the namespace for the new kedro-datasets python package.
  • The config loader objects now implement UserDict and the configuration is accessed through conf_loader['catalog'].
  • You can configure config file patterns through settings.py without creating a custom config loader.
  • Added the following new datasets:
Type Description Location
svmlight.SVMLightDataSet Work with svmlight/libsvm files using scikit-learn library kedro.extras.datasets.svmlight
video.VideoDataSet Read and write video files from a filesystem kedro.extras.datasets.video
video.video_dataset.SequenceVideo Create a video object from an iterable sequence to use with VideoDataSet kedro.extras.datasets.video
video.video_dataset.GeneratorVideo Create a video object from a generator to use with VideoDataSet kedro.extras.datasets.video
  • Implemented support for a functional definition of schema in dask.ParquetDataSet to work with the dask.to_parquet API.

Bug fixes and other changes

  • Fixed kedro micropkg pull for packages on PyPI.
  • Fixed format in save_args for SparkHiveDataSet, previously it didn't allow you to save it as delta format.
  • Fixed save errors in TensorFlowModelDataset when used without versioning; previously, it wouldn't overwrite an existing model.
  • Added support for tf.device in TensorFlowModelDataset.
  • Updated error message for VersionNotFoundError to handle insufficient permission issues for cloud storage.
  • Updated Experiment Tracking docs with working examples.
  • Updated MatplotlibWriter Dataset, TextDataset, plotly.PlotlyDataSet and plotly.JSONDataSet docs with working examples.
  • Modified implementation of the Kedro IPython extension to use local_ns rather than a global variable.
  • Refactored ShelveStore to its own module to ensure multiprocessing works with it.
  • kedro.extras.datasets.pandas.SQLQueryDataSet now takes optional argument execution_options.
  • Removed attrs upper bound to support newer versions of Airflow.
  • Bumped the lower bound for the setuptools dependency to <=61.5.1.

Minor breaking changes to the API

Upcoming deprecations for Kedro 0.19.0

  • kedro test and kedro lint will be deprecated.

Documentation

  • Revised the Introduction to shorten it
  • Revised the Get Started section to remove unnecessary information and clarify the learning path
  • Updated the spaceflights tutorial to simplify the later stages and clarify what the reader needed to do in each phase
  • Moved some pages that covered advanced materials into more appropriate sections
  • Moved visualisation into its own section
  • Fixed a bug that degraded user experience: the table of contents is now sticky when you navigate between pages
  • Added redirects where needed on ReadTheDocs for legacy links and bookmarks

Contributions from the Kedroid community

We are grateful to the following for submitting PRs that contributed to this release: jstammers, FlorianGD, yash6318, carlaprv, dinotuku, williamcaicedo, avan-sh, Kastakin, amaralbf, BSGalvan, levimjoseph, daniel-falk, clotildeguinard, avsolatorio, and picklejuicedev for comments and input to documentation changes