diff --git a/.gitignore b/.gitignore
Lines changed: 1 addition & 0 deletions
diff --git a/examples/ablation-example.ipynb b/examples/ablation-example.ipynb
Lines changed: 4 additions & 25 deletions
diff --git a/examples/optuna-example.ipynb b/examples/optuna-example.ipynb
Lines changed: 261 additions & 0 deletions
@@ -212,3 +212,4 @@ __marimo__/
 
 # IDE project files
 .idea
+.DS_Store
@@ -18,9 +18,8 @@
       "id": "initial_id",
       "metadata": {
         "collapsed": true,
-        "ExecuteTime": {
-          "end_time": "2025-08-19T15:17:31.718520Z",
-          "start_time": "2025-08-19T15:17:31.336310Z"
+        "jupyter": {
+          "is_executing": true
         }
       },
       "source": [
@@ -73,28 +72,8 @@
         "print(\"\")\n",
         "print(\"Config of interest\", config_of_interest, \"\\nValue\", eval_fun(config_of_interest))"
       ],
-      "outputs": [
-        {
-          "name": "stdout",
-          "output_type": "stream",
-          "text": [
-            "Baseline Configuration(values={\n",
-            "  'a': 0.6243561663863,\n",
-            "  'b': 4,\n",
-            "  'c': np.str_('Y'),\n",
-            "}) \n",
-            "Value 0.7004003054122951\n",
-            "\n",
-            "Config of interest Configuration(values={\n",
-            "  'a': 1.45774946311,\n",
-            "  'b': 10,\n",
-            "  'c': np.str_('X'),\n",
-            "}) \n",
-            "Value 10.99361700532433\n"
-          ]
-        }
-      ],
-      "execution_count": 1
+      "outputs": [],
+      "execution_count": null
     },
     {
       "cell_type": "markdown",
 
@@ -0,0 +1,261 @@
+{
+  "cells": [
+    {
+      "cell_type": "markdown",
+      "id": "74b6c2461143b273",
+      "metadata": {},
+      "source": [
+        "# HyperSHAP: Example for Optuna Integration\n",
+        "\n",
+        "In this example, we demonstrate how to load data from an [optuna](https://optuna.org/) study directly into HyperSHAP for downstream hyperparameter analysis.\n",
+        "\n",
+        "This is useful when you have already run an optuna HPO study and want to understand *why* certain hyperparameters matter more than others \u2014 without having to redefine a `ConfigSpace` manually.\n",
+        "\n",
+        "> **Prerequisites:** `optuna` must be installed.\n",
+        "> ```bash\n",
+        "> pip install optuna\n",
+        "> # or\n",
+        "> pip install \"hypershap[optuna]\"  # quoted so shells such as zsh do not expand the brackets\n",
+        "> ```\n",
+        "\n",
+        "## Step 1 \u2014 Run an optuna study\n",
+        "\n",
+        "We first set up a small synthetic objective and run an optuna study to mimic a realistic HPO scenario.\n",
+        "The objective uses a float, an integer, and a categorical hyperparameter \u2014 the same types supported by the optuna integration."
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": null,
+      "id": "initial_id",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "from __future__ import annotations\n",
+        "\n",
+        "import math\n",
+        "\n",
+        "import optuna\n",
+        "\n",
+        "optuna.logging.set_verbosity(optuna.logging.WARNING)  # suppress per-trial logs\n",
+        "\n",
+        "\n",
+        "def objective(trial: optuna.Trial) -> float:\n",
+        "    \"\"\"Synthetic objective that mimics a tunable ML algorithm.\n",
+        "\n",
+        "    Hyperparameters\n",
+        "    ---------------\n",
+        "    a : float  in [0.1, 1.5]   \u2014 learning-rate-like continuous parameter\n",
+        "    b : int    in [2, 10]      \u2014 depth-like integer parameter\n",
+        "    c : str    in {\"X\", \"Y\"}   \u2014 algorithm variant (categorical)\n",
+        "    \"\"\"\n",
+        "    a = trial.suggest_float(\"a\", 0.1, 1.5)\n",
+        "    b = trial.suggest_int(\"b\", 2, 10)\n",
+        "    c = trial.suggest_categorical(\"c\", [\"X\", \"Y\"])\n",
+        "\n",
+        "    # Variant X: performance mainly driven by b, slightly by a\n",
+        "    if c == \"X\":\n",
+        "        return math.sin(a) + b\n",
+        "    # Variant Y: interaction between a and b dominates\n",
+        "    return math.cos(a * b) + 1.5\n",
+        "\n",
+        "\n",
+        "# Run a maximisation study with 200 trials\n",
+        "study = optuna.create_study(direction=\"maximize\", sampler=optuna.samplers.TPESampler(seed=42))\n",
+        "study.optimize(objective, n_trials=200)\n",
+        "\n",
+        "print(f\"Completed trials : {len(study.trials)}\")\n",
+        "print(f\"Best value       : {study.best_value:.4f}\")\n",
+        "print(f\"Best params      : {study.best_params}\")"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "id": "97bf13e91a18e32e",
+      "metadata": {},
+      "source": [
+        "## Step 2 \u2014 Load the study into HyperSHAP\n",
+        "\n",
+        "`from_optuna_study` is the main entry point for the optuna integration. It:\n",
+        "\n",
+        "1. Extracts a `ConfigurationSpace` from the trial distributions.\n",
+        "2. Converts all completed trial results into `(Configuration, float)` pairs.\n",
+        "3. Fits a surrogate model (default: `RandomForestRegressor`) on those pairs.\n",
+        "4. Returns an `ExplanationTask` ready for HyperSHAP analysis.\n",
+        "\n",
+        "For **minimisation** studies (`direction=\"minimize\"`) the objective values are automatically negated so that HyperSHAP's *higher-is-better* convention is respected. Pass `negate=False` to disable this."
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": null,
+      "id": "31b290fdd2788b74",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "from hypershap import HyperSHAP, from_optuna_study\n",
+        "\n",
+        "# One-liner: study \u2192 ExplanationTask\n",
+        "explanation_task = from_optuna_study(study)\n",
+        "\n",
+        "print(\"Config space HPs :\", explanation_task.get_hyperparameter_names())\n",
+        "print(\"Number of HPs    :\", explanation_task.get_num_hyperparameters())\n",
+        "\n",
+        "hypershap = HyperSHAP(explanation_task=explanation_task)"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "id": "6d398a1ada8f99a2",
+      "metadata": {},
+      "source": "## Step 3 \u2014 Tunability analysis\n\nFor tunability we need a **baseline configuration** \u2014 the starting point from which we measure how much tuning each hyperparameter can improve performance. A natural choice is the *default configuration* of the inferred `ConfigurationSpace` (i.e. the midpoint/default of each hyperparameter range), which represents the algorithm before any tuning has taken place."
+    },
+    {
+      "cell_type": "code",
+      "execution_count": null,
+      "id": "a248ad7cf028ced8",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "# Use the default configuration of the inferred ConfigSpace as the baseline \u2014\n",
+        "# this represents the algorithm with no tuning applied.\n",
+        "default_config = explanation_task.config_space.get_default_configuration()\n",
+        "print(\"Default (baseline) config:\", default_config)\n",
+        "\n",
+        "iv_tunability = hypershap.tunability(baseline_config=default_config)\n",
+        "print(iv_tunability)"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "id": "9c0b58a6ff43f6f1",
+      "metadata": {},
+      "source": [
+        "### Visualisations"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": null,
+      "id": "27ff3fe327082acf",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "hypershap.plot_si_graph()"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": null,
+      "id": "87fcba5335b5aa7c",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "hypershap.plot_stacked_bar()"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "id": "c3a1b2d0e4f5a6b7",
+      "metadata": {},
+      "source": "## Step 4 \u2014 Ablation analysis\n\nWe can also run an ablation analysis to understand which hyperparameters are responsible for the performance gain from the default configuration to the best configuration found by optuna."
+    },
+    {
+      "cell_type": "code",
+      "execution_count": null,
+      "id": "d5e6f7a8b9c0d1e2",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "from ConfigSpace import Configuration\n",
+        "\n",
+        "best_config = Configuration(\n",
+        "    explanation_task.config_space,\n",
+        "    values=study.best_params,\n",
+        ")\n",
+        "\n",
+        "iv_ablation = hypershap.ablation(\n",
+        "    config_of_interest=best_config,  # optimized config found by optuna\n",
+        "    baseline_config=default_config,  # default / untuned starting point\n",
+        ")\n",
+        "print(iv_ablation)"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": null,
+      "id": "f0a1b2c3d4e5f6a7",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "hypershap.plot_waterfall()"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "id": "b8c9d0e1f2a3b4c5",
+      "metadata": {},
+      "source": [
+        "## Step 5 \u2014 Advanced: using the lower-level helpers\n",
+        "\n",
+        "If you need more control \u2014 e.g. to inspect the inferred `ConfigurationSpace`, filter trials manually, or pass a custom surrogate model \u2014 you can use the lower-level helpers directly."
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": null,
+      "id": "d6e7f8a9b0c1d2e3",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "from sklearn.ensemble import GradientBoostingRegressor\n",
+        "\n",
+        "from hypershap.optuna_task import study_to_config_space, study_to_data\n",
+        "\n",
+        "# 1. Inspect the inferred configuration space\n",
+        "cs = study_to_config_space(study)\n",
+        "print(\"Inferred ConfigurationSpace:\")\n",
+        "print(cs)\n",
+        "\n",
+        "# 2. Convert trials to (Configuration, float) pairs \u2014 apply custom filtering if needed\n",
+        "data = study_to_data(study, config_space=cs)\n",
+        "print(f\"\\nConverted {len(data)} trials to (Configuration, float) pairs.\")\n",
+        "\n",
+        "# 3. Build an ExplanationTask with a custom surrogate model\n",
+        "from hypershap.task import ExplanationTask\n",
+        "\n",
+        "custom_task = ExplanationTask.from_data(\n",
+        "    config_space=cs,\n",
+        "    data=data,\n",
+        "    base_model=GradientBoostingRegressor(n_estimators=200, random_state=0),\n",
+        ")\n",
+        "\n",
+        "hs_custom = HyperSHAP(explanation_task=custom_task)\n",
+        "iv_custom = hs_custom.tunability(baseline_config=default_config)\n",
+        "print(\"\\nTunability with GradientBoostingRegressor surrogate:\")\n",
+        "print(iv_custom)"
+      ]
+    }
+  ],
+  "metadata": {
+    "kernelspec": {
+      "display_name": "Python 3",
+      "language": "python",
+      "name": "python3"
+    },
+    "language_info": {
+      "codemirror_mode": {
+        "name": "ipython",
+        "version": 3
+      },
+      "file_extension": ".py",
+      "mimetype": "text/x-python",
+      "name": "python",
+      "nbformat_minor": 5,
+      "pygments_lexer": "ipython3",
+      "version": "3.10.0"
+    }
+  },
+  "nbformat": 4,
+  "nbformat_minor": 5
+}
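Step 2 of the notebook above describes converting completed trials into `(Configuration, float)` pairs and negating the values of minimisation studies so that higher is always better. The following is a minimal pure-Python sketch of that convention only, not HyperSHAP's actual implementation: `TrialRecord` and `to_pairs` are hypothetical names invented for illustration, plain dicts stand in for `Configuration` objects, and a value of `None` is assumed to model a trial that did not complete.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class TrialRecord:
    """Hypothetical stand-in for one optuna trial (not an optuna class)."""

    params: dict
    value: float | None  # None models a trial that did not complete


def to_pairs(
    trials: list[TrialRecord],
    direction: str,
    negate: bool = True,
) -> list[tuple[dict, float]]:
    """Turn completed trials into (params, value) pairs, higher-is-better."""
    # Flip the sign only for minimisation studies, and only if negation is on.
    sign = -1.0 if (negate and direction == "minimize") else 1.0
    # Trials without a value are dropped before fitting any surrogate.
    return [(dict(t.params), sign * t.value) for t in trials if t.value is not None]


trials = [
    TrialRecord({"a": 0.5, "b": 3, "c": "X"}, 2.0),
    TrialRecord({"a": 1.0, "b": 4, "c": "Y"}, None),  # dropped: no value
    TrialRecord({"a": 1.2, "b": 9, "c": "X"}, 0.5),
]

pairs_min = to_pairs(trials, direction="minimize")                # values negated
pairs_raw = to_pairs(trials, direction="minimize", negate=False)  # values kept as-is
pairs_max = to_pairs(trials, direction="maximize")                # values kept as-is
print([value for _, value in pairs_min])
```

With the sign flipped, a surrogate fitted on `pairs_min` ranks configurations in the same order as the original minimisation objective, only with larger values meaning better. That is the property the notebook's higher-is-better convention relies on.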