make sure the notebook runs

dbdimitrov · dbdimitrov · commit 1e43c069b427 · 2026-05-14T10:59:26.000+02:00
diff --git a/.gitignore b/.gitignore
@@ -37,6 +37,8 @@ docs/generated/*
 *.db
 *.pkl
 *.png
+*.pdf
+*.h5df
 
 # PyInstaller
 #  Usually these files are written by a python script from a template
diff --git a/CHANGELOG.md b/CHANGELOG.md
@@ -5,6 +5,11 @@
 - Fixed `get_hcop_orthologs` to use the HGNC Google Cloud Storage bucket instead of the defunct EBI FTP mirror, resolving 404 errors in CI.
 - Added `target_organism` parameter (default `"mouse"`) to `get_hcop_orthologs`, enabling homology mapping to any of the 19 species available in the HCOP database.
 - Updated documentation notebook (`prior_knowledge.ipynb`) to use the new `target_organism` API.
+- Updated `sc_multi.ipynb` metabolite-receptor section for decoupler v2: renamed `pd_net`/`t_net` columns to `source`/`target`/`weight` and removed deprecated `source`/`target`/`weight`/`min_n` kwargs from `estimate_metalinks` (replaced by `tmin`).
+- Standardized all public docstrings to NumPy format and added type annotations across public modules (#219).
+- Added mypy type-checking to pre-commit hooks (`--no-strict-optional --ignore-missing-imports`).
+- Added `build.yaml` CI workflow: validates the package build with `uv build` + `twine check --strict` on every push and pull request.
+- Renamed `.github/workflows/main.yml` → `test.yml`.
 
 ## 1.7.1 (24.01.2026)
 
diff --git a/docs/notebooks/prior_knowledge.ipynb b/docs/notebooks/prior_knowledge.ipynb
@@ -23,7 +23,7 @@
      "name": "stderr",
      "output_type": "stream",
      "text": [
-      "/Users/b260-admin/miniforge3/envs/liana311/lib/python3.11/site-packages/tqdm/auto.py:21: TqdmWarning: IProgress not found. Please update jupyter and ipywidgets. See https://ipywidgets.readthedocs.io/en/stable/user_install.html\n",
+      "/Users/b260-admin/miniforge3/envs/liana313/lib/python3.13/site-packages/tqdm/auto.py:21: TqdmWarning: IProgress not found. Please update jupyter and ipywidgets. See https://ipywidgets.readthedocs.io/en/stable/user_install.html\n",
       "Downloading data from `https://omnipathdb.org/queries/enzsub?format=json`\n",
       "Downloading data from `https://omnipathdb.org/queries/interactions?format=json`\n",
       "Downloading data from `https://omnipathdb.org/queries/complexes?format=json`\n",
@@ -192,18 +192,6 @@
    "execution_count": 4,
    "metadata": {},
    "outputs": [
-    {
-     "name": "stderr",
-     "output_type": "stream",
-     "text": [
-      "Downloading data from `https://omnipathdb.org/interactions?datasets=kinaseextra%2Cligrecextra%2Comnipath%2Cpathwayextra&fields=curation_effort%2Creferences%2Csources%2Ctype&format=tsv&license=commercial`\n",
-      "10.5MB [00:00, 37.9MB/s]\n",
-      "Downloading data from `https://omnipathdb.org/intercell?causality=trans&databases=CellChatDB&format=tsv&scope=generic`\n",
-      "124kB [00:00, 91.7MB/s]\n",
-      "Downloading data from `https://omnipathdb.org/intercell?causality=rec&databases=CellChatDB&format=tsv&scope=generic`\n",
-      "84.4kB [00:00, 108MB/s]\n"
-     ]
-    },
     {
      "data": {
       "text/html": [
@@ -463,14 +451,106 @@
   {
    "cell_type": "markdown",
    "metadata": {},
-   "source": "## Homology Mapping\n\nSimilarly, LIANA+ provides on demand homology mapping beyond mouse symbols. It utilises the [HCOP database](https://www.genenames.org/help/hcop/) to obtain homologous genes across species. Files are downloaded from the HGNC Google Cloud Storage bucket.\n\nThe homology mapping is accessible through the `resource` module:"
+   "source": [
+    "## Homology Mapping\n",
+    "\n",
+    "Similarly, LIANA+ provides on demand homology mapping beyond mouse symbols. It utilises the [HCOP database](https://www.genenames.org/help/hcop/) to obtain homologous genes across species. Files are downloaded from the HGNC Google Cloud Storage bucket.\n",
+    "\n",
+    "The homology mapping is accessible through the `resource` module:"
+   ]
   },
   {
    "cell_type": "code",
-   "execution_count": null,
+   "execution_count": 5,
    "metadata": {},
-   "outputs": [],
-   "source": "# let's say we are interested in zebrafish homologs of human genes\nmap_df = li.rs.get_hcop_orthologs(target_organism='zebrafish',\n                                   columns=['human_symbol', 'zebrafish_symbol'],\n                                   # NOTE: HCOP integrates multiple resource, so we can filter out mappings in at least 3 of them for confidence\n                                   min_evidence=3\n                                   )\n# rename the columns to source and target, respectively for the original organism and the target organism\nmap_df = map_df.rename(columns={'human_symbol':'source', 'zebrafish_symbol':'target'})\nmap_df.tail()"
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "/Users/b260-admin/Repos/liana-py/src/liana/resource/_orthology.py:217: DtypeWarning: Columns (0) have mixed types. Specify dtype option on import or set low_memory=False.\n"
+     ]
+    },
+    {
+     "data": {
+      "text/html": [
+       "<div>\n",
+       "<style scoped>\n",
+       "    .dataframe tbody tr th:only-of-type {\n",
+       "        vertical-align: middle;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe tbody tr th {\n",
+       "        vertical-align: top;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe thead th {\n",
+       "        text-align: right;\n",
+       "    }\n",
+       "</style>\n",
+       "<table border=\"1\" class=\"dataframe\">\n",
+       "  <thead>\n",
+       "    <tr style=\"text-align: right;\">\n",
+       "      <th></th>\n",
+       "      <th>source</th>\n",
+       "      <th>target</th>\n",
+       "    </tr>\n",
+       "  </thead>\n",
+       "  <tbody>\n",
+       "    <tr>\n",
+       "      <th>132672</th>\n",
+       "      <td>ZYG11B</td>\n",
+       "      <td>zyg11</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>132673</th>\n",
+       "      <td>ZYG11B</td>\n",
+       "      <td>zyg11l</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>132674</th>\n",
+       "      <td>ZYX</td>\n",
+       "      <td>zyx</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>132676</th>\n",
+       "      <td>ZZEF1</td>\n",
+       "      <td>zzef1</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>132677</th>\n",
+       "      <td>ZZZ3</td>\n",
+       "      <td>zzz3</td>\n",
+       "    </tr>\n",
+       "  </tbody>\n",
+       "</table>\n",
+       "</div>"
+      ],
+      "text/plain": [
+       "        source  target\n",
+       "132672  ZYG11B   zyg11\n",
+       "132673  ZYG11B  zyg11l\n",
+       "132674     ZYX     zyx\n",
+       "132676   ZZEF1   zzef1\n",
+       "132677    ZZZ3    zzz3"
+      ]
+     },
+     "execution_count": 5,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "# let's say we are interested in zebrafish homologs of human genes\n",
+    "map_df = li.rs.get_hcop_orthologs(target_organism='zebrafish',\n",
+    "                                   columns=['human_symbol', 'zebrafish_symbol'],\n",
+    "                                   # NOTE: HCOP integrates multiple resource, so we can filter out mappings in at least 3 of them for confidence\n",
+    "                                   min_evidence=3\n",
+    "                                   )\n",
+    "# rename the columns to source and target, respectively for the original organism and the target organism\n",
+    "map_df = map_df.rename(columns={'human_symbol':'source', 'zebrafish_symbol':'target'})\n",
+    "map_df.tail()"
+   ]
   },
   {
    "cell_type": "markdown",
@@ -504,10 +584,143 @@
   },
   {
    "cell_type": "code",
-   "execution_count": null,
+   "execution_count": 7,
    "metadata": {},
-   "outputs": [],
-   "source": "map_df = li.rs.get_hcop_orthologs(target_organism='mouse',\n                                  columns=['human_symbol', 'mouse_symbol'],\n                                   # NOTE: HCOP integrates multiple resource, so we can filter out mappings in at least 3 of them for confidence\n                                   min_evidence=3\n                                   )\n# rename the columns to source and target, respectively for the original organism and the target organism\nmap_df = map_df.rename(columns={'human_symbol':'source', 'mouse_symbol':'target'})\n\n# We will then translate\nmouse = li.rs.translate_resource(resource,\n                                 map_df=map_df,\n                                 columns=['ligand', 'receptor'],\n                                 replace=True,\n                                 # Here, we will be harsher and only keep mappings that don't map to more than 1 mouse gene\n                                 one_to_many=1\n                                 )\nmouse"
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "/Users/b260-admin/Repos/liana-py/src/liana/resource/_orthology.py:217: DtypeWarning: Columns (0) have mixed types. Specify dtype option on import or set low_memory=False.\n"
+     ]
+    },
+    {
+     "data": {
+      "text/html": [
+       "<div>\n",
+       "<style scoped>\n",
+       "    .dataframe tbody tr th:only-of-type {\n",
+       "        vertical-align: middle;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe tbody tr th {\n",
+       "        vertical-align: top;\n",
+       "    }\n",
+       "\n",
+       "    .dataframe thead th {\n",
+       "        text-align: right;\n",
+       "    }\n",
+       "</style>\n",
+       "<table border=\"1\" class=\"dataframe\">\n",
+       "  <thead>\n",
+       "    <tr style=\"text-align: right;\">\n",
+       "      <th></th>\n",
+       "      <th>ligand</th>\n",
+       "      <th>receptor</th>\n",
+       "    </tr>\n",
+       "  </thead>\n",
+       "  <tbody>\n",
+       "    <tr>\n",
+       "      <th>0</th>\n",
+       "      <td>Lgals9</td>\n",
+       "      <td>Ptprc</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>1</th>\n",
+       "      <td>Lgals9</td>\n",
+       "      <td>Met</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>2</th>\n",
+       "      <td>Lgals9</td>\n",
+       "      <td>Cd44</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>3</th>\n",
+       "      <td>Lgals9</td>\n",
+       "      <td>Lrp1</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>4</th>\n",
+       "      <td>Lgals9</td>\n",
+       "      <td>Cd47</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>...</th>\n",
+       "      <td>...</td>\n",
+       "      <td>...</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>4619</th>\n",
+       "      <td>Bmp2</td>\n",
+       "      <td>Actr2</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>4620</th>\n",
+       "      <td>Bmp15</td>\n",
+       "      <td>Actr2</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>4621</th>\n",
+       "      <td>Csf1</td>\n",
+       "      <td>Csf3r</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>4622</th>\n",
+       "      <td>Il36g</td>\n",
+       "      <td>Ifnar1</td>\n",
+       "    </tr>\n",
+       "    <tr>\n",
+       "      <th>4623</th>\n",
+       "      <td>Il36g</td>\n",
+       "      <td>Ifnar2</td>\n",
+       "    </tr>\n",
+       "  </tbody>\n",
+       "</table>\n",
+       "<p>4055 rows × 2 columns</p>\n",
+       "</div>"
+      ],
+      "text/plain": [
+       "      ligand receptor\n",
+       "0     Lgals9    Ptprc\n",
+       "1     Lgals9      Met\n",
+       "2     Lgals9     Cd44\n",
+       "3     Lgals9     Lrp1\n",
+       "4     Lgals9     Cd47\n",
+       "...      ...      ...\n",
+       "4619    Bmp2    Actr2\n",
+       "4620   Bmp15    Actr2\n",
+       "4621    Csf1    Csf3r\n",
+       "4622   Il36g   Ifnar1\n",
+       "4623   Il36g   Ifnar2\n",
+       "\n",
+       "[4055 rows x 2 columns]"
+      ]
+     },
+     "execution_count": 7,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "map_df = li.rs.get_hcop_orthologs(target_organism='mouse',\n",
+    "                                  columns=['human_symbol', 'mouse_symbol'],\n",
+    "                                   # NOTE: HCOP integrates multiple resource, so we can filter out mappings in at least 3 of them for confidence\n",
+    "                                   min_evidence=3\n",
+    "                                   )\n",
+    "# rename the columns to source and target, respectively for the original organism and the target organism\n",
+    "map_df = map_df.rename(columns={'human_symbol':'source', 'mouse_symbol':'target'})\n",
+    "\n",
+    "# We will then translate\n",
+    "mouse = li.rs.translate_resource(resource,\n",
+    "                                 map_df=map_df,\n",
+    "                                 columns=['ligand', 'receptor'],\n",
+    "                                 replace=True,\n",
+    "                                 # Here, we will be harsher and only keep mappings that don't map to more than 1 mouse gene\n",
+    "                                 one_to_many=1\n",
+    "                                 )\n",
+    "mouse"
+   ]
   },
   {
    "cell_type": "markdown",
@@ -761,9 +974,7 @@
      "name": "stderr",
      "output_type": "stream",
      "text": [
-      "Downloading annotations for all proteins from the following resources: `['DisGeNet']`\n",
-      "Downloading data from `https://omnipathdb.org/annotations?format=tsv&resources=DisGeNet`\n",
-      "38.3MB [00:00, 67.5MB/s]\n"
+      "Downloading annotations for all proteins from the following resources: `['DisGeNet']`\n"
      ]
     }
    ],
@@ -782,7 +993,7 @@
      "name": "stderr",
      "output_type": "stream",
      "text": [
-      "/var/folders/gk/kmrvz5m90sb9wftqk2n94p0h0000gq/T/ipykernel_79557/3054253547.py:2: FutureWarning: The default value of observed=False is deprecated and will change to observed=True in a future version of pandas. Specify observed=False to silence this warning and retain the current behavior\n"
+      "/var/folders/gk/kmrvz5m90sb9wftqk2n94p0h0000gq/T/ipykernel_26684/3054253547.py:2: FutureWarning: The default value of observed=False is deprecated and will change to observed=True in a future version of pandas. Specify observed=False to silence this warning and retain the current behavior\n"
      ]
     },
     {
@@ -992,14 +1203,6 @@
    "execution_count": 14,
    "metadata": {},
    "outputs": [
-    {
-     "name": "stderr",
-     "output_type": "stream",
-     "text": [
-      "Downloading data from `https://omnipathdb.org/interactions?datasets=omnipath&fields=curation_effort%2Creferences%2Csources&format=tsv&genesymbols=1`\n",
-      "10.0MB [00:00, 32.3MB/s]\n"
-     ]
-    },
     {
      "data": {
       "text/html": [
@@ -1594,7 +1797,7 @@
  ],
  "metadata": {
   "kernelspec": {
-   "display_name": "spiana",
+   "display_name": "liana313",
    "language": "python",
    "name": "python3"
   },
@@ -1608,7 +1811,7 @@
    "name": "python",
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
-   "version": "3.10.10"
+   "version": "3.13.7"
   }
  },
  "nbformat": 4,
diff --git a/docs/notebooks/sc_multi.ipynb b/docs/notebooks/sc_multi.ipynb