
Commit 6026659

Intro changes to scaling articles (#113)
1 parent 21d2b40 commit 6026659

3 files changed: +156 additions, -47 deletions


docs/case_studies/llm-powered-merging-at-scale/notebook.ipynb

Lines changed: 56 additions & 18 deletions
@@ -4,27 +4,47 @@
    "cell_type": "markdown",
    "id": "bb0b6427",
    "metadata": {},
-   "source": "# LLM-powered Merging at Scale\n\nThe `merge()` function joins two tables using LLM intelligence to match rows that belong together. When table data alone is insufficient, it automatically falls back to web search. This notebook demonstrates merging 2,246 rows where each match may require different levels of investigation."
+   "source": [
+    "# LLM-powered Merging at Scale\n",
+    "\n",
+    "The everyrow `merge()` function joins two tables using LLMs and LLM research agents to identify matching rows with high accuracy. This notebook demonstrates how this scales to two tables of 2,246 rows: each row gets LLM-level intelligence and research to find the most likely match among the 2,246 rows in the other table.\n",
+    "\n",
+    "Cost grows super-linearly with the number of rows. At small scale (100 to 400 rows) the cost is negligible; at 2,246 x 2,246 rows, it costs $26.80."
+   ]
   },
   {
    "cell_type": "markdown",
    "id": "mkoy1995el",
-   "source": "## Example: Matching 2,246 People to Personal Websites\n\nThis example takes two tables: one with people's names and professional information (position, university, email), and another with a shuffled list of personal website URLs. The task is to determine which website belongs to which person.\n\nMost matches can be resolved by comparing names and emails against URL patterns. But some require web search to confirm ownership when the connection is not obvious from the data alone.",
-   "metadata": {}
+   "metadata": {},
+   "source": [
+    "## Example: Matching 2,246 People to Personal Websites\n",
+    "\n",
+    "This example takes two tables: one with people's names and professional information (position, university, email), and another with a shuffled list of personal website URLs. The task is to determine which website belongs to which person.\n",
+    "\n",
+    "Most matches can be resolved by comparing names and emails against URL patterns. But some require web search to confirm ownership when the connection is not obvious from the data alone."
+   ]
   },
   {
    "cell_type": "markdown",
    "id": "03955b64",
    "metadata": {},
-   "source": "## Load Data"
+   "source": [
+    "## Load Data"
+   ]
   },
   {
    "cell_type": "code",
+   "execution_count": null,
    "id": "my38zwvuk2n",
-   "source": "import numpy as np\nimport pandas as pd\nfrom everyrow.ops import merge\n\npd.set_option(\"display.max_colwidth\", None)",
    "metadata": {},
-   "execution_count": null,
-   "outputs": []
+   "outputs": [],
+   "source": [
+    "import numpy as np\n",
+    "import pandas as pd\n",
+    "from everyrow.ops import merge\n",
+    "\n",
+    "pd.set_option(\"display.max_colwidth\", None)"
+   ]
   },
   {
    "cell_type": "code",
@@ -163,7 +183,11 @@
    "cell_type": "markdown",
    "id": "d506d52e",
    "metadata": {},
-   "source": "## Run Merge\n\nRun the merge at increasing scales to see how it behaves."
+   "source": [
+    "## Run Merge\n",
+    "\n",
+    "Run the merge at increasing scales to see how it behaves."
+   ]
   },
   {
    "cell_type": "code",
@@ -229,7 +253,9 @@
    "cell_type": "markdown",
    "id": "774d421c",
    "metadata": {},
-   "source": "## Cost"
+   "source": [
+    "## Cost"
+   ]
   },
   {
    "cell_type": "code",
@@ -260,13 +286,19 @@
    "cell_type": "markdown",
    "id": "3fb4297b",
    "metadata": {},
-   "source": "Cost grows super linearly with the number of rows. As the number of rows increases, each match becomes harder because the LLM has more candidates to consider, and more rows require web search to resolve ambiguity. At small scale (100 to 400 rows) the cost is negligible; at 2,246 rows it is $26.80."
+   "source": [
+    "Cost grows super-linearly with the number of rows. As the number of rows increases, each match becomes harder because the LLM has more candidates to consider, and more rows require web search to resolve ambiguity. At small scale (100 to 400 rows) the cost is negligible; at 2,246 rows it is $26.80."
+   ]
   },
   {
    "cell_type": "markdown",
    "id": "e1f2a3b4",
    "metadata": {},
-   "source": "## Inspecting Results\n\nSample matches from the n=800 run."
+   "source": [
+    "## Inspecting Results\n",
+    "\n",
+    "Sample matches from the n=800 run."
+   ]
   },
   {
    "cell_type": "code",
@@ -282,7 +314,9 @@
    "cell_type": "markdown",
    "id": "c1d2e3f4",
    "metadata": {},
-   "source": "Most matches are resolved by the LLM alone. It can often match a person to their website by comparing names, emails, and URL patterns without any web search."
+   "source": [
+    "Most matches are resolved by the LLM alone. It can often match a person to their website by comparing names, emails, and URL patterns without any web search."
+   ]
   },
   {
    "cell_type": "code",
@@ -342,7 +376,9 @@
    "cell_type": "markdown",
    "id": "g1h2i3j4",
    "metadata": {},
-   "source": "For harder cases where the LLM cannot confidently match from the table data alone, everyrow automatically falls back to web search."
+   "source": [
+    "For harder cases where the LLM cannot confidently match from the table data alone, everyrow automatically falls back to web search."
+   ]
   },
   {
    "cell_type": "code",
@@ -394,10 +430,15 @@
    "cell_type": "markdown",
    "id": "k1l2m3n4",
    "metadata": {},
-   "source": "In this case, there is no obvious connection between \"Charles London\" and `le-big-mac.github.io` from the table data alone. everyrow searched the web, found his Oxford profile and GitHub username, and confirmed the match."
+   "source": [
+    "In this case, there is no obvious connection between \"Charles London\" and `le-big-mac.github.io` from the table data alone. everyrow searched the web, found his Oxford profile and GitHub username, and confirmed the match."
+   ]
   }
  ],
  "metadata": {
+  "everyrow": {
+   "description": "Python notebook using LLM-powered merge to match 2,246 people to personal websites. Demonstrates semantic joining at scale with web search fallback."
+  },
   "kernelspec": {
    "display_name": ".venv",
    "language": "python",
@@ -414,11 +455,8 @@
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
    "version": "3.12.6"
-  },
-  "everyrow": {
-   "description": "Python notebook using LLM-powered merge to match 2,246 people to personal websites. Demonstrates semantic joining at scale with web search fallback."
   }
  },
  "nbformat": 4,
  "nbformat_minor": 5
-}
+}
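The notebook's claim that most person-to-website matches resolve from names, emails, and URL patterns alone can be sketched deterministically. A minimal pure-Python sketch follows; the names, URLs, and scoring rule are invented for illustration and are not the everyrow implementation. A row that scores zero against every URL is the kind of case the notebook says would escalate to the LLM and web-search path.

```python
# Hypothetical baseline for the "easy" cases: score each candidate URL by
# how many name/email tokens it contains. All data here is illustrative.
people = [
    {"name": "Ada Lovelace", "email": "ada.lovelace@example.edu"},
    {"name": "Charles London", "email": "c.london@example.ac.uk"},
]
urls = ["le-big-mac.github.io", "adalovelace.dev"]

def tokens(person):
    # Tokens come from the lowercased name plus the email local part.
    toks = set(person["name"].lower().split())
    local = person["email"].split("@")[0]
    toks.update(t for t in local.split(".") if len(t) > 1)
    return toks

def match(person, candidates):
    # Return the URL sharing the most tokens; None means no signal at all,
    # i.e. the pair would need LLM reasoning or web search to resolve.
    best, best_score = None, 0
    for url in candidates:
        score = sum(t in url.lower() for t in tokens(person))
        if score > best_score:
            best, best_score = url, score
    return best

matches = [match(p, urls) for p in people]
# matches[0] -> "adalovelace.dev"; matches[1] -> None (falls back to search)
```

This mirrors the notebook's narrative: "Ada Lovelace" matches `adalovelace.dev` on tokens alone, while "Charles London" shares no tokens with `le-big-mac.github.io` and needs outside evidence.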

docs/case_studies/llm-powered-screening-at-scale/notebook.ipynb

Lines changed: 44 additions & 14 deletions
@@ -4,27 +4,47 @@
    "cell_type": "markdown",
    "id": "bb0b6427",
    "metadata": {},
-   "source": "# LLM-powered Screening at Scale\n\nThe `screen()` function filters a dataframe by applying LLM judgment to every row. Each row is evaluated against natural language criteria, with the LLM determining relevance. This notebook demonstrates screening 10,000 rows where some are trivially relevant or irrelevant while others require deeper analysis."
+   "source": [
+    "# LLM-powered Screening at Scale\n",
+    "\n",
+    "The everyrow `screen()` function filters a dataframe by applying LLMs and LLM research agents to every row to determine whether the criteria are met. This notebook demonstrates how this scales to screening 10,000 rows. Tricky rows get LLM agents that themselves make dozens of LLM calls, so a run issues far more LLM calls than is generally feasible without dedicated orchestration infrastructure. The total cost is ~$0.001 per row."
+   ]
   },
   {
    "cell_type": "markdown",
    "id": "09zeehb0muql",
-   "source": "## Example: Filtering 10,000 FDA Recalls\n\nThis example takes a dataset of FDA product recalls and filters it to find recalls relevant to a specific personal situation: products that might have been used for a child born on a particular date. The screening task requires understanding product types, typical use cases, and timing to determine which recalls matter.",
-   "metadata": {}
+   "metadata": {},
+   "source": [
+    "## Example: Filtering 10,000 FDA Recalls\n",
+    "\n",
+    "This example takes a dataset of FDA product recalls and filters it to find recalls relevant to a specific personal situation: products that might have been used for a child born on a particular date. The screening task requires understanding product types, typical use cases, and timing to determine which recalls matter."
+   ]
   },
   {
    "cell_type": "markdown",
    "id": "03955b64",
    "metadata": {},
-   "source": "## Load Data"
+   "source": [
+    "## Load Data"
+   ]
   },
   {
    "cell_type": "code",
+   "execution_count": null,
    "id": "2ac7mwmcy7h",
-   "source": "from dotenv import load_dotenv\nimport pandas as pd\nfrom everyrow import create_session\nfrom everyrow.ops import screen\n\npd.set_option(\"display.max_colwidth\", None)\n\n\nload_dotenv()",
    "metadata": {},
-   "execution_count": null,
-   "outputs": []
+   "outputs": [],
+   "source": [
+    "from dotenv import load_dotenv\n",
+    "import pandas as pd\n",
+    "from everyrow import create_session\n",
+    "from everyrow.ops import screen\n",
+    "\n",
+    "pd.set_option(\"display.max_colwidth\", None)\n",
+    "\n",
+    "\n",
+    "load_dotenv()"
+   ]
   },
   {
    "cell_type": "code",
@@ -211,7 +231,11 @@
    "cell_type": "markdown",
    "id": "009cd66c",
    "metadata": {},
-   "source": "## Define Screen Task\n\nThe screening criteria specify finding recalls of products that might have been used for a child born on 2021-08-01."
+   "source": [
+    "## Define Screen Task\n",
+    "\n",
+    "The screening criteria specify finding recalls of products that might have been used for a child born on 2021-08-01."
+   ]
   },
   {
    "cell_type": "code",
@@ -258,13 +282,19 @@
    "cell_type": "markdown",
    "id": "9882d4b5",
    "metadata": {},
-   "source": "Session URL: https://everyrow.io/sessions/df145a50-2dfd-48c6-97ed-6f82a82bca66"
+   "source": [
+    "Session URL: https://everyrow.io/sessions/df145a50-2dfd-48c6-97ed-6f82a82bca66"
+   ]
   },
   {
    "cell_type": "markdown",
    "id": "3fb4297b",
    "metadata": {},
-   "source": "### Cost\n\nThis run cost $12.10, averaging around $0.001 per row."
+   "source": [
+    "### Cost\n",
+    "\n",
+    "This run cost $12.10, averaging around $0.001 per row."
+   ]
   },
   {
    "cell_type": "markdown",
@@ -635,6 +665,9 @@
   }
  ],
  "metadata": {
+  "everyrow": {
+   "description": "Python notebook using LLM agents to screen 10,000 FDA product recalls for personal relevance. Demonstrates intelligent filtering at scale."
+  },
   "kernelspec": {
    "display_name": ".venv",
    "language": "python",
@@ -651,11 +684,8 @@
    "nbconvert_exporter": "python",
    "pygments_lexer": "ipython3",
    "version": "3.12.6"
-  },
-  "everyrow": {
-   "description": "Python notebook using LLM agents to screen 10,000 FDA product recalls for personal relevance. Demonstrates intelligent filtering at scale."
   }
  },
  "nbformat": 4,
  "nbformat_minor": 5
-}
+}
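The screening criterion above ("recalls of products that might have been used for a child born on 2021-08-01") has a deterministic core that a toy rule can capture: the product category must plausibly relate to a young child, and the recall must postdate the birth. The sketch below is illustrative only; the recall records and category list are invented, and the real `screen()` applies per-row LLM judgment rather than a fixed lookup.

```python
# Hypothetical deterministic sketch of the screening criterion. Real
# screening needs LLM judgment for ambiguous product types; this rule
# only captures the trivially decidable rows. Data is invented.
from datetime import date

BIRTH_DATE = date(2021, 8, 1)
CHILD_CATEGORIES = {"infant formula", "crib", "toy", "car seat"}

recalls = [
    {"product": "SoftDream Crib", "category": "crib", "recalled": date(2022, 3, 14)},
    {"product": "PowerDrill X2", "category": "power tool", "recalled": date(2022, 5, 2)},
    {"product": "TinyTots Formula", "category": "infant formula", "recalled": date(2020, 1, 9)},
]

def maybe_relevant(recall):
    # Keep only child-relevant categories recalled after the birth date;
    # an earlier recall could not affect a product bought for this child.
    return recall["category"] in CHILD_CATEGORIES and recall["recalled"] >= BIRTH_DATE

kept = [r["product"] for r in recalls if maybe_relevant(r)]
# kept -> ["SoftDream Crib"]
```

The pre-birth formula recall and the power tool are filtered out, matching the notebook's point that timing and typical use cases both matter for relevance.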
