Merge branch 'chore/documentation_updates'

travis-bauer · travis-bauer · commit ac51a8100fc3 · 2026-05-24T06:48:57.000-06:00
diff --git a/README.md b/README.md
@@ -508,7 +508,7 @@ talkpipe/
 
 ## Configuration
 
-TalkPipe uses a flexible configuration system via `~/.talkpipe.toml` or environment variables:
+TalkPipe uses a flexible configuration system via `~/.talkpipe.toml` or environment variables. For LLM and embedding `model` / `source` defaults, see [Model and source configuration](docs/guides/model-and-source-configuration.md).
 
 ```toml
 # ~/.talkpipe.toml
diff --git a/docs/README.md b/docs/README.md
@@ -10,6 +10,9 @@ New to TalkPipe? Start here for installation, basic concepts, and your first pip
 ### 📖 [RAG Commands](guides/makevectordatabase-and-serverag.md)
 Create vector databases and run RAG servers in two commands: `makevectordatabase` and `serverag`.
 
+### ⚙️ [Model and source configuration](guides/model-and-source-configuration.md)
+Set default LLM and embedding providers via segment parameters, `~/.talkpipe.toml`, or environment variables.
+
 ### 🐳 [Container images](guides/container-images.md)
 Pull multi-platform release images from GitHub Container Registry (`ghcr.io`), tags, and Docker/Podman usage.
 
diff --git a/docs/api-reference/chatterlang-script.md b/docs/api-reference/chatterlang-script.md
@@ -66,7 +66,7 @@ chatterlang_script \
     --factor 10
 ```
 
-This feature is useful for parameterizing scripts without editing configuration files or script content.
+This feature is useful for parameterizing scripts without editing configuration files or script content. You can use `$key` for `model` and `source` on LLM segments (for example `llmPrompt[model=$default_model_name, source=$default_model_source]`). See [Model and source configuration](../guides/model-and-source-configuration.md).
 
 ## Troubleshooting
 
diff --git a/docs/architecture/README.md b/docs/architecture/README.md
@@ -55,6 +55,8 @@ How TalkPipe manages configuration across different environments.
 - Precedence rules
 - Security considerations
 
+For LLM and embedding `model` / `source` defaults, see [Model and source configuration](../guides/model-and-source-configuration.md).
+
 ---
 
 *For API details, see [API Reference](../api-reference/). For working examples, see the [tutorials](../tutorials/) directory.*
diff --git a/docs/architecture/chatterlang.md b/docs/architecture/chatterlang.md
@@ -59,6 +59,8 @@ INPUT FROM @variable_name
 
 #### `llmPrompt` conversation memory controls
 
+For `model`, `source`, and global defaults, see [Model and source configuration](../guides/model-and-source-configuration.md).
+
 When using `llmPrompt`, three parameters control memory compaction behavior:
 
 - `context_token_trigger`: **when** compaction triggers.
diff --git a/docs/architecture/configuration.md b/docs/architecture/configuration.md
@@ -148,6 +148,8 @@ For application settings (logging, server ports, etc.):
 
 ### Embedding and LLM defaults: `serverag` vs segment keys
 
+See [Model and source configuration](../guides/model-and-source-configuration.md) for a user-focused guide to `model`, `source`, and related config keys.
+
 Several components resolve embedding and chat model defaults from `get_config()`. Two naming patterns appear in configuration:
 
 - **Segment defaults** — `LLMEmbed` and `LLMPrompt` fall back to these keys when `model` / `source` arguments are omitted: `default_embedding_model_name`, `default_embedding_model_source`, `default_model_name`, and `default_model_source` (see `talkpipe.util.constants`).
diff --git a/docs/contributing/developer-handbook.md b/docs/contributing/developer-handbook.md
@@ -90,6 +90,8 @@ After talkpipe is installed, a script called "chatterlang_reference_browser" is
 
 Configuration constants can be defined either in ~/.talkpipe.toml or in environment variables.  Any constant defined in an environment variable needs to be prefixed with TALKPIPE_.  So email_password, stored in an environment variable, needs to be TALKPIPE_email_password.  Note that in ChatterLang, any key defined in ~/.talkpipe.toml or set via a TALKPIPE_* environment variable can be referenced in scripts as a parameter using $var_name.  That reference resolves to the environment variable TALKPIPE_var_name or to var_name in talkpipe.toml.
 
+For how `model`, `source`, and LLM defaults interact across segments and CLIs, see [Model and source configuration](../guides/model-and-source-configuration.md).
+
 * **default_embedding_model_source** - The default source (e.g. ollama) to be used for creating sentence embeddings.
 * **default_embedding_model_name** - The name of the LLM model to be used for creating sentence embeddings.
 * **default_model_name** - The default name of a LLM model to be used in chat
diff --git a/docs/guides/makevectordatabase-and-serverag.md b/docs/guides/makevectordatabase-and-serverag.md
@@ -24,7 +24,7 @@ Together they form a minimal path from raw documents to a queryable RAG interfac
 - **Completion model** (for serverag): Ollama with an LLM (e.g. `ollama pull llama3.2`)
 - **Configuration**: Set `DEFAULT_EMBEDDING_MODEL`, `DEFAULT_EMBEDDING_SOURCE`, `DEFAULT_LLM_MODEL`, and `DEFAULT_LLM_SOURCE` in `~/.talkpipe.toml` or pass them on the command line
 
-See [Configuration](../architecture/configuration.md) for details.
+See [Model and source configuration](model-and-source-configuration.md) for how embedding and completion defaults work. For general config (logging, `$key` syntax), see [Configuration](../architecture/configuration.md).
 
 ---
 
diff --git a/docs/guides/model-and-source-configuration.md b/docs/guides/model-and-source-configuration.md
@@ -0,0 +1,213 @@
+# Model and source configuration
+
+TalkPipe LLM segments need two values for every call:
+
+- **`source`** — which backend provides the model (for example `ollama`, `openai`, or `anthropic` for chat).
+- **`model`** — the model id on that backend (for example `llama3.2`, `gpt-4o`, or `mxbai-embed-large`).
+
+You can set these on each segment, in `~/.talkpipe.toml`, via `TALKPIPE_*` environment variables, or through ChatterLang `$key` substitution. This guide explains how those layers interact. For logging, security, and general config mechanics, see [Configuration architecture](../architecture/configuration.md).
+
+---
+
+## Supported sources
+
+Sources are registered in `talkpipe.llm.config`:
+
+| Segment | Registered sources |
+|---------|-------------------|
+| **`llmPrompt`** (chat) | `ollama`, `openai`, `anthropic` |
+| **`llmEmbed`** (embeddings) | `ollama` |
+
+Additional sources can be registered at runtime with `registerPromptAdapter` or `registerEmbeddingAdapter` (see [Extending TalkPipe](../architecture/extending-talkpipe.md)).
+
+Install optional provider dependencies as needed: `pip install talkpipe[ollama]`, `talkpipe[openai]`, `talkpipe[anthropic]`, or `talkpipe[all]`.
+
+---
+
+## How values are resolved
+
+When `LLMPrompt` or `LLMEmbed` is constructed, TalkPipe fills in missing `model` / `source` from `get_config()` (merged `~/.talkpipe.toml` plus `TALKPIPE_*` environment variables). If either is still missing, construction raises an error.
+
+```mermaid
+flowchart TD
+  segmentParams["Segment parameters model and source"]
+  chatterlangDollar["ChatterLang $key at parse time"]
+  getConfig["get_config: TALKPIPE env then talkpipe.toml"]
+  providerSdk["Provider SDK env OPENAI_API_KEY etc"]
+
+  segmentParams -->|"highest for LLM segments"| resolved["Resolved model and source"]
+  chatterlangDollar --> segmentParams
+  getConfig --> segmentParams
+  providerSdk -->|"credentials only"| adapters["OpenAI and Anthropic adapters"]
+```
+
+### Precedence (highest first)
+
+| Layer | How it applies | Example |
+|-------|----------------|---------|
+| **Segment parameters** | Explicit `model` / `source` on the segment always win | `llmPrompt[model="gpt-4o", source="openai"]` |
+| **ChatterLang `$key`** | Resolved at parse time from `get_config()` | `llmPrompt[model=$default_model_name, source=$default_model_source]` |
+| **Environment variables** | `TALKPIPE_` + exact config key name | `export TALKPIPE_default_model_name=llama3.2` |
+| **Configuration file** | `~/.talkpipe.toml` | `default_model_name = "llama3.2"` |
+
+Within `get_config()`, environment variables override file values. ChatterLang `$key` precedence for CLI overrides is documented in [Configuration architecture](../architecture/configuration.md#chatterlang-script-variable-access): command-line `--key value` beats `TALKPIPE_key` beats TOML.
+
+**Provider credentials** (API keys) are separate: OpenAI and Anthropic adapters use their official SDKs, which read `OPENAI_API_KEY` and `ANTHROPIC_API_KEY` from the environment—not TalkPipe `default_*` keys.
+
+---
+
+## Configuration keys
+
+### Segment defaults (`default_*`)
+
+Used by `llmPrompt` and `llmEmbed` when `model` / `source` are omitted:
+
+| Purpose | TOML / config key | Environment variable |
+|---------|-------------------|----------------------|
+| Default chat model | `default_model_name` | `TALKPIPE_default_model_name` |
+| Default chat source | `default_model_source` | `TALKPIPE_default_model_source` |
+| Default embedding model | `default_embedding_model_name` | `TALKPIPE_default_embedding_model_name` |
+| Default embedding source | `default_embedding_model_source` | `TALKPIPE_default_embedding_model_source` |
+| Ollama server URL | `OLLAMA_SERVER_URL` | `TALKPIPE_OLLAMA_SERVER_URL` |
+
+Example `~/.talkpipe.toml`:
+
+```toml
+default_model_name = "llama3.2"
+default_model_source = "ollama"
+default_embedding_model_name = "mxbai-embed-large"
+default_embedding_model_source = "ollama"
+OLLAMA_SERVER_URL = "http://localhost:11434"
+```
+
+### RAG CLI defaults (`DEFAULT_*`)
+
+`makevectordatabase` and `serverag` read these when you omit `--embedding_model`, `--embedding_source`, `--completion_model`, and `--completion_source`:
+
+| Purpose | TOML / config key | Environment variable |
+|---------|-------------------|----------------------|
+| Embedding model | `DEFAULT_EMBEDDING_MODEL` | `TALKPIPE_DEFAULT_EMBEDDING_MODEL` |
+| Embedding source | `DEFAULT_EMBEDDING_SOURCE` | `TALKPIPE_DEFAULT_EMBEDDING_SOURCE` |
+| Completion model | `DEFAULT_LLM_MODEL` | `TALKPIPE_DEFAULT_LLM_MODEL` |
+| Completion source | `DEFAULT_LLM_SOURCE` | `TALKPIPE_DEFAULT_LLM_SOURCE` |
+
+If a CLI flag is omitted and the matching `DEFAULT_*` key is unset, the value passed into the RAG pipeline may be `None`, and inner `llmEmbed` / `llmPrompt` segments fall back to the `default_*` keys above.
+
+**Recommendation:** set `default_*` once for most workflows. Add `DEFAULT_*` only when you want different defaults specifically for the RAG commands. See [makevectordatabase and serverag](makevectordatabase-and-serverag.md).
+
+---
+
+## Segment parameters
+
+### `llmPrompt` / `LLMPrompt`
+
+Required (directly or via config): `model`, `source`.
+
+```chatterlang
+INPUT FROM prompt[data="Summarize this:"]
+| llmPrompt[model="llama3.2", source="ollama", field="data"]
+| print
+```
+
+```python
+from talkpipe.llm.chat import LLMPrompt
+
+segment = LLMPrompt(model="gpt-4o", source="openai", system_prompt="You are concise.")
+```
+
+Memory and compaction options (`memory_mode`, `context_token_trigger`, etc.) are described in [ChatterLang memory controls](../architecture/chatterlang.md#llmprompt-conversation-memory-controls).
+
+### `llmEmbed` / `LLMEmbed`
+
+Required (directly or via config): `model`, `source`. Optional: `field` (text field to embed), `set_as` (field to store the vector on the item).
+
+```chatterlang
+INPUT FROM echo[data="Hello world"]
+| llmEmbed[model="mxbai-embed-large", source="ollama", set_as="vector"]
+| print
+```
+
+### RAG and vector pipelines
+
+Higher-level segments forward model settings to inner LLM segments:
+
+| Segment / app | Parameters |
+|---------------|------------|
+| `makeVectorDatabase`, `searchVectorDatabase` | `embedding_model`, `embedding_source` |
+| `ragToText`, `ragToBinaryAnswer`, etc. | `embedding_model`, `embedding_source`, `completion_model`, `completion_source` |
+| `makevectordatabase`, `serverag` CLIs | `--embedding_model`, `--embedding_source`, `--completion_model`, `--completion_source` |
+
+### Ollama server URL
+
+Not a segment parameter by default. Set `OLLAMA_SERVER_URL` in config or `TALKPIPE_OLLAMA_SERVER_URL` in the environment when Ollama is not on localhost.
+
+---
+
+## Examples
+
+### 1. Explicit model and source (per call)
+
+```chatterlang
+INPUT FROM prompt[data="Hello"]
+| llmPrompt[model="llama3.2", source="ollama"]
+| print
+```
+
+### 2. Global defaults in TOML
+
+With `default_model_name` and `default_model_source` set in `~/.talkpipe.toml`:
+
+```chatterlang
+INPUT FROM prompt[data="Hello"]
+| llmPrompt
+| print
+```
+
+### 3. Environment-only defaults (containers / CI)
+
+```bash
+export TALKPIPE_default_model_name=llama3.2
+export TALKPIPE_default_model_source=ollama
+export TALKPIPE_default_embedding_model_name=mxbai-embed-large
+export TALKPIPE_default_embedding_model_source=ollama
+chatterlang_script --script 'INPUT FROM prompt[data="Hi"] | llmPrompt | print'
+```
+
+### 4. ChatterLang `$key` and CLI overrides
+
+```bash
+chatterlang_script --script 'INPUT FROM prompt[data="Hi"] | llmPrompt[model=$default_model_name, source=$default_model_source] | print' \
+  --default_model_name llama3.2 \
+  --default_model_source ollama
+```
+
+### 5. Pipe API with config fallback
+
+```python
+from talkpipe.llm.chat import LLMPrompt
+
+# Uses default_model_name / default_model_source from config when omitted
+segment = LLMPrompt(system_prompt="You are helpful.")
+```
+
+---
+
+## Troubleshooting
+
+| Symptom | What to check |
+|---------|----------------|
+| `Model name and source must be provided` | Set `model` and `source` on the segment, or add `default_model_name` and `default_model_source` (or embedding equivalents for `llmEmbed`). |
+| `Unknown source` | Chat: use `ollama`, `openai`, or `anthropic`. Embeddings: only `ollama` is registered unless you added adapters. |
+| Ollama connection refused | Run `ollama serve` or set `OLLAMA_SERVER_URL` / `TALKPIPE_OLLAMA_SERVER_URL`. |
+| OpenAI / Anthropic auth errors | Set `OPENAI_API_KEY` or `ANTHROPIC_API_KEY`; these are not read from `TALKPIPE_*` model keys. |
+| RAG CLI uses unexpected models | Check `DEFAULT_*` keys and CLI flags; then check segment `default_*` fallbacks. |
+
+---
+
+## Related documentation
+
+- [Configuration architecture](../architecture/configuration.md) — full config system, precedence, and security
+- [ChatterLang](../architecture/chatterlang.md) — DSL syntax and `llmPrompt` memory
+- [makevectordatabase and serverag](makevectordatabase-and-serverag.md) — RAG workflow
+- [Quickstart](../quickstart.md) — first pipeline examples
+- [Developer handbook](../contributing/developer-handbook.md) — standard `~/.talkpipe.toml` keys
diff --git a/docs/quickstart.md b/docs/quickstart.md
@@ -96,6 +96,8 @@ chat = compiler.compile(script).as_function(single_in=True, single_out=True)
 response = chat("Hello! Tell me about the history of computers.")
 ```
 
+To avoid repeating `model` and `source` on every segment, set defaults in `~/.talkpipe.toml` or environment variables. See [Model and source configuration](guides/model-and-source-configuration.md).
+
 For `llmPrompt` memory behavior (`context_token_trigger`, `memory_mode`, `unsummarized_message_count`, `memory_size`), see
 [ChatterLang memory controls](architecture/chatterlang.md#llmprompt-conversation-memory-controls).
 
diff --git a/docs/tutorials/Tutorial_2-Search_by_Example_and_RAG/README.md b/docs/tutorials/Tutorial_2-Search_by_Example_and_RAG/README.md
@@ -28,7 +28,7 @@ Keyword search breaks when users ask "find documents similar to this" or "show m
 ## Prerequisites
 
 - **Tutorial 1 completed**: `stories.json` must exist at `../Tutorial_1-Document_Indexing/stories.json`
-- **TalkPipe** installed: See [Getting Started](../../quickstart.md). For this tutorial: `pip install talkpipe[ollama]` or `pip install talkpipe[all]`
+- **TalkPipe** installed: See [Getting Started](../../quickstart.md). For this tutorial: `pip install talkpipe[ollama]` or `pip install talkpipe[all]`. Model and source defaults are explained in [Model and source configuration](../../guides/model-and-source-configuration.md).
 - **Ollama** with these models:
   - `mxbai-embed-large` (embeddings): `ollama pull mxbai-embed-large`
   - `llama3.2` (Step 3 only): `ollama pull llama3.2`
diff --git a/src/talkpipe/llm/chat.py b/src/talkpipe/llm/chat.py
@@ -38,9 +38,8 @@ class LLMPrompt(AbstractSegment):
     The model name and source can be specified in three different ways.  If
     explicitly included in the constructor, those values will be used.  If not,
     the values will be loaded from environment variables (TALKPIPE_default_model_name
-    and TALKPIPE_default_source).  If those are not set, the values will be loaded
-    from the configuration file (~/.talkpipe.toml).  If none of those are set, an 
-    error will be raised.
+    and TALKPIPE_default_model_source) or the configuration file (~/.talkpipe.toml).
+    If none of those are set, an error will be raised.
 
     Currently supported sources are "ollama," "openai," and "anthropic."  If 
     you specify "ollama," you can optionally set the OLLAMA_SERVER_URL environment