Documentation updates for latest AI changes. (#371)

Jeadie · web-flow · commit 13ec465b5c6c · 2024-09-03T09:21:45.000+10:00
* ML docs housekeeping for LLMs, add OpenAI to spiceaidocs/docs/components/models

* add configurable LLMS, dataset[*].embeddings reference docs

* lint

* Apply suggestions from code review

Co-authored-by: peasee &lt;98815791+peasee@users.noreply.github.com&gt;
Co-authored-by: Evgenii Khramkov &lt;evgenii@spice.ai&gt;

* Update models.md

* Update datasets.md

* Update runtime_tools.md
diff --git a/spiceaidocs/docs/components/models/huggingface.md b/spiceaidocs/docs/components/models/huggingface.md
@@ -13,7 +13,7 @@ models:
   - from: huggingface:huggingface.co/spiceai/darts:latest
     name: hf_model
     files:
-      - model.onnx
+      - path: model.onnx
     datasets:
       - taxi_trips
 ```
@@ -44,4 +44,4 @@ The `from` key follows the following regex format:
 - ML models currently only support ONNX file format.
 - Only ONNX and GGUF file formats are currently supported.
 
-:::
+:::
diff --git a/spiceaidocs/docs/components/models/index.md b/spiceaidocs/docs/components/models/index.md
@@ -4,25 +4,22 @@ sidebar_label: 'AI/ML Models'
 description: ''
 ---
 
-Machine Learning (ML) models can be deployed and loaded from the following sources.
+Spice supports traditional machine learning (ML) models and language models (LLMs).
 
 - **Filesystem**: [ONNX](https://onnx.ai) models.
 - **HuggingFace**: ONNX models hosted on [HuggingFace](https://huggingface.co).
 - **Spice Cloud Platform**: Models hosted on the [Spice Cloud Platform](https://docs.spice.ai/building-blocks/spice-models).
+- **OpenAI**: OpenAI (or compatible) LLM endpoints.
 
-Defined in the `spicepod.yml`, a `model` component has the following format.
+### Model Sources
 
-| field      | Description                                                             |
-| ---------- | ----------------------------------------------------------------------- |
-| `name`     | Unique, readable name for the model within the Spicepod.                |
-| `from`     | Source-specific address to uniquely identify a model                    |
-| `datasets` | Datasets that the model depends on for inference                        |
-| `files`    | Specify additional files, or override default files needed by the model |
+| Name                         | Description      | ML Format(s) | LLM Format(s)*          |
+| ---------------------------- | ---------------- | ------------ | ----------------------- |
+| `file`                       | Local filesystem |    ONNX      | GGUF, GGML, SafeTensor  |
+| `huggingface:huggingface.co` | Models hosted on [HuggingFace](https://huggingface.co)                                          | ONNX  | GGUF, GGML, SafeTensor |
+| `spice.ai`                   | Models hosted on the [Spice Cloud Platform](https://docs.spice.ai/building-blocks/spice-models) | ONNX  | - |
+| `openai`                     | OpenAI (or compatible) LLM endpoint | -  | Remote HTTP endpoint |
 
-For more detail, refer to the `model` [reference specification](/reference/spicepod/models.md).
+* LLM Format(s) may require additional files (e.g. `tokenizer_config.json`).
 
-## Model Sources
-
-import DocCardList from '@theme/DocCardList';
-
-<DocCardList />
+The model type is inferred based on the model source and files. For more detail, refer to the `model` [reference specification](/reference/spicepod/models.md).
diff --git a/spiceaidocs/docs/components/models/openai.md b/spiceaidocs/docs/components/models/openai.md
@@ -0,0 +1,34 @@
+---
+title: 'OpenAI (or Compatible) Language Models'
+sidebar_label: 'OpenAI'
+sidebar_position: 4
+---
+
+To use a language model hosted on OpenAI (or compatible), specify the `openai` path in `from`. 
+
+For a specific model, include it as the model ID in `from` (see example below). Defaults to `"gpt-3.5-turbo"`.
+These parameters are specific to OpenAI models:
+
+| Param | Description | Default |
+| ----- | ----------- | ------- |
+| `openai_api_key` | The OpenAI API key.        | -                           |
+| `openai_org_id` | The OpenAI organization id. | -                           |
+| `openai_project_id` | The OpenAI project id.  | -                           |
+| `endpoint` | The OpenAI API base endpoint.    | `https://api.openai.com/v1` |
+
+
+Example:
+
+```yaml
+models:
+  - from: openai:gpt-4o
+    name: local_fs_model
+    params:
+      openai_api_key: ${ secrets:SPICE_OPENAI_API_KEY }
+
+  - from: openai:llama3-groq-70b-8192-tool-use-preview
+    name: groq-llama
+    params:
+      endpoint: https://api.groq.com/openai/v1
+      openai_api_key: ${ secrets:SPICE_GROQ_API_KEY }
+```
diff --git a/spiceaidocs/docs/features/configurable-llms/default_overrides.md b/spiceaidocs/docs/features/configurable-llms/default_overrides.md
@@ -0,0 +1,35 @@
+---
+title: 'Language Model Overrides'
+sidebar_label: 'Default overrides'
+description: 'Learn how to override default LLM hyperparameters in Spice.'
+sidebar_position: 1
+pagination_prev: null
+pagination_next: null
+---
+
+### Chat Completion Parameter Overrides
+[`v1/chat/completion`](/api/http/chat-completions) is an OpenAI compatible endpoint.
+
+It supports all request body parameters defined in the [OpenAI reference documentation](https://platform.openai.com/docs/api-reference/chat/create). Spice can configure different defaults for these request parameters.
+```yaml
+models:
+  - name: pirate-haikus
+    from: openai:gpt-4o
+    params:
+      openai_temperature: 0.1
+      openai_response_format: { "type": "json_object" }
+```
+To specify a default override for a parameter, use the `openai_` prefix followed by the parameter name. For example, to set the `temperature` parameter to `0.1`, use `openai_temperature: 0.1`.
+
+### System Prompt
+In addition to any system prompts provided in message dialogue, or added by model providers, Spice can configure an additional system prompt.
+```yaml
+models:
+  - name: pirate-haikus
+    from: openai:gpt-4o
+    params:
+      system_prompt: |
+        Write everything in Haiku like a pirate
+```
+
+Any request to [HTTP `v1/chat/completion`](/api/http/chat-completions) will include the configured system prompt.
diff --git a/spiceaidocs/docs/features/configurable-llms/index.md b/spiceaidocs/docs/features/configurable-llms/index.md
@@ -0,0 +1,17 @@
+---
+title: 'Configuring Language Models'
+sidebar_label: 'Configuring LLMs'
+description: 'Learn how to configure language models in Spice.'
+sidebar_position: 7
+pagination_prev: null
+pagination_next: null
+---
+
+Spice supports language models (LLMs) from several sources (see [model components](/components/models/index.md)) and provides configuration for how inference will be performed in the Spice runtime. This includes:
+ - Providing tools to the language model, enabling it to interact with the Spice runtime.
+ - Specifying system prompts and overriding defaults for [`v1/chat/completion`](/api/http/chat-completions.md).
+
+
+import DocCardList from '@theme/DocCardList';
+
+<DocCardList />
diff --git a/spiceaidocs/docs/features/configurable-llms/runtime_tools.md b/spiceaidocs/docs/features/configurable-llms/runtime_tools.md
@@ -0,0 +1,28 @@
+---
+title: 'Giving Language Models Runtime Tools'
+sidebar_label: 'Runtime tools'
+description: 'Learn how LLMs can interact with the spice runtime.'
+sidebar_position: 2
+pagination_prev: null
+pagination_next: null
+---
+
+Spice provides a set of tools that let LLMs interact with the runtime. To provide these tools to a Spice model, specify them in its `params.spice_tools`.
+```yaml
+models:
+  - name: sql-model
+    from: openai:gpt-4o
+    params:
+      spice_tools: list_datasets, sql, table_schema
+
+  - name: full-runtime
+    from: openai:gpt-4o
+    params:
+      spice_tools: auto # Use all available tools
+```
+
+## Available tools
+ - `list_datasets`: List all available datasets in the runtime.
+ - `sql`: Execute SQL queries on the runtime.
+ - `table_schema`: Get the schema of a specific SQL table.
+ - `document_similarity`: For datasets with an embedding column, retrieve documents based on an input query. It is equivalent to [/v1/search](/api/http/search).
diff --git a/spiceaidocs/docs/reference/spicepod/datasets.md b/spiceaidocs/docs/reference/spicepod/datasets.md
@@ -325,3 +325,28 @@ datasets:
         # alternatively "drop" can be used instead of "upsert" to drop the data update.
         hash: upsert
 ```
+
+## `embeddings`
+
+Optional. Create vector embeddings for specific columns of the dataset.
+
+```yaml
+datasets:
+  - from: spice.ai/eth.recent_blocks
+    name: eth.recent_blocks
+    embeddings:
+      - column: extra_data
+        use: hf_minilm
+```
+
+## `embeddings[*].column`
+
+The column name to create an embedding for.
+
+## `embeddings[*].use`
+
+The embedding model to use, specific the component name `embeddings[*].name`.
+
+## `embeddings[*].column_pk`
+
+Optional. For datasets without a primary key, explicitly specify column(s) that uniquely identify a row.
diff --git a/spiceaidocs/docs/reference/spicepod/models.md b/spiceaidocs/docs/reference/spicepod/models.md
@@ -16,6 +16,15 @@ The model specifications are in early preview and are subject to change.
 Spice supports both traditional machine learning (ML) models and language models (LLMs). The configuration allows you to specify either type from a variety of sources. The model type is automatically determined based on the model source and files.
 
 
+| field         | Description                                                             |
+| ------------- | ----------------------------------------------------------------------- |
+| `name`        | Unique, readable name for the model within the Spicepod.                |
+| `from`        | Source-specific address to uniquely identify a model                    |
+| `description` | Additional details about the model, useful for displaying to users      |
+| `datasets`    | Datasets that the model depends on for inference                        |
+| `files`       | Specify additional files, or override default files needed by the model |
+| `params`      | Additional parameters to be passed to the model                         |
+
 ## `models`
 
 The `models` section in your configuration allows you to specify one or more models to be used with your datasets.
@@ -41,13 +50,13 @@ models:
 
 ### `from`
 
-The `from` field specifies both the source of the model, and the unique identifier of the model (relative to the source). The `from` value expects the following format
+The `from` field specifies both the source of the model (e.g Huggingface, or a local file), and the unique identifier of the model (relative to the source). The `from` value expects the following format
 
 ```yaml
 - from: <model_source>/<model id>
 ```
 
-### Model Source
+#### Model Source
 
 The `<model_source>` prefix of the `from` field indicates where the model is sourced from:
 
@@ -56,7 +65,7 @@ The `<model_source>` prefix of the `from` field indicates where the model is sou
 - `openai` - OpenAI (or compatible) models
 - `spiceai` - Spice AI models
 
-### Model ID
+#### Model ID
 
 The `<model_id>` suffix of the `from` field is a unique (per source) identifier for the model:
 
@@ -65,13 +74,17 @@ The `<model_id>` suffix of the `from` field is a unique (per source) identifier
 - For Hugging Face: A repo_id and, optionally, revision hash or tag.
     - `Qwen/Qwen1.5-0.5B` (no revision)
     - `meta-llama/Meta-Llama-3-8B:cd892e8f4da1043d4b01d5ea182a2e8412bf658f` (with revision hash)
-- For local files: Represents the absolute or relative path to the model weights file on the local file system. See [below](#files) for the accepted model weight types and formats. 
+- For local files: Represents the absolute or relative path to the model weights file on the local file system. See [below](#files) for the accepted model weight types and formats.
 - For OpenAI: Only supports LMs. For OpenAI models, valid IDs can be found in their model [documentation](https://platform.openai.com/docs/models/continuous-model-upgrades). For OpenAI compatible providers, specify the value  required in their `v1/chat/completion` [payload](https://platform.openai.com/docs/api-reference/chat/create#chat-create-model).
 
 ### `name`
 
 A unique identifier for this model component.
 
+### `description`
+
+Additional details about the model, useful for displaying to users 
+
 ### `files`
 
 Optional. A list of files associated with this model. Each file has:
@@ -106,8 +119,8 @@ Optional. A map of key-value pairs for additional parameters specific to the mod
 
 ### `datasets`
 
-Optional. A list of [dataset names](./datasets.md#name) that this model should be applied to. For ML models, this preselects the dataset to use for inference. 
+Optional. A list of [dataset names](./datasets.md#name) that this model should be applied to. For ML models, this preselects the dataset to use for inference.
 
 ### `dependsOn`
 
-Optional. A list of dependencies that must be loaded and available before this model.
+Optional. A list of dependencies that must be loaded and available before this model.