From b3f8af4e230aff00d7f1e425645cefa21405ddc0 Mon Sep 17 00:00:00 2001
From: James Baiera
Date: Mon, 5 May 2025 18:24:10 -0400
Subject: [PATCH 01/17] [WIP] Add documentation for failure stores.

TBD on recipes. Most links are not complete and need updating from "???".
---
 .../data-store/data-streams/failure-store.md | 607 ++++++++++++++++++
 manage-data/toc.yml                          |   1 +
 2 files changed, 608 insertions(+)
 create mode 100644 manage-data/data-store/data-streams/failure-store.md

diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md
new file mode 100644
index 000000000..501718fd0
--- /dev/null
+++ b/manage-data/data-store/data-streams/failure-store.md
@@ -0,0 +1,607 @@
---
applies_to:
  stack: ga 8.19.0
  serverless: ga 9.1.0
---

# Failure store [failure-store]

Failure stores are a secondary set of indices inside a data stream dedicated to storing failed documents. Failed documents are any documents that cause ingest pipeline exceptions or have a structure that conflicts with a data stream's mappings. These failures normally cause the indexing operation to fail, returning the error message in the response. When a data stream's failure store is enabled, these failures are instead captured and persisted for later analysis, and a successful response is returned to the client in the meantime.

## Set up a data stream failure store [set-up-failure-store]

Each data stream has its own failure store that can be enabled to accept failures. By default, this failure store is disabled and any ingestion problems are raised in the response to write operations.

### Set up for new data streams [set-up-failure-store-new]

You can specify in a data stream's template whether the failure store should be enabled when the data stream is first created. The `data_stream_options` field in a [template](../templates.md) contains the settings required to enable a data stream's failure store.
+ +:::{note} +Unlike the `settings` and `mappings` fields on an [index template](../templates.md) which are repeatedly applied to new data stream write indices over time, the `data_stream_options` section of a template is applied to a data stream only once when the data stream is first created. To configure existing data streams, use the put data stream options API. +::: + +To enable the failure store on a new data stream, enable it in the `data_stream_options` of the template: + +```console +PUT _index_template/my-index-template +{ + "index_patterns": ["my-datastream-*"], + "data_stream": { }, + "template": { + "data_stream_options": { <1> + "failure_store": { + "enabled": true <2> + } + } + } +} +``` + +1. The options for a data stream to be applied at creation time. +2. The failure store feature will be enabled for new data streams that match this template. + + +After a matching data stream is created, its failure store will be enabled. + +### Set up for existing data streams [set-up-failure-store-existing] + +Enabling the failure store via [index templates](../templates.md) can only affect data streams that are newly created. Existing data streams that use a template will not apply any changes to the template's `data_stream_options` after they have been created. + +To modify an existing data stream's options, use the [put data stream options](???) API: + +```console +PUT my-datastream-existing/_options +{ + "failure_store": { + "enabled": true <1> + } +} +``` + +1. The failure store option will now be enabled. + + +The failure store redirection can be suspended using this API as well. When the failure store is disabled, only failed document redirection is halted. Any existing failure data in the data stream will remain until removed by deletion or by retention. + +```console +PUT my-datastream-existing/_options +{ + "failure_store": { + "enabled": false <1> + } +} +``` + +1. Redirecting failed documents into the failure store will now be disabled. 
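For clients that manage many data streams programmatically, the calls above can be driven from any HTTP client. The following Python sketch builds the request path and body used by the put data stream options examples; the helper function is an illustration for this page, not part of {{es}} or its official clients:

```python
import json

def build_options_request(data_stream, enabled):
    """Illustrative helper: build the path and body used by the console
    examples above to toggle a data stream's failure store."""
    path = f"/{data_stream}/_options"
    body = json.dumps({"failure_store": {"enabled": enabled}})
    return path, body

# Enable the failure store, mirroring the first console example above.
path, body = build_options_request("my-datastream-existing", True)
# A real client would now send the request, for example:
#   requests.put(f"http://localhost:9200{path}", data=body,
#                headers={"Content-Type": "application/json"})
```

Disabling is the same call with `enabled` set to `false`; as noted above, disabling only halts redirection and leaves any existing failure data in place.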
+ +### Enable failure store via cluster setting [set-up-failure-store-cluster-setting] + +If you have a large number of existing data streams you may want an easier way to control if failures should be redirected. Instead of enabling the failure store using the [put data stream options](???) API, you can instead configure a set of patterns in the [cluster settings](???) which will enable the failure store feature by default. + +Configure a list of patterns using the `data_streams.failure_store.enabled` dynamic cluster setting. If a data stream matches a pattern in this setting and does not have the failure store explicitly disabled in its options, then the failure store will default to being enabled for that matching data stream. + +```console +PUT _cluster/settings +{ + "persistent" : { + "data_streams.failure_store.enabled" : [ "my-datastream-*", "logs-*" ] <1> + } +} +``` + +1. Indices that match `my-datastream-*` or `logs-*` will redirect failures to the failure store unless explicitly disabled. + +## Using a failure store [use-failure-store] + +The failure store is meant to ease the burden of detecting and handling failures when ingesting data to {{es}}. Clients are less likely to encounter unrecoverable failures when writing documents, and developers are more easily able to troubleshoot faulty pipelines and mappings. + +### Failure redirection [use-failure-store-redirect] + +Once a failure store is enabled for a data stream it will begin redirecting documents that fail due to common ingestion problems instead of returning errors in write operations. Clients are notified in a non-intrusive way when a document is redirected to the failure store. + +Each data stream's failure store is made up of a list of indices that are dedicated to storing failed documents. 
These indices function much like a data stream's normal backing indices: There is a write index that accepts failed documents, they can be rolled over, and are automatically cleaned up over time subject to a lifecycle policy. + +When a document bound for a data stream encounters a problem during its ingestion, the response is annotated with the `failure_store` field which describes how {{es}} responded to that problem. The `failure_store` field is present on both the [bulk](???) and [index](???) API responses when applicable. Clients can use this information to augment their behavior based on the response from {{es}}. + +Here we have a bulk operation that sends two documents. Both are writing to the `id` field which is mapped as a `long` field type. The first document will be accepted, but the second document would cause a failure because the value `invalid_text` cannot be parsed as a `long`. This second document will be redirected to the failure store: + +```console +POST my-datastream/_bulk +{"create":{}} +{"@timestamp": "2025-05-01T00:00:00Z", "id": 1234} <1> +{"create":{}} +{"@timestamp": "2025-05-01T00:00:00Z", "id": "invalid_text"} <2> +``` +1. A correctly formatted document. +2. Invalid document that cannot be parsed using the current mapping. + +```console-result +{ + "errors": false, <1> + "took": 400, + "items": [ + { + "create": { + "_index": ".ds-my-datastream-2025.05.01-000001", <2> + "_id": "YUvQipYB_ZAKuDfZRosB", + "_version": 1, + "result": "created", + "_shards": { + "total": 1, + "successful": 1, + "failed": 0 + }, + "_seq_no": 3, + "_primary_term": 1, + "status": 201 + } + }, + { + "create": { + "_index": ".fs-my-datastream-2025.05.01-000002", <3> + "_id": "lEu8jZYB_ZAKuDfZNouU", + "_version": 1, + "result": "created", + "_shards": { + "total": 1, + "successful": 1, + "failed": 0 + }, + "_seq_no": 10, + "_primary_term": 1, + "failure_store": "used", <4> + "status": 201 + } + } + ] +} +``` + +1. 
The response code is 200 OK, and the response body does not report any errors encountered. +2. The first document is accepted into the data stream's write index. +3. The second document encountered a problem during ingest and was redirected to the data stream's failure store. +4. The response is annotated with a field indicating that the failure store was used to persist the second document. + + +If the document was redirected to a data stream's failure store due to a problem, then the `failure_store` field on the response will be `used`, and the response will not return any error information: + +```console-result +{ + "_index": ".fs-my-datastream-2025.05.01-000002", <1> + "_id": "lEu8jZYB_ZAKuDfZNouU", + "_version": 1, + "result": "created", + "_shards": { + "total": 1, + "successful": 1, + "failed": 0 + }, + "_seq_no": 11, + "_primary_term": 1, + "failure_store": "used" <2> +} +``` + +1. The document for this index operation was sent to the failure store's write index. +2. The response is annotated with a flag indicating the document was redirected. + + +If the document could have been redirected to a data stream's failure store but the failure store was disabled, then the `failure_store` field on the response will be `not_enabled`, and the response will display the error encountered as normal. + +```console-result +{ + "error": { + "root_cause": [ <1> + { + "type": "document_parsing_exception", + "reason": "[1:53] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text'" + } + ], + "type": "document_parsing_exception", + "reason": "[1:53] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text'", + "caused_by": { + "type": "illegal_argument_exception", + "reason": "For input string: \"invalid_text\"" + }, + "failure_store": "not_enabled" <2> + }, + "status": 400 <3> +} +``` + +1. 
The failure is returned to the client as normal when the failure store is not enabled. +2. The response is annotated with a flag indicating the failure store could have accepted the document, but it was not enabled. +3. Status of 400 Bad Request due to the mapping problem. + + +If the document was redirected to a data stream's failure store but that failed document could not be stored (e.g. due to shard unavailability or a similar problem), then the `failure_store` field on the response will be `failed`, and the response will display the error for the original failure, as well as a suppressed error detailing why the failure could not be stored: + +```console-result +{ + "error": { + "root_cause": [ + { + "type": "document_parsing_exception", <1> + "reason": "[1:53] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text'", + "suppressed": [ + { + "type": "cluster_block_exception", <2> + "reason": "index [.fs-my-datastream-2025.05.01-000002] blocked by: [FORBIDDEN/5/index read-only (api)];" + } + ] + } + ], + "type": "document_parsing_exception", <3> + "reason": "[1:53] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text'", + "caused_by": { + "type": "illegal_argument_exception", + "reason": "For input string: \"invalid_text\"" + }, + "suppressed": [ + { + "type": "cluster_block_exception", + "reason": "index [.fs-my-datastream-2025.05.01-000002] blocked by: [FORBIDDEN/5/index read-only (api)];" + } + ], + "failure_store": "failed" <4> + }, + "status": 400 <5> +} +``` + +1. The root cause of the problem was a mapping mismatch. +2. The document could not be redirected because the failure store was not able to accept writes at this time due to an unforeseeable issue. +3. The complete exception tree is present on the response. +4. 
The response is annotated with a flag indicating the failure store would have accepted the document, but it was not able to. +5. Status of 400 Bad Request due to the original mapping problem. + + +### Searching failures [use-failure-store-searching] + +Once you have accumulated some failures, they can be searched much like a regular index. + +:::{warning} +Documents redirected to the failure store in the event of a failed ingest pipeline will be stored in their original, unprocessed form. If an ingest pipeline normally redacts sensitive information from a document, then failed documents in their original, unprocessed form may contain sensitive information. + +Furthermore, failed documents are likely to be structured differently than normal data in a data stream, and thus are not supported by [document level security](???) or [field level security](???). + +To limit visibility on potentially sensitive data, users require the [`read_failure_store`](???) index privilege for a data stream in order to search that data stream's failure store data. +::: + +Searching a data stream's failure store can be done by making use of the existing search APIs available in {{es}}. + +To indicate that the search should be performed on failure store data, use the [index component selector syntax](???) to indicate which part of the data stream to target in the search operation. Appending the `::failures` suffix to the name of the data stream indicates that the operation should be performed against that data stream's failure store instead of its regular backing indices. + +:::::{tab-set} + +::::{tab-item} {{esql}} +```console +POST _query?format=txt +{ + "query": """FROM my-datastream::failures | DROP error.stack_trace | LIMIT 1""" <1> +} +``` +1. We drop the `error.stack_trace` field here just to keep the example free of newlines. 
+ +An example of a search result with the failed document present: + +```console-result + @timestamp | document.id |document.index |document.routing| error.message |error.pipeline |error.pipeline_trace|error.processor_tag|error.processor_type| error.type +------------------------+--------------------+---------------+----------------+-------------------------------------------------------------------------------------------------------------------------------------+---------------+--------------------+-------------------+--------------------+-------------------------- +2025-05-01T12:00:00.000Z|Y0vQipYB_ZAKuDfZR4sR|my-datastream |null |[1:45] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text'|null |null |null |null |document_parsing_exception +``` + +:::{note} +Because the `document.source` field is unmapped, it is absent from the {{esql}} results. +::: + +:::: + +::::{tab-item} _search API +```console +GET my-datastream::failures/_search +``` + +An example of a search result with the failed document present: + +```console-result +{ + "took": 0, + "timed_out": false, + "_shards": { + "total": 1, + "successful": 1, + "skipped": 0, + "failed": 0 + }, + "hits": { + "total": { + "value": 1, + "relation": "eq" + }, + "max_score": 1, + "hits": [ + { + "_index": ".fs-my-datastream-2025.05.01-000002", <1> + "_id": "lEu8jZYB_ZAKuDfZNouU", + "_score": 1, + "_source": { + "@timestamp": "2025-05-01T12:00:00.000Z", <2> + "document": { <3> + "id": "Y0vQipYB_ZAKuDfZR4sR", + "index": "my-datastream", + "source": { + "@timestamp": "2025-05-01T00:00:00Z", + "id": "invalid_text" + } + }, + "error": { <4> + "type": "document_parsing_exception", + "message": "[1:53] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. 
Preview of field's value: 'invalid_text'", + "stack_trace": """o.e.i.m.DocumentParsingException: [1:53] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text' + at o.e.i.m.FieldMapper.rethrowAsDocumentParsingException(FieldMapper.java:241) + at o.e.i.m.FieldMapper.parse(FieldMapper.java:194) + ... 24 more +Caused by: j.l.IllegalArgumentException: For input string: "invalid_text" + at o.e.x.s.AbstractXContentParser.toLong(AbstractXContentParser.java:189) + at o.e.x.s.AbstractXContentParser.longValue(AbstractXContentParser.java:210) + ... 31 more +""" + } + } + } + ] + } +} +``` + +1. The document belongs to a failure store index on the data stream. +2. The failure document timestamp is when the failure occurred in {{es}}. +3. The document that was sent is captured inside the failure document. Failure documents capture the id of the document at time of failure, along with which data stream the document was being written to, and the contents of the document. The `document.source` fields are unmapped to ensure failures are always captured. +4. The failure document captures information about the error encountered, like the type of error, the error message, and a compressed stack trace. 
+:::: + +::::{tab-item} SQL +```console +POST _sql?format=txt +{ + "query": """SELECT * FROM "my-datastream::failures" LIMIT 1""" +} +``` + +An example of a search result with the failed document present: + +```console-result + @timestamp | document.id |document.index |document.routing| error.message |error.pipeline |error.pipeline_trace|error.processor_tag|error.processor_type| error.stack_trace | error.type +------------------------+--------------------+---------------+----------------+-------------------------------------------------------------------------------------------------------------------------------------+---------------+--------------------+-------------------+--------------------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------- +2025-05-05T20:49:10.899Z|sXk1opYBL1dfU_1htCAE|my-datastream |null |[1:45] failed to parse field [id] of type [long] in document with id 'sXk1opYBL1dfU_1htCAE'. Preview of field's value: 'invalid_text'|null |null |null |null |o.e.i.m.DocumentParsingException: [1:45] failed to parse field [id] of type [long] in document with id 'sXk1opYBL1dfU_1htCAE'. Preview of field's value: 'invalid_text' + at o.e.i.m.FieldMapper.rethrowAsDocumentParsingException(FieldMapper.java:241) + at o.e.i.m.FieldMapper.parse(FieldMapper.java:194) + ... 
19 more +Caused by: j.l.IllegalArgumentException: For input string: "invalid_text" + at o.e.x.s.AbstractXContentParser.toLong(AbstractXContentParser.java:189) + at o.e.x.s.AbstractXContentParser.longValue(AbstractXContentParser.java:210) + ... 26 more +|document_parsing_exception +``` + +:::{note} +Because the `document.source` field is unmapped, it is absent from the SQL results. +::: +:::: +::::: + +Failure documents have a uniform structure that is handled internally by {{es}}. + +`@timestamp` +: (`date`) The timestamp at which the document encountered a failure in {{es}}. + +`document` +: (`object`) The document at time of failure. If the document failed in an ingest pipeline, then the document will be the unprocessed version of the document as it arrived in the original indexing request. If the document failed due to a mapping issue, then the document will be as it was after any ingest pipelines were applied to it. + + `document.id` + : (`keyword`) The id of the original document at the time of failure. + + `document.routing` + : (`keyword`, optional) The routing of the original document at the time of failure if it was specified. + + `document.index` + : (`keyword`) The index that the document was being written to when it failed. + + `document.source` + : (unmapped object) The body of the document. This field is unmapped and unindexed to ensure failures are indexed reliably. If you need to include fields from the document source in your queries, use [runtime fields](???) on the search request. + +`error` +: (`object`) Information about the failure that prevented this document from being indexed. + + `error.message` + : (`match_only_text`) The error message that describes the failure. + + `error.stack_trace` + : (`text`) A compressed stack trace from {{es}} for the failure. + + `error.type` + : (`keyword`) The type classification of failure. Values are the same type returned within failed indexing API responses. 
+ + `error.pipeline` + : (`keyword`, optional) If the failure occurred in an ingest pipeline, this will contain the name of the pipeline. + + `error.pipeline_trace` + : (`keyword`, optional array) If the failure occurred in an ingest pipeline, this will contain the list of pipelines that the document had visited up until the failure. + + `error.processor_tag` + : (`keyword`, optional) If the failure occurred in an ingest processor that is annotated with a [tag](???), the tag contents will be present here. + + `error.processor_type` + : (`keyword`, optional) If the failure occurred in an ingest processor, this will contain the processor type. (e.g. `script`, `append`, `enrich`, etc.) + + +## Manage a data stream's failure store [manage-failure-store] + +Failure data can accumulate in a data stream over time. To help manage this accumulation, most administrative operations that can be done on a data stream can be applied to the data stream's failure store. + +### Failure store rollover [manage-failure-store-rollover] + +A data stream treats its failure store much like a secondary set of [backing indices](???). Multiple dedicated hidden indices serve search requests for the failure store, while one index acts as the current write index. You can use the [rollover](???) API to rollover the failure store. Much like the regular indices in a data stream, a new write index will be created in the failure store to accept new failure documents. + +```console +POST my-datastream::failures/_rollover +``` + +```console-result +{ + "acknowledged": true, + "shards_acknowledged": true, + "old_index": ".fs-my-datastream-2025.05.01-000002", + "new_index": ".fs-my-datastream-2025.05.01-000003", + "rolled_over": true, + "dry_run": false, + "lazy": false, + "conditions": {} +} +``` + +### Failure store lifecycle [manage-failure-store-lifecycle] + +Failure stores have their retention managed using an internal [data stream lifecycle](???). 
A thirty day (30d) retention is applied to failure store data. You can view the active lifecycle for a failure store index by calling the [get data stream API](???): + +```console +GET _data_stream/my-datastream +``` + +```console-result +{ + "data_streams": [ + { + "name": "my-datastream", + "timestamp_field": { + "name": "@timestamp" + }, + "indices": [ + { + "index_name": ".ds-my-datastream-2025.05.01-000001", + "index_uuid": "jUbUNf-8Re-Nca8vJkHnkA", + "managed_by": "Data stream lifecycle", + "prefer_ilm": true, + "index_mode": "standard" + } + ], + "generation": 2, + "status": "GREEN", + "template": "my-datastream-template", + "lifecycle": { + "enabled": true + }, + "next_generation_managed_by": "Data stream lifecycle", + "prefer_ilm": true, + "hidden": false, + "system": false, + "allow_custom_routing": false, + "replicated": false, + "rollover_on_write": false, + "index_mode": "standard", + "failure_store": { <1> + "enabled": true, + "rollover_on_write": false, + "indices": [ + { + "index_name": ".fs-my-datastream-2025.05.05-000002", + "index_uuid": "oYS2WsjkSKmdazWuS4RP9Q", + "managed_by": "Data stream lifecycle" <2> + } + ], + "lifecycle": { + "enabled": true, + "effective_retention": "30d", <3> + "retention_determined_by": "default_failures_retention" <4> + } + } + } + ] +} +``` +1. Information about the failure store is presented in the response under its own field. +2. Indices are managed by data stream lifecycles by default. +3. An effective retention period of thirty days (30d) is present by default. +4. The retention is currently determined by the default. + +:::{note} +The default retention respects any maximum retention values. If [maximum retention](???) is configured lower than thirty days then the maximum retention will be used as the default value. +::: + +You can update the default retention period for failure stores in your deployment by updating the `data_streams.lifecycle.retention.failures_default` cluster setting. 
Data streams that have no retention configured on their failure stores will use this value to determine their retention period.

```console
PUT _cluster/settings
{
  "persistent": {
    "data_streams.lifecycle.retention.failures_default": "15d"
  }
}
```

You can also specify the failure store retention period for a data stream on its data stream options. These can be specified via the index template for new data streams, or via the [put data stream options](???) API for existing data streams.

```console
PUT _data_stream/my-datastream/_options
{
  "failure_store": {
    "enabled": true, <1>
    "lifecycle": {
      "data_retention": "10d" <2>
    }
  }
}
```
1. Ensure that the failure store remains enabled.
2. Set only this data stream's failure store retention to ten days.

### Add and remove from failure store [manage-failure-store-indices]

Failure stores support adding and removing indices from them using the [modify data stream](???) API.

```console
POST _data_stream/_modify
{
  "actions":[
    {
      "remove_backing_index": { <1>
        "data_stream": "my-datastream",
        "index": ".fs-my-datastream-2025.05.05-000002", <2>
        "failure_store": true <3>
      }
    },
    {
      "add_backing_index": { <4>
        "data_stream": "my-datastream",
        "index": "restored-failure-index", <5>
        "failure_store": true <6>
      }
    }
  ]
}
```
1. Action to remove a backing index.
2. The name of an auto-generated failure store index that should be removed.
3. Set `failure_store` to true to have the modify API operate on the data stream's failure store.
4. Action to add a backing index.
5. The name of an index that should be added to the failure store.
6. Set `failure_store` to true to have the modify API operate on the data stream's failure store.

This API gives you fine-grained control over the indices in your failure store, allowing you to manage backup and restoration operations as well as isolate failure data for later remediation.
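As a sketch of the kind of automation this enables, the Python snippet below builds a `_modify` request body that detaches every failure index listed in a get data stream response (shaped like the example earlier on this page). The helper function is a hypothetical illustration, not an {{es}} API:

```python
def build_remove_actions(data_stream, failure_store):
    """Illustrative helper: build modify-data-stream actions that detach
    every failure index listed for a data stream.

    `failure_store` is the "failure_store" object from a get data stream
    response; the helper itself is a sketch, not an Elasticsearch API.
    """
    actions = []
    for index in failure_store.get("indices", []):
        actions.append({
            "remove_backing_index": {
                "data_stream": data_stream,
                "index": index["index_name"],
                "failure_store": True,
            }
        })
    return {"actions": actions}

# Input shaped like the get data stream response shown earlier on this page.
failure_store = {
    "enabled": True,
    "indices": [{"index_name": ".fs-my-datastream-2025.05.05-000002"}],
}
request_body = build_remove_actions("my-datastream", failure_store)
# request_body is the body for: POST _data_stream/_modify
```

The detached indices could then be snapshotted or reindexed before being re-attached with an `add_backing_index` action.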
## Failure store recipes and use cases [recipes]

TBD

### Troubleshooting ingest pipelines effectively [recipes-ingest-troubleshoot]

TBD

### Alerting on failed ingestion [recipes-alerting]

TBD

### Data remediation [recipes-remediation]

TBD
diff --git a/manage-data/toc.yml b/manage-data/toc.yml
index 9275294ab..72cafaace 100644
--- a/manage-data/toc.yml
+++ b/manage-data/toc.yml
@@ -20,6 +20,7 @@ toc:
      - file: data-store/data-streams/run-downsampling-using-data-stream-lifecycle.md
      - file: data-store/data-streams/reindex-tsds.md
      - file: data-store/data-streams/logs-data-stream.md
+      - file: data-store/data-streams/failure-store.md
  - file: data-store/mapping.md
    children:
      - file: data-store/mapping/dynamic-mapping.md

From 32d93367c191a334c72b45919324c133fc9c9eac Mon Sep 17 00:00:00 2001
From: James Baiera
Date: Tue, 6 May 2025 02:44:15 -0400
Subject: [PATCH 02/17] Change ??? to "wip"

---
 .../data-store/data-streams/failure-store.md | 28 ++++++++++---------
 1 file changed, 15 insertions(+), 13 deletions(-)

diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md
index 501718fd0..beb1c87b2 100644
--- a/manage-data/data-store/data-streams/failure-store.md
+++ b/manage-data/data-store/data-streams/failure-store.md
@@ -47,7 +47,7 @@ After a matching data stream is created, its failure store will be enabled.

Enabling the failure store via [index templates](../templates.md) can only affect data streams that are newly created. Existing data streams that use a template will not apply any changes to the template's `data_stream_options` after they have been created.

-To modify an existing data stream's options, use the [put data stream options](???)
API: +To modify an existing data stream's options, use the [put data stream options](wip) API: ```console PUT my-datastream-existing/_options @@ -76,7 +76,7 @@ PUT my-datastream-existing/_options ### Enable failure store via cluster setting [set-up-failure-store-cluster-setting] -If you have a large number of existing data streams you may want an easier way to control if failures should be redirected. Instead of enabling the failure store using the [put data stream options](???) API, you can instead configure a set of patterns in the [cluster settings](???) which will enable the failure store feature by default. +If you have a large number of existing data streams you may want an easier way to control if failures should be redirected. Instead of enabling the failure store using the [put data stream options](wip) API, you can instead configure a set of patterns in the [cluster settings](wip) which will enable the failure store feature by default. Configure a list of patterns using the `data_streams.failure_store.enabled` dynamic cluster setting. If a data stream matches a pattern in this setting and does not have the failure store explicitly disabled in its options, then the failure store will default to being enabled for that matching data stream. @@ -101,7 +101,7 @@ Once a failure store is enabled for a data stream it will begin redirecting docu Each data stream's failure store is made up of a list of indices that are dedicated to storing failed documents. These indices function much like a data stream's normal backing indices: There is a write index that accepts failed documents, they can be rolled over, and are automatically cleaned up over time subject to a lifecycle policy. -When a document bound for a data stream encounters a problem during its ingestion, the response is annotated with the `failure_store` field which describes how {{es}} responded to that problem. The `failure_store` field is present on both the [bulk](???) and [index](???) 
API responses when applicable. Clients can use this information to augment their behavior based on the response from {{es}}. +When a document bound for a data stream encounters a problem during its ingestion, the response is annotated with the `failure_store` field which describes how {{es}} responded to that problem. The `failure_store` field is present on both the [bulk](wip) and [index](wip) API responses when applicable. Clients can use this information to augment their behavior based on the response from {{es}}. Here we have a bulk operation that sends two documents. Both are writing to the `id` field which is mapped as a `long` field type. The first document will be accepted, but the second document would cause a failure because the value `invalid_text` cannot be parsed as a `long`. This second document will be redirected to the failure store: @@ -263,14 +263,14 @@ Once you have accumulated some failures, they can be searched much like a regula :::{warning} Documents redirected to the failure store in the event of a failed ingest pipeline will be stored in their original, unprocessed form. If an ingest pipeline normally redacts sensitive information from a document, then failed documents in their original, unprocessed form may contain sensitive information. -Furthermore, failed documents are likely to be structured differently than normal data in a data stream, and thus are not supported by [document level security](???) or [field level security](???). +Furthermore, failed documents are likely to be structured differently than normal data in a data stream, and thus are not supported by [document level security](wip) or [field level security](wip). -To limit visibility on potentially sensitive data, users require the [`read_failure_store`](???) index privilege for a data stream in order to search that data stream's failure store data. 
+To limit visibility on potentially sensitive data, users require the [`read_failure_store`](wip) index privilege for a data stream in order to search that data stream's failure store data. ::: Searching a data stream's failure store can be done by making use of the existing search APIs available in {{es}}. -To indicate that the search should be performed on failure store data, use the [index component selector syntax](???) to indicate which part of the data stream to target in the search operation. Appending the `::failures` suffix to the name of the data stream indicates that the operation should be performed against that data stream's failure store instead of its regular backing indices. +To indicate that the search should be performed on failure store data, use the [index component selector syntax](wip) to indicate which part of the data stream to target in the search operation. Appending the `::failures` suffix to the name of the data stream indicates that the operation should be performed against that data stream's failure store instead of its regular backing indices. :::::{tab-set} @@ -409,7 +409,7 @@ Failure documents have a uniform structure that is handled internally by {{es}}. : (`keyword`) The index that the document was being written to when it failed. `document.source` - : (unmapped object) The body of the document. This field is unmapped and unindexed to ensure failures are indexed reliably. If you need to include fields from the document source in your queries, use [runtime fields](???) on the search request. + : (unmapped object) The body of the document. This field is unmapped and unindexed to ensure failures are indexed reliably. If you need to include fields from the document source in your queries, use [runtime fields](wip) on the search request. `error` : (`object`) Information about the failure that prevented this document from being indexed. @@ -430,7 +430,7 @@ Failure documents have a uniform structure that is handled internally by {{es}}. 
: (`keyword`, optional array) If the failure occurred in an ingest pipeline, this will contain the list of pipelines that the document had visited up until the failure. `error.processor_tag` - : (`keyword`, optional) If the failure occurred in an ingest processor that is annotated with a [tag](???), the tag contents will be present here. + : (`keyword`, optional) If the failure occurred in an ingest processor that is annotated with a [tag](wip), the tag contents will be present here. `error.processor_type` : (`keyword`, optional) If the failure occurred in an ingest processor, this will contain the processor type (e.g. `script`, `append`, `enrich`). @@ -442,7 +442,7 @@ Failure data can accumulate in a data stream over time. To help manage this accu ### Failure store rollover [manage-failure-store-rollover] -A data stream treats its failure store much like a secondary set of [backing indices](???). Multiple dedicated hidden indices serve search requests for the failure store, while one index acts as the current write index. You can use the [rollover](???) API to rollover the failure store. Much like the regular indices in a data stream, a new write index will be created in the failure store to accept new failure documents. +A data stream treats its failure store much like a secondary set of [backing indices](wip). Multiple dedicated hidden indices serve search requests for the failure store, while one index acts as the current write index. You can use the [rollover](wip) API to roll over the failure store. Much like the regular indices in a data stream, a new write index will be created in the failure store to accept new failure documents. ```console POST my-datastream::failures/_rollover @@ -463,7 +463,7 @@ POST my-datastream::failures/_rollover ### Failure store lifecycle [manage-failure-store-lifecycle] -Failure stores have their retention managed using an internal [data stream lifecycle](???). A thirty day (30d) retention is applied to failure store data.
You can view the active lifecycle for a failure store index by calling the [get data stream API](???): +Failure stores have their retention managed using an internal [data stream lifecycle](wip). A thirty-day (30d) retention is applied to failure store data. You can view the active lifecycle for a failure store index by calling the [get data stream API](wip): ```console GET _data_stream/my-datastream @@ -526,7 +526,7 @@ GET _data_stream/my-datastream 4. The retention is currently determined by the default. :::{note} -The default retention respects any maximum retention values. If [maximum retention](???) is configured lower than thirty days then the maximum retention will be used as the default value. +The default retention respects any maximum retention values. If [maximum retention](wip) is configured lower than thirty days, then the maximum retention will be used as the default value. ::: You can update the default retention period for failure stores in your deployment by updating the `data_streams.lifecycle.retention.failures_default` cluster setting. Data streams that have no retention configured on their failure stores will use this value to determine their retention period. @@ -540,7 +540,7 @@ PUT _cluster/settings } ``` -You can also specify the failure store retention period for a data stream on its data stream options. These can be specified via the index template for new data streams, or via the [put data stream options](???) API for existing data streams. +You can also set the failure store retention period for a data stream in its data stream options. This can be done via the index template for new data streams, or via the [put data stream options](wip) API for existing data streams.
```console PUT _data_stream/my-datastream/_options @@ -558,7 +558,7 @@ PUT _data_stream/my-datastream/_options ### Add and remove from failure store [manage-failure-store-indices] -Failure stores support adding and removing indices from them using the [modify data stream](???) API. +Failure stores support adding and removing indices from them using the [modify data stream](wip) API. ```console POST _data_stream/_modify @@ -605,3 +605,5 @@ TBD ### Data remediation [recipes-remediation] TBD + +# WIP [wip] From 356cd6208788533ebc70ec404fbc1866eaee0363 Mon Sep 17 00:00:00 2001 From: James Baiera Date: Tue, 6 May 2025 10:03:05 -0400 Subject: [PATCH 03/17] Fix wip links --- .../data-store/data-streams/failure-store.md | 28 ++++++++++--------- 1 file changed, 15 insertions(+), 13 deletions(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index beb1c87b2..f432859ed 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -47,7 +47,7 @@ After a matching data stream is created, its failure store will be enabled. Enabling the failure store via [index templates](../templates.md) can only affect data streams that are newly created. Existing data streams that use a template will not apply any changes to the template's `data_stream_options` after they have been created. -To modify an existing data stream's options, use the [put data stream options](wip) API: +To modify an existing data stream's options, use the [put data stream options](failure-store#wip) API: ```console PUT my-datastream-existing/_options @@ -76,7 +76,7 @@ PUT my-datastream-existing/_options ### Enable failure store via cluster setting [set-up-failure-store-cluster-setting] -If you have a large number of existing data streams you may want an easier way to control if failures should be redirected. 
Instead of enabling the failure store using the [put data stream options](wip) API, you can instead configure a set of patterns in the [cluster settings](wip) which will enable the failure store feature by default. +If you have a large number of existing data streams, you may want an easier way to control whether failures should be redirected. Instead of enabling the failure store using the [put data stream options](failure-store#wip) API, you can configure a set of patterns in the [cluster settings](failure-store#wip) that will enable the failure store feature by default. Configure a list of patterns using the `data_streams.failure_store.enabled` dynamic cluster setting. If a data stream matches a pattern in this setting and does not have the failure store explicitly disabled in its options, then the failure store will default to being enabled for that matching data stream. @@ -101,7 +101,7 @@ Once a failure store is enabled for a data stream it will begin redirecting docu Each data stream's failure store is made up of a list of indices that are dedicated to storing failed documents. These indices function much like a data stream's normal backing indices: there is a write index that accepts failed documents, the indices can be rolled over, and they are automatically cleaned up over time subject to a lifecycle policy. -When a document bound for a data stream encounters a problem during its ingestion, the response is annotated with the `failure_store` field which describes how {{es}} responded to that problem. The `failure_store` field is present on both the [bulk](wip) and [index](wip) API responses when applicable. Clients can use this information to augment their behavior based on the response from {{es}}. +When a document bound for a data stream encounters a problem during its ingestion, the response is annotated with the `failure_store` field, which describes how {{es}} responded to that problem.
The `failure_store` field is present on both the [bulk](failure-store#wip) and [index](failure-store#wip) API responses when applicable. Clients can use this information to augment their behavior based on the response from {{es}}. Here we have a bulk operation that sends two documents. Both write to the `id` field, which is mapped as a `long` field type. The first document will be accepted, but the second document would cause a failure because the value `invalid_text` cannot be parsed as a `long`. This second document will be redirected to the failure store: @@ -263,14 +263,14 @@ Once you have accumulated some failures, they can be searched much like a regula :::{warning} Documents redirected to the failure store in the event of a failed ingest pipeline will be stored in their original, unprocessed form. If an ingest pipeline normally redacts sensitive information from a document, then failed documents in their original, unprocessed form may contain sensitive information. -Furthermore, failed documents are likely to be structured differently than normal data in a data stream, and thus are not supported by [document level security](wip) or [field level security](wip). -To limit visibility on potentially sensitive data, users require the [`read_failure_store`](wip) index privilege for a data stream in order to search that data stream's failure store data. +Furthermore, failed documents are likely to be structured differently than normal data in a data stream, and thus are not supported by [document level security](failure-store#wip) or [field level security](failure-store#wip). + +To limit visibility of potentially sensitive data, users require the [`read_failure_store`](failure-store#wip) index privilege for a data stream to search that data stream's failure store data. ::: Searching a data stream's failure store can be done by making use of the existing search APIs available in {{es}}.
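For example, a minimal failure store search could look like the sketch below. This is a hedged illustration: it assumes a data stream named `my-datastream` whose failure store already contains documents, and `document_parsing_exception` is just one possible `error.type` value, depending on how the documents failed. The `::failures` suffix selects the failure store component of the data stream.

```console
GET my-datastream::failures/_search
{
  "query": {
    "term": {
      "error.type": "document_parsing_exception"
    }
  }
}
```

The hits returned are failure documents, so the original document body is available under `document.source` and the failure details under `error`.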
-To indicate that the search should be performed on failure store data, use the [index component selector syntax](wip) to indicate which part of the data stream to target in the search operation. Appending the `::failures` suffix to the name of the data stream indicates that the operation should be performed against that data stream's failure store instead of its regular backing indices. +To search failure store data, use the [index component selector syntax](failure-store#wip) to specify which part of the data stream to target in the search operation. Appending the `::failures` suffix to the name of the data stream indicates that the operation should be performed against that data stream's failure store instead of its regular backing indices. :::::{tab-set} @@ -409,7 +409,7 @@ Failure documents have a uniform structure that is handled internally by {{es}}. : (`keyword`) The index that the document was being written to when it failed. `document.source` - : (unmapped object) The body of the document. This field is unmapped and unindexed to ensure failures are indexed reliably. If you need to include fields from the document source in your queries, use [runtime fields](wip) on the search request. + : (unmapped object) The body of the document. This field is unmapped and unindexed to ensure failures are indexed reliably. If you need to include fields from the document source in your queries, use [runtime fields](failure-store#wip) on the search request. `error` : (`object`) Information about the failure that prevented this document from being indexed. @@ -430,7 +430,7 @@ Failure documents have a uniform structure that is handled internally by {{es}}. : (`keyword`, optional array) If the failure occurred in an ingest pipeline, this will contain the list of pipelines that the document had visited up until the failure.
`error.processor_tag` - : (`keyword`, optional) If the failure occurred in an ingest processor that is annotated with a [tag](wip), the tag contents will be present here. + : (`keyword`, optional) If the failure occurred in an ingest processor that is annotated with a [tag](failure-store#wip), the tag contents will be present here. `error.processor_type` : (`keyword`, optional) If the failure occurred in an ingest processor, this will contain the processor type (e.g. `script`, `append`, `enrich`). @@ -442,7 +442,7 @@ Failure data can accumulate in a data stream over time. To help manage this accu ### Failure store rollover [manage-failure-store-rollover] -A data stream treats its failure store much like a secondary set of [backing indices](wip). Multiple dedicated hidden indices serve search requests for the failure store, while one index acts as the current write index. You can use the [rollover](wip) API to rollover the failure store. Much like the regular indices in a data stream, a new write index will be created in the failure store to accept new failure documents. +A data stream treats its failure store much like a secondary set of [backing indices](failure-store#wip). Multiple dedicated hidden indices serve search requests for the failure store, while one index acts as the current write index. You can use the [rollover](failure-store#wip) API to roll over the failure store. Much like the regular indices in a data stream, a new write index will be created in the failure store to accept new failure documents. ```console POST my-datastream::failures/_rollover @@ -463,7 +463,7 @@ POST my-datastream::failures/_rollover ### Failure store lifecycle [manage-failure-store-lifecycle] -Failure stores have their retention managed using an internal [data stream lifecycle](wip). A thirty day (30d) retention is applied to failure store data.
You can view the active lifecycle for a failure store index by calling the [get data stream API](wip): +Failure stores have their retention managed using an internal [data stream lifecycle](failure-store#wip). A thirty-day (30d) retention is applied to failure store data. You can view the active lifecycle for a failure store index by calling the [get data stream API](failure-store#wip): ```console GET _data_stream/my-datastream @@ -526,7 +526,7 @@ GET _data_stream/my-datastream 4. The retention is currently determined by the default. :::{note} -The default retention respects any maximum retention values. If [maximum retention](wip) is configured lower than thirty days then the maximum retention will be used as the default value. +The default retention respects any maximum retention values. If [maximum retention](failure-store#wip) is configured lower than thirty days, then the maximum retention will be used as the default value. ::: You can update the default retention period for failure stores in your deployment by updating the `data_streams.lifecycle.retention.failures_default` cluster setting. Data streams that have no retention configured on their failure stores will use this value to determine their retention period. @@ -540,7 +540,7 @@ PUT _cluster/settings } ``` -You can also specify the failure store retention period for a data stream on its data stream options. These can be specified via the index template for new data streams, or via the [put data stream options](wip) API for existing data streams. +You can also set the failure store retention period for a data stream in its data stream options. This can be done via the index template for new data streams, or via the [put data stream options](failure-store#wip) API for existing data streams.
```console PUT _data_stream/my-datastream/_options @@ -558,7 +558,7 @@ PUT _data_stream/my-datastream/_options ### Add and remove from failure store [manage-failure-store-indices] -Failure stores support adding and removing indices from them using the [modify data stream](wip) API. +Failure stores support adding and removing indices from them using the [modify data stream](failure-store#wip) API. ```console POST _data_stream/_modify @@ -607,3 +607,5 @@ TBD TBD # WIP [wip] + +Placeholder link \ No newline at end of file From 6c18018dbf39826ca75d64e110d333fb33c7373d Mon Sep 17 00:00:00 2001 From: James Baiera Date: Tue, 6 May 2025 13:35:10 -0400 Subject: [PATCH 04/17] fix wip please? --- .../data-store/data-streams/failure-store.md | 26 +++++++++---------- 1 file changed, 13 insertions(+), 13 deletions(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index f432859ed..d9e9c99d4 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -47,7 +47,7 @@ After a matching data stream is created, its failure store will be enabled. Enabling the failure store via [index templates](../templates.md) can only affect data streams that are newly created. Existing data streams that use a template will not apply any changes to the template's `data_stream_options` after they have been created. -To modify an existing data stream's options, use the [put data stream options](failure-store#wip) API: +To modify an existing data stream's options, use the [put data stream options](#wip) API: ```console PUT my-datastream-existing/_options @@ -76,7 +76,7 @@ PUT my-datastream-existing/_options ### Enable failure store via cluster setting [set-up-failure-store-cluster-setting] -If you have a large number of existing data streams you may want an easier way to control if failures should be redirected. 
Instead of enabling the failure store using the [put data stream options](#wip) API, you can configure a set of patterns in the [cluster settings](#wip) that will enable the failure store feature by default. Configure a list of patterns using the `data_streams.failure_store.enabled` dynamic cluster setting. If a data stream matches a pattern in this setting and does not have the failure store explicitly disabled in its options, then the failure store will default to being enabled for that matching data stream. @@ -101,7 +101,7 @@ Once a failure store is enabled for a data stream it will begin redirecting docu Each data stream's failure store is made up of a list of indices that are dedicated to storing failed documents. These indices function much like a data stream's normal backing indices: there is a write index that accepts failed documents, the indices can be rolled over, and they are automatically cleaned up over time subject to a lifecycle policy. -When a document bound for a data stream encounters a problem during its ingestion, the response is annotated with the `failure_store` field which describes how {{es}} responded to that problem. The `failure_store` field is present on both the [bulk](failure-store#wip) and [index](failure-store#wip) API responses when applicable. Clients can use this information to augment their behavior based on the response from {{es}}. +When a document bound for a data stream encounters a problem during its ingestion, the response is annotated with the `failure_store` field, which describes how {{es}} responded to that problem.
The `failure_store` field is present on both the [bulk](#wip) and [index](#wip) API responses when applicable. Clients can use this information to augment their behavior based on the response from {{es}}. Here we have a bulk operation that sends two documents. Both write to the `id` field, which is mapped as a `long` field type. The first document will be accepted, but the second document would cause a failure because the value `invalid_text` cannot be parsed as a `long`. This second document will be redirected to the failure store: @@ -263,14 +263,14 @@ Once you have accumulated some failures, they can be searched much like a regula :::{warning} Documents redirected to the failure store in the event of a failed ingest pipeline will be stored in their original, unprocessed form. If an ingest pipeline normally redacts sensitive information from a document, then failed documents in their original, unprocessed form may contain sensitive information. -Furthermore, failed documents are likely to be structured differently than normal data in a data stream, and thus are not supported by [document level security](failure-store#wip) or [field level security](failure-store#wip). +Furthermore, failed documents are likely to be structured differently than normal data in a data stream, and thus are not supported by [document level security](#wip) or [field level security](#wip). -To limit visibility on potentially sensitive data, users require the [`read_failure_store`](failure-store#wip) index privilege for a data stream in order to search that data stream's failure store data. +To limit visibility of potentially sensitive data, users require the [`read_failure_store`](#wip) index privilege for a data stream to search that data stream's failure store data. ::: Searching a data stream's failure store can be done by making use of the existing search APIs available in {{es}}.
-To indicate that the search should be performed on failure store data, use the [index component selector syntax](failure-store#wip) to indicate which part of the data stream to target in the search operation. Appending the `::failures` suffix to the name of the data stream indicates that the operation should be performed against that data stream's failure store instead of its regular backing indices. +To search failure store data, use the [index component selector syntax](#wip) to specify which part of the data stream to target in the search operation. Appending the `::failures` suffix to the name of the data stream indicates that the operation should be performed against that data stream's failure store instead of its regular backing indices. :::::{tab-set} @@ -409,7 +409,7 @@ Failure documents have a uniform structure that is handled internally by {{es}}. : (`keyword`) The index that the document was being written to when it failed. `document.source` - : (unmapped object) The body of the document. This field is unmapped and unindexed to ensure failures are indexed reliably. If you need to include fields from the document source in your queries, use [runtime fields](failure-store#wip) on the search request. + : (unmapped object) The body of the document. This field is unmapped and unindexed to ensure failures are indexed reliably. If you need to include fields from the document source in your queries, use [runtime fields](#wip) on the search request. `error` : (`object`) Information about the failure that prevented this document from being indexed. @@ -430,7 +430,7 @@ Failure documents have a uniform structure that is handled internally by {{es}}. : (`keyword`, optional array) If the failure occurred in an ingest pipeline, this will contain the list of pipelines that the document had visited up until the failure.
`error.processor_tag` - : (`keyword`, optional) If the failure occurred in an ingest processor that is annotated with a [tag](failure-store#wip), the tag contents will be present here. + : (`keyword`, optional) If the failure occurred in an ingest processor that is annotated with a [tag](#wip), the tag contents will be present here. `error.processor_type` : (`keyword`, optional) If the failure occurred in an ingest processor, this will contain the processor type (e.g. `script`, `append`, `enrich`). @@ -442,7 +442,7 @@ Failure data can accumulate in a data stream over time. To help manage this accu ### Failure store rollover [manage-failure-store-rollover] -A data stream treats its failure store much like a secondary set of [backing indices](failure-store#wip). Multiple dedicated hidden indices serve search requests for the failure store, while one index acts as the current write index. You can use the [rollover](failure-store#wip) API to rollover the failure store. Much like the regular indices in a data stream, a new write index will be created in the failure store to accept new failure documents. +A data stream treats its failure store much like a secondary set of [backing indices](#wip). Multiple dedicated hidden indices serve search requests for the failure store, while one index acts as the current write index. You can use the [rollover](#wip) API to roll over the failure store. Much like the regular indices in a data stream, a new write index will be created in the failure store to accept new failure documents. ```console POST my-datastream::failures/_rollover @@ -463,7 +463,7 @@ POST my-datastream::failures/_rollover ### Failure store lifecycle [manage-failure-store-lifecycle] -Failure stores have their retention managed using an internal [data stream lifecycle](failure-store#wip). A thirty day (30d) retention is applied to failure store data.
You can view the active lifecycle for a failure store index by calling the [get data stream API](failure-store#wip): +Failure stores have their retention managed using an internal [data stream lifecycle](#wip). A thirty-day (30d) retention is applied to failure store data. You can view the active lifecycle for a failure store index by calling the [get data stream API](#wip): ```console GET _data_stream/my-datastream @@ -526,7 +526,7 @@ GET _data_stream/my-datastream 4. The retention is currently determined by the default. :::{note} -The default retention respects any maximum retention values. If [maximum retention](failure-store#wip) is configured lower than thirty days then the maximum retention will be used as the default value. +The default retention respects any maximum retention values. If [maximum retention](#wip) is configured lower than thirty days, then the maximum retention will be used as the default value. ::: You can update the default retention period for failure stores in your deployment by updating the `data_streams.lifecycle.retention.failures_default` cluster setting. Data streams that have no retention configured on their failure stores will use this value to determine their retention period. @@ -540,7 +540,7 @@ PUT _cluster/settings } ``` -You can also specify the failure store retention period for a data stream on its data stream options. These can be specified via the index template for new data streams, or via the [put data stream options](failure-store#wip) API for existing data streams. +You can also set the failure store retention period for a data stream in its data stream options. This can be done via the index template for new data streams, or via the [put data stream options](#wip) API for existing data streams.
```console PUT _data_stream/my-datastream/_options @@ -558,7 +558,7 @@ PUT _data_stream/my-datastream/_options ### Add and remove from failure store [manage-failure-store-indices] -Failure stores support adding and removing indices from them using the [modify data stream](failure-store#wip) API. +Failure stores support adding and removing indices from them using the [modify data stream](#wip) API. ```console POST _data_stream/_modify From 075abe26a898cbbde82092590877bf946e245eba Mon Sep 17 00:00:00 2001 From: James Baiera Date: Tue, 6 May 2025 15:44:45 -0400 Subject: [PATCH 05/17] fix --- .../data-store/data-streams/failure-store.md | 28 +++++++++---------- 1 file changed, 14 insertions(+), 14 deletions(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index d9e9c99d4..beb26bcf0 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -47,7 +47,7 @@ After a matching data stream is created, its failure store will be enabled. Enabling the failure store via [index templates](../templates.md) can only affect data streams that are newly created. Existing data streams that use a template will not apply any changes to the template's `data_stream_options` after they have been created. -To modify an existing data stream's options, use the [put data stream options](#wip) API: +To modify an existing data stream's options, use the [put data stream options](./failure-store.md) API: ```console PUT my-datastream-existing/_options @@ -76,7 +76,7 @@ PUT my-datastream-existing/_options ### Enable failure store via cluster setting [set-up-failure-store-cluster-setting] -If you have a large number of existing data streams you may want an easier way to control if failures should be redirected. 
Instead of enabling the failure store using the [put data stream options](#wip) API, you can instead configure a set of patterns in the [cluster settings](#wip) which will enable the failure store feature by default. +If you have a large number of existing data streams, you may want an easier way to control whether failures should be redirected. Instead of enabling the failure store using the [put data stream options](./failure-store.md) API, you can configure a set of patterns in the [cluster settings](./failure-store.md) that will enable the failure store feature by default. Configure a list of patterns using the `data_streams.failure_store.enabled` dynamic cluster setting. If a data stream matches a pattern in this setting and does not have the failure store explicitly disabled in its options, then the failure store will default to being enabled for that matching data stream. @@ -101,7 +101,7 @@ Once a failure store is enabled for a data stream it will begin redirecting docu Each data stream's failure store is made up of a list of indices that are dedicated to storing failed documents. These indices function much like a data stream's normal backing indices: there is a write index that accepts failed documents, the indices can be rolled over, and they are automatically cleaned up over time subject to a lifecycle policy. -When a document bound for a data stream encounters a problem during its ingestion, the response is annotated with the `failure_store` field which describes how {{es}} responded to that problem. The `failure_store` field is present on both the [bulk](#wip) and [index](#wip) API responses when applicable. Clients can use this information to augment their behavior based on the response from {{es}}. +When a document bound for a data stream encounters a problem during its ingestion, the response is annotated with the `failure_store` field, which describes how {{es}} responded to that problem.
The `failure_store` field is present on both the [bulk](./failure-store.md) and [index](./failure-store.md) API responses when applicable. Clients can use this information to augment their behavior based on the response from {{es}}. Here we have a bulk operation that sends two documents. Both write to the `id` field, which is mapped as a `long` field type. The first document will be accepted, but the second document would cause a failure because the value `invalid_text` cannot be parsed as a `long`. This second document will be redirected to the failure store: @@ -263,14 +263,14 @@ Once you have accumulated some failures, they can be searched much like a regula :::{warning} Documents redirected to the failure store in the event of a failed ingest pipeline will be stored in their original, unprocessed form. If an ingest pipeline normally redacts sensitive information from a document, then failed documents in their original, unprocessed form may contain sensitive information. -Furthermore, failed documents are likely to be structured differently than normal data in a data stream, and thus are not supported by [document level security](#wip) or [field level security](#wip). +Furthermore, failed documents are likely to be structured differently than normal data in a data stream, and thus are not supported by [document level security](./failure-store.md) or [field level security](./failure-store.md). -To limit visibility on potentially sensitive data, users require the [`read_failure_store`](#wip) index privilege for a data stream in order to search that data stream's failure store data. +To limit visibility of potentially sensitive data, users require the [`read_failure_store`](./failure-store.md) index privilege for a data stream to search that data stream's failure store data. ::: Searching a data stream's failure store can be done by making use of the existing search APIs available in {{es}}.
-To indicate that the search should be performed on failure store data, use the [index component selector syntax](#wip) to indicate which part of the data stream to target in the search operation. Appending the `::failures` suffix to the name of the data stream indicates that the operation should be performed against that data stream's failure store instead of its regular backing indices.
+To search failure store data, use the [index component selector syntax](./failure-store.md) to indicate which part of the data stream to target in the search operation. Appending the `::failures` suffix to the name of the data stream indicates that the operation should be performed against that data stream's failure store instead of its regular backing indices.
 
 
 :::::{tab-set}
 
@@ -409,7 +409,7 @@ Failure documents have a uniform structure that is handled internally by {{es}}.
 : (`keyword`) The index that the document was being written to when it failed.
 
 `document.source`
- : (unmapped object) The body of the document. This field is unmapped and unindexed to ensure failures are indexed reliably. If you need to include fields from the document source in your queries, use [runtime fields](#wip) on the search request.
+ : (unmapped object) The body of the document. This field is unmapped and unindexed to ensure failures are indexed reliably. If you need to include fields from the document source in your queries, use [runtime fields](./failure-store.md) on the search request.
 
 `error`
 : (`object`) Information about the failure that prevented this document from being indexed.
@@ -430,7 +430,7 @@ Failure documents have a uniform structure that is handled internally by {{es}}.
 : (`keyword`, optional array) If the failure occurred in an ingest pipeline, this will contain the list of pipelines that the document had visited up until the failure.
 `error.processor_tag`
- : (`keyword`, optional) If the failure occurred in an ingest processor that is annotated with a [tag](#wip), the tag contents will be present here.
+ : (`keyword`, optional) If the failure occurred in an ingest processor that is annotated with a [tag](./failure-store.md), the tag contents will be present here.
 
 `error.processor_type`
 : (`keyword`, optional) If the failure occurred in an ingest processor, this will contain the processor type. (e.g. `script`, `append`, `enrich`, etc.)
@@ -442,7 +442,7 @@ Failure data can accumulate in a data stream over time. To help manage this accu
 
 ### Failure store rollover [manage-failure-store-rollover]
 
-A data stream treats its failure store much like a secondary set of [backing indices](#wip). Multiple dedicated hidden indices serve search requests for the failure store, while one index acts as the current write index. You can use the [rollover](#wip) API to rollover the failure store. Much like the regular indices in a data stream, a new write index will be created in the failure store to accept new failure documents.
+A data stream treats its failure store much like a secondary set of [backing indices](./failure-store.md). Multiple dedicated hidden indices serve search requests for the failure store, while one index acts as the current write index. You can use the [rollover](./failure-store.md) API to roll over the failure store. Much like the regular indices in a data stream, a new write index will be created in the failure store to accept new failure documents.
 
 ```console
 POST my-datastream::failures/_rollover
@@ -463,7 +463,7 @@ POST my-datastream::failures/_rollover
 
 ### Failure store lifecycle [manage-failure-store-lifecycle]
 
-Failure stores have their retention managed using an internal [data stream lifecycle](#wip). A thirty day (30d) retention is applied to failure store data.
You can view the active lifecycle for a failure store index by calling the [get data stream API](#wip):
+Failure stores have their retention managed using an internal [data stream lifecycle](./failure-store.md). A thirty-day (30d) retention is applied to failure store data. You can view the active lifecycle for a failure store index by calling the [get data stream API](./failure-store.md):
 
 ```console
 GET _data_stream/my-datastream
@@ -526,7 +526,7 @@ GET _data_stream/my-datastream
 4. The retention is currently determined by the default.
 
 :::{note}
-The default retention respects any maximum retention values. If [maximum retention](#wip) is configured lower than thirty days then the maximum retention will be used as the default value.
+The default retention respects any maximum retention values. If [maximum retention](./failure-store.md) is configured lower than thirty days then the maximum retention will be used as the default value.
 :::
 
 You can update the default retention period for failure stores in your deployment by updating the `data_streams.lifecycle.retention.failures_default` cluster setting. Data streams that have no retention configured on their failure stores will use this value to determine their retention period.
@@ -540,7 +540,7 @@ PUT _cluster/settings
 }
 ```
 
-You can also specify the failure store retention period for a data stream on its data stream options. These can be specified via the index template for new data streams, or via the [put data stream options](#wip) API for existing data streams.
+You can also specify the failure store retention period for a data stream on its data stream options. These can be specified via the index template for new data streams, or via the [put data stream options](./failure-store.md) API for existing data streams.
```console PUT _data_stream/my-datastream/_options @@ -558,12 +558,12 @@ PUT _data_stream/my-datastream/_options ### Add and remove from failure store [manage-failure-store-indices] -Failure stores support adding and removing indices from them using the [modify data stream](#wip) API. +Failure stores support adding and removing indices from them using the [modify data stream](./failure-store.md) API. ```console POST _data_stream/_modify { - "actions":[ + "actions":[ { "remove_backing_index": { <1> "data_stream": "my-datastream", From eac400518baacf1454fce36baccc859bc504feef Mon Sep 17 00:00:00 2001 From: James Baiera Date: Thu, 8 May 2025 15:43:39 -0400 Subject: [PATCH 06/17] Update manage-data/data-store/data-streams/failure-store.md Co-authored-by: Lee Hinman --- manage-data/data-store/data-streams/failure-store.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index beb26bcf0..92750a1c9 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -6,7 +6,7 @@ applies_to: # Failure store [failure-store] -Failure stores are a secondary set of indices inside a data stream dedicated to storing failed documents. Failed documents are any documents that cause ingest pipeline exceptions or have a structure that conflicts with a data stream's mappings. These failures normally cause the indexing operation to fail, returning the error message in the response. When a data stream's failure store is enabled, these failures are instead captured and persisted to be analysed later, returning a successful response to the client in the meantime. +Failure stores are a secondary set of indices inside a data stream dedicated to storing failed documents. Failed documents are any documents that cause ingest pipeline exceptions or have a structure that conflicts with a data stream's mappings. 
These failures normally cause the indexing operation to fail, returning the error message in the response. When a data stream's failure store is enabled, these failures are instead captured in a separate index and persisted to be analysed later, returning a successful response to the client in the meantime. ## Set up a data stream failure store [set-up-failure-store] From dfbefc1ff9d349e39fc9537ab8f1b6eec883faaa Mon Sep 17 00:00:00 2001 From: James Baiera Date: Thu, 8 May 2025 15:49:30 -0400 Subject: [PATCH 07/17] clarify that we do not redirect backpressure or version conflicts --- manage-data/data-store/data-streams/failure-store.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index 92750a1c9..17cefdf2e 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -6,7 +6,7 @@ applies_to: # Failure store [failure-store] -Failure stores are a secondary set of indices inside a data stream dedicated to storing failed documents. Failed documents are any documents that cause ingest pipeline exceptions or have a structure that conflicts with a data stream's mappings. These failures normally cause the indexing operation to fail, returning the error message in the response. When a data stream's failure store is enabled, these failures are instead captured in a separate index and persisted to be analysed later, returning a successful response to the client in the meantime. +Failure stores are a secondary set of indices inside a data stream dedicated to storing failed documents. Failed documents are any documents that cause ingest pipeline exceptions or have a structure that conflicts with a data stream's mappings. These failures normally cause the indexing operation to fail, returning the error message in the response. 
When a data stream's failure store is enabled, these failures are instead captured in a separate index and persisted to be analysed later, returning a successful response to the client in the meantime. Failure stores do not capture failures caused by backpressure or document version conflicts. These failures are always returned as-is since they warrant specific action by the client. ## Set up a data stream failure store [set-up-failure-store] @@ -588,7 +588,7 @@ POST _data_stream/_modify 5. The name of an index that should be added to the failure store. 6. Set `failure_store` to true to have the modify API target operate on the data stream's failure store. -This API gives you fine grained control over the indices in your failure store, allowing you to manage backup and restoration operations as well as isolate failure data for later remediation. +This API gives you fine-grained control over the indices in your failure store, allowing you to manage backup and restoration operations as well as isolate failure data for later remediation. ## Failure store recipes and use cases [recipes] From 20193676da9877a81e266699e82cb38bf6dd7a0b Mon Sep 17 00:00:00 2001 From: James Baiera Date: Thu, 8 May 2025 15:52:19 -0400 Subject: [PATCH 08/17] Update manage-data/data-store/data-streams/failure-store.md Co-authored-by: Lee Hinman --- manage-data/data-store/data-streams/failure-store.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index 17cefdf2e..fe989e77f 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -17,7 +17,7 @@ Each data stream has its own failure store that can be enabled to accept failure You can specify on a data stream's template if it should enable the failure store when it is first created. 
The `data_stream_options` field in a [template](../templates.md) contains the settings required to enable a data stream's failure store. :::{note} -Unlike the `settings` and `mappings` fields on an [index template](../templates.md) which are repeatedly applied to new data stream write indices over time, the `data_stream_options` section of a template is applied to a data stream only once when the data stream is first created. To configure existing data streams, use the put data stream options API. +Unlike the `settings` and `mappings` fields on an [index template](../templates.md) which are repeatedly applied to new data stream write indices on rollover, the `data_stream_options` section of a template is applied to a data stream only once when the data stream is first created. To configure existing data streams, use the put data stream options API. ::: To enable the failure store on a new data stream, enable it in the `data_stream_options` of the template: From 2847fe94c8bb03b2f771a1bcaf6ea161623c3dc0 Mon Sep 17 00:00:00 2001 From: James Baiera Date: Thu, 8 May 2025 15:52:29 -0400 Subject: [PATCH 09/17] Update manage-data/data-store/data-streams/failure-store.md Co-authored-by: Lee Hinman --- manage-data/data-store/data-streams/failure-store.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index fe989e77f..2c6d274a9 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -61,7 +61,7 @@ PUT my-datastream-existing/_options 1. The failure store option will now be enabled. -The failure store redirection can be suspended using this API as well. When the failure store is disabled, only failed document redirection is halted. Any existing failure data in the data stream will remain until removed by deletion or by retention. 
+The failure store redirection can be suspended using this API as well. When the failure store is disabled, only failed document redirection is halted. Any existing failure data in the data stream will remain until removed by manual deletion or by retention. ```console PUT my-datastream-existing/_options From 840a8bef9d7495e55cdbd6229e1142f8baf4bd50 Mon Sep 17 00:00:00 2001 From: James Baiera Date: Thu, 8 May 2025 15:57:46 -0400 Subject: [PATCH 10/17] suspended -> disabled --- manage-data/data-store/data-streams/failure-store.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index 2c6d274a9..6d9ff1567 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -61,7 +61,7 @@ PUT my-datastream-existing/_options 1. The failure store option will now be enabled. -The failure store redirection can be suspended using this API as well. When the failure store is disabled, only failed document redirection is halted. Any existing failure data in the data stream will remain until removed by manual deletion or by retention. +The failure store redirection can be disabled using this API as well. When the failure store is deactivated, only failed document redirection is halted. Any existing failure data in the data stream will remain until removed by manual deletion or by retention. 
```console PUT my-datastream-existing/_options From 1591e7bbbf8f132dff0995b91b729006d3545885 Mon Sep 17 00:00:00 2001 From: James Baiera Date: Thu, 8 May 2025 16:27:52 -0400 Subject: [PATCH 11/17] reword document.source docs --- manage-data/data-store/data-streams/failure-store.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index 6d9ff1567..b7e091a16 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -409,7 +409,7 @@ Failure documents have a uniform structure that is handled internally by {{es}}. : (`keyword`) The index that the document was being written to when it failed. `document.source` - : (unmapped object) The body of the document. This field is unmapped and unindexed to ensure failures are indexed reliably. If you need to include fields from the document source in your queries, use [runtime fields](./failure-store.md) on the search request. + : (unmapped object) The body of the original document. This field is unmapped and only present in the failure document's source. This prevents mapping conflicts in the failure store when redirecting failed documents. If you need to include fields from the original document's source in your queries, use [runtime fields](./failure-store.md) on the search request. `error` : (`object`) Information about the failure that prevented this document from being indexed. 
From cec305f90adc5f0426147e91cadcadf35b6140b1 Mon Sep 17 00:00:00 2001 From: James Baiera Date: Thu, 8 May 2025 16:29:37 -0400 Subject: [PATCH 12/17] clarify default retention application --- manage-data/data-store/data-streams/failure-store.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index b7e091a16..d6e9dcca3 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -529,7 +529,7 @@ GET _data_stream/my-datastream The default retention respects any maximum retention values. If [maximum retention](./failure-store.md) is configured lower than thirty days then the maximum retention will be used as the default value. ::: -You can update the default retention period for failure stores in your deployment by updating the `data_streams.lifecycle.retention.failures_default` cluster setting. Data streams that have no retention configured on their failure stores will use this value to determine their retention period. +You can update the default retention period for failure stores in your deployment by updating the `data_streams.lifecycle.retention.failures_default` cluster setting. New and existing data streams that have no retention configured on their failure stores will use this value to determine their retention period. 
 ```console
 PUT _cluster/settings

From f1cb727ab544fe408ed0bd25b18debed997f3ea4 Mon Sep 17 00:00:00 2001
From: James Baiera
Date: Thu, 8 May 2025 16:32:50 -0400
Subject: [PATCH 13/17] Add placeholder link for put data streams api

---
 manage-data/data-store/data-streams/failure-store.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md
index d6e9dcca3..9c9413905 100644
--- a/manage-data/data-store/data-streams/failure-store.md
+++ b/manage-data/data-store/data-streams/failure-store.md
@@ -17,7 +17,7 @@ Each data stream has its own failure store that can be enabled to accept failure
 You can specify on a data stream's template if it should enable the failure store when it is first created. The `data_stream_options` field in a [template](../templates.md) contains the settings required to enable a data stream's failure store.
 
 :::{note}
-Unlike the `settings` and `mappings` fields on an [index template](../templates.md) which are repeatedly applied to new data stream write indices on rollover, the `data_stream_options` section of a template is applied to a data stream only once when the data stream is first created. To configure existing data streams, use the put data stream options API.
+Unlike the `settings` and `mappings` fields on an [index template](../templates.md) which are repeatedly applied to new data stream write indices on rollover, the `data_stream_options` section of a template is applied to a data stream only once when the data stream is first created. To configure existing data streams, use the [put data stream options API](./failure-store.md).
 :::
 
 To enable the failure store on a new data stream, enable it in the `data_stream_options` of the template:

From 28287fe40db256eb68c0f7dfa57162fa5069bd53 Mon Sep 17 00:00:00 2001
From: James Baiera
Date: Thu, 8 May 2025 17:10:31 -0400
Subject: [PATCH 14/17] Mention flags in the intro

---
 manage-data/data-store/data-streams/failure-store.md | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md
index 9c9413905..761c0de09 100644
--- a/manage-data/data-store/data-streams/failure-store.md
+++ b/manage-data/data-store/data-streams/failure-store.md
@@ -6,7 +6,9 @@ applies_to:
 
 # Failure store [failure-store]
 
-Failure stores are a secondary set of indices inside a data stream dedicated to storing failed documents. Failed documents are any documents that cause ingest pipeline exceptions or have a structure that conflicts with a data stream's mappings. These failures normally cause the indexing operation to fail, returning the error message in the response. When a data stream's failure store is enabled, these failures are instead captured in a separate index and persisted to be analysed later, returning a successful response to the client in the meantime. Failure stores do not capture failures caused by backpressure or document version conflicts. These failures are always returned as-is since they warrant specific action by the client.
+Failure stores are a secondary set of indices inside a data stream dedicated to storing failed documents. Failed documents are any documents that cause ingest pipeline exceptions or have a structure that conflicts with a data stream's mappings. These failures normally cause the indexing operation to fail, returning the error message in the response.
+
+When a data stream's failure store is enabled, these failures are instead captured in a separate index and persisted to be analyzed later.
Clients receive a successful response with a flag indicating the failure was redirected. Failure stores do not capture failures caused by backpressure or document version conflicts. These failures are always returned as-is since they warrant specific action by the client. ## Set up a data stream failure store [set-up-failure-store] From 08b2ac31923ebc65010a7a59e17e881b846ae867 Mon Sep 17 00:00:00 2001 From: James Baiera Date: Thu, 8 May 2025 17:15:12 -0400 Subject: [PATCH 15/17] Note lazy creation --- manage-data/data-store/data-streams/failure-store.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index 761c0de09..c65bdd5fd 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -101,7 +101,7 @@ The failure store is meant to ease the burden of detecting and handling failures Once a failure store is enabled for a data stream it will begin redirecting documents that fail due to common ingestion problems instead of returning errors in write operations. Clients are notified in a non-intrusive way when a document is redirected to the failure store. -Each data stream's failure store is made up of a list of indices that are dedicated to storing failed documents. These indices function much like a data stream's normal backing indices: There is a write index that accepts failed documents, they can be rolled over, and are automatically cleaned up over time subject to a lifecycle policy. +Each data stream's failure store is made up of a list of indices that are dedicated to storing failed documents. These failure indices function much like a data stream's normal backing indices: There is a write index that accepts failed documents, they can be rolled over, and are automatically cleaned up over time subject to a lifecycle policy. 
Failure indices are lazily created the first time they are needed to store a failed document. When a document bound for a data stream encounters a problem during its ingestion, the response is annotated with the `failure_store` field which describes how {{es}} responded to that problem. The `failure_store` field is present on both the [bulk](./failure-store.md) and [index](./failure-store.md) API responses when applicable. Clients can use this information to augment their behavior based on the response from {{es}}. From e978f6e9974fa9a565a458b5abe5013a9c6e15c5 Mon Sep 17 00:00:00 2001 From: James Baiera Date: Fri, 9 May 2025 01:42:24 -0400 Subject: [PATCH 16/17] edit configuration section for more clarity --- .../data-store/data-streams/failure-store.md | 30 +++++++++++++++---- 1 file changed, 24 insertions(+), 6 deletions(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index c65bdd5fd..1534bd666 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -52,7 +52,7 @@ Enabling the failure store via [index templates](../templates.md) can only affec To modify an existing data stream's options, use the [put data stream options](./failure-store.md) API: ```console -PUT my-datastream-existing/_options +PUT _data_stream/my-datastream-existing/_options { "failure_store": { "enabled": true <1> @@ -66,7 +66,7 @@ PUT my-datastream-existing/_options The failure store redirection can be disabled using this API as well. When the failure store is deactivated, only failed document redirection is halted. Any existing failure data in the data stream will remain until removed by manual deletion or by retention. 
```console -PUT my-datastream-existing/_options +PUT _data_stream/my-datastream-existing/_options { "failure_store": { "enabled": false <1> @@ -78,9 +78,7 @@ PUT my-datastream-existing/_options ### Enable failure store via cluster setting [set-up-failure-store-cluster-setting] -If you have a large number of existing data streams you may want an easier way to control if failures should be redirected. Instead of enabling the failure store using the [put data stream options](./failure-store.md) API, you can instead configure a set of patterns in the [cluster settings](./failure-store.md) which will enable the failure store feature by default. - -Configure a list of patterns using the `data_streams.failure_store.enabled` dynamic cluster setting. If a data stream matches a pattern in this setting and does not have the failure store explicitly disabled in its options, then the failure store will default to being enabled for that matching data stream. +If you have a large number of existing data streams you may want to enable their failure stores in one place. Instead of updating each of their options individually, set `data_streams.failure_store.enabled` to a list of index patterns in the [cluster settings](./failure-store.md). Any data streams that match one of these patterns will operate with their failure store enabled. ```console PUT _cluster/settings @@ -90,9 +88,29 @@ PUT _cluster/settings } } ``` - 1. Indices that match `my-datastream-*` or `logs-*` will redirect failures to the failure store unless explicitly disabled. +Matching data streams will ignore this configuration if the failure store is explicitly enabled or disabled in their [data stream options](./failure-store.md). + +```console +PUT _cluster/settings +{ + "persistent" : { + "data_streams.failure_store.enabled" : [ "my-datastream-*", "logs-*" ] <1> + } +} +``` +```console +PUT _data_stream/my-datastream-1/_options +{ + "failure_store": { + "enabled": false <2> + } +} +``` +1. 
Enabling the failure stores for `my-datastream-*` and `logs-*` +2. The failure store for `my-datastream-1` is disabled even though it matches `my-datastream-*`. + ## Using a failure store [use-failure-store] The failure store is meant to ease the burden of detecting and handling failures when ingesting data to {{es}}. Clients are less likely to encounter unrecoverable failures when writing documents, and developers are more easily able to troubleshoot faulty pipelines and mappings. From 2db178a1e0dc2e3bab149da96733c8b89a6865ef Mon Sep 17 00:00:00 2001 From: James Baiera Date: Fri, 9 May 2025 01:48:01 -0400 Subject: [PATCH 17/17] mention override directly --- manage-data/data-store/data-streams/failure-store.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/manage-data/data-store/data-streams/failure-store.md b/manage-data/data-store/data-streams/failure-store.md index 1534bd666..fa2fb0b35 100644 --- a/manage-data/data-store/data-streams/failure-store.md +++ b/manage-data/data-store/data-streams/failure-store.md @@ -109,7 +109,7 @@ PUT _data_stream/my-datastream-1/_options } ``` 1. Enabling the failure stores for `my-datastream-*` and `logs-*` -2. The failure store for `my-datastream-1` is disabled even though it matches `my-datastream-*`. +2. The failure store for `my-datastream-1` is disabled even though it matches `my-datastream-*`. The data stream options override the cluster setting. ## Using a failure store [use-failure-store]
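Once the failure store is enabled through either the cluster setting or the data stream options, redirected failures can be inspected with the `::failures` selector covered earlier on this page. A minimal sketch, reusing the `my-datastream-1` stream from the example above:

```console
GET my-datastream-1::failures/_search
{
  "query": {
    "match_all": {}
  }
}
```

This returns the failure documents captured for the stream, each carrying the original `document.source` along with the associated `error` details.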