[FEATURE] Add OpenSearchProvider for evaluating traces stored in OpenSearch

## Problem Statement

There is currently no way to run strands-evals against agent traces stored in OpenSearch. This affects users of [observability-stack](https://github.com/opensearch-project/observability-stack), [agent-health](https://github.com/opensearch-project/agent-health), Amazon OpenSearch Service, or anyone else using OpenSearch as their storage backend for OTel GenAI agent traces.

## Proposed Solution

Add `OpenSearchProvider(TraceProvider)` and `OpenSearchSessionMapper(SessionMapper)`, following the same pattern as [CloudWatchProvider](https://github.com/strands-agents/evals/blob/main/src/strands_evals/providers/cloudwatch_provider.py) and [LangfuseProvider](https://github.com/strands-agents/evals/blob/main/src/strands_evals/providers/langfuse_provider.py).

**OpenSearchProvider**
- Wraps [`OpenSearchTraceRetriever`](https://github.com/opensearch-project/genai-observability-sdk-py) from `opensearch-genai-observability-sdk-py` for querying and auth (basic, SigV4, none)
- Implements `get_evaluation_data(session_id)` returning `TaskOutput`
- Queries by conversation ID or trace ID (retriever handles both)

**OpenSearchSessionMapper**
- Converts genai-sdk `SpanRecord` objects to strands-evals `Session`/`Trace`/`Span` types
- Maps `invoke_agent` to `AgentInvocationSpan`, `execute_tool` to `ToolExecutionSpan`, `chat` to `InferenceSpan`
- Scopes tool attribution to parent agent spans for correct multi-agent behavior
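
The mapping and scoping rules above can be sketched as follows. This is a minimal illustration, not the code in the PR: `RawSpan`, `nearest_agent`, and `tools_by_agent` are hypothetical names, and the span type names are kept as plain strings so the sketch runs without strands-evals installed. It assumes each span carries its operation name (`invoke_agent`, `execute_tool`, `chat`) and a parent span ID.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class RawSpan:
    """Hypothetical stand-in for a retrieved span record."""
    span_id: str
    parent_id: Optional[str]
    operation: str  # e.g. "invoke_agent", "execute_tool", "chat"
    name: str


# Operation name -> strands-evals span type, as listed in the bullets above.
SPAN_TYPE_BY_OPERATION = {
    "invoke_agent": "AgentInvocationSpan",
    "execute_tool": "ToolExecutionSpan",
    "chat": "InferenceSpan",
}


def nearest_agent(span: RawSpan, by_id: Dict[str, RawSpan]) -> Optional[RawSpan]:
    """Walk up the parent chain until an invoke_agent span is found."""
    seen = set()  # guards against malformed traces with parent cycles
    current = by_id.get(span.parent_id) if span.parent_id else None
    while current is not None and current.span_id not in seen:
        if current.operation == "invoke_agent":
            return current
        seen.add(current.span_id)
        current = by_id.get(current.parent_id) if current.parent_id else None
    return None


def tools_by_agent(spans: List[RawSpan]) -> Dict[str, List[str]]:
    """Attribute each tool call to its closest enclosing agent span."""
    by_id = {s.span_id: s for s in spans}
    result: Dict[str, List[str]] = {
        s.span_id: [] for s in spans if s.operation == "invoke_agent"
    }
    for s in spans:
        if s.operation != "execute_tool":
            continue
        agent = nearest_agent(s, by_id)
        if agent is not None:
            result[agent.span_id].append(s.name)
    return result


# Two nested agents: the sub-agent's tool must not leak into the orchestrator.
spans = [
    RawSpan("a1", None, "invoke_agent", "orchestrator"),
    RawSpan("t1", "a1", "execute_tool", "search"),
    RawSpan("a2", "a1", "invoke_agent", "researcher"),
    RawSpan("c1", "a2", "chat", "chat"),
    RawSpan("t2", "c1", "execute_tool", "fetch_page"),
]
attribution = tools_by_agent(spans)
```

In this example `fetch_page` is attributed to the `researcher` agent rather than the outer `orchestrator`, which is the multi-agent behavior the last bullet describes.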

**Dependency**: `opensearch-genai-observability-sdk-py[opensearch]>=0.2.7` as an optional extra (`pip install strands-agents-evals[opensearch]`)
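
Because the dependency is optional, the provider module would likely need to fail with a clear message when the extra is missing. One possible guard is sketched below; `require_extra` is a hypothetical helper, and the module name passed to it is a placeholder, since the SDK's importable package name is not stated here.

```python
import importlib.util


def require_extra(module_name: str, extra: str) -> None:
    """Raise a helpful ImportError if an optional dependency is absent.

    `module_name` must be a top-level module name; `extra` is the
    pip extra that provides it.
    """
    if importlib.util.find_spec(module_name) is None:
        raise ImportError(
            f"'{module_name}' is required for this provider. "
            f"Install it with: pip install 'strands-agents-evals[{extra}]'"
        )


# Example call with a placeholder module name (an assumption, not the real one):
# require_extra("some_opensearch_sdk_module", "opensearch")
```

Checking at provider construction time, rather than at package import, keeps `strands_evals.providers` importable for users who never touch OpenSearch.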

## Use Case

### Local / development (basic auth)

```python
from strands_evals.providers import OpenSearchProvider
from strands_evals.evaluators import HelpfulnessEvaluator

provider = OpenSearchProvider(
    host="https://localhost:9200",
    auth=("admin", "password"),
    verify_certs=False,  # local development only; keep certificate verification on elsewhere
)

task_output = provider.get_evaluation_data(session_id="my-session")
evaluator = HelpfulnessEvaluator(model="us.anthropic.claude-sonnet-4-20250514-v1:0")
results = evaluator.evaluate(task_output)
```

### Amazon OpenSearch Service (SigV4)

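```python
import boto3
from opensearchpy import RequestsAWSV4SignerAuth

from strands_evals.providers import OpenSearchProvider

# Resolve AWS credentials from the default chain (env vars, profile, or role).
credentials = boto3.Session().get_credentials()
# "es" is the SigV4 service name for Amazon OpenSearch Service domains.
auth = RequestsAWSV4SignerAuth(credentials, "us-east-1", "es")
provider = OpenSearchProvider(
    host="https://my-domain.us-east-1.es.amazonaws.com",
    auth=auth,
)
```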

## Alternatives Considered

The provider could use `opensearch-py` directly instead of wrapping the genai-sdk (similar to how CloudWatchProvider uses boto3). The genai-sdk was chosen because it already handles Data Prepper's de-dotted field mappings and OTel message parsing, both of which are non-trivial to reimplement, as well as auth (basic, and SigV4 signing for AWS-managed OpenSearch). Open to the direct approach if maintainers prefer fewer transitive dependencies.
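
To give a sense of what the direct approach would have to reimplement, here is a rough sketch of undoing de-dotted attribute keys. It assumes, for illustration only, that dots in attribute names are stored with an `@` separator (so `gen_ai.operation.name` becomes `gen_ai@operation@name`); the separator and the helper names `re_dot` and `normalize_attributes` are assumptions and should be checked against the actual index mapping.

```python
from typing import Any, Dict


def re_dot(key: str, sep: str = "@") -> str:
    """Restore dots in a de-dotted attribute key (separator is an assumption)."""
    return key.replace(sep, ".")


def normalize_attributes(attrs: Dict[str, Any], sep: str = "@") -> Dict[str, Any]:
    """Return a copy of `attrs` with OTel-style dotted keys."""
    return {re_dot(key, sep): value for key, value in attrs.items()}


stored = {"gen_ai@operation@name": "chat", "gen_ai@request@model": "example-model"}
normalized = normalize_attributes(stored)
```

Key renaming is the easy part; parsing OTel GenAI message payloads and handling auth are the larger pieces the SDK already covers.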

## Prior Art

- [CloudWatchProvider](https://github.com/strands-agents/evals/pull/147)
- [LangfuseProvider](https://github.com/strands-agents/evals/pull/144)
- [TraceProvider interface](https://github.com/strands-agents/evals/pull/140) ([#97](https://github.com/strands-agents/evals/issues/97))

## Implementation

PR: #192

