Skip to content

SMA [26] Data migration strategy evals#45

Open
seanoc5 wants to merge 3 commits into
mainfrom
feature/sma-26-data-migration-eval
Open

SMA [26] Data migration strategy evals#45
seanoc5 wants to merge 3 commits into
mainfrom
feature/sma-26-data-migration-eval

Conversation

@seanoc5
Copy link
Copy Markdown
Collaborator

@seanoc5 seanoc5 commented Apr 27, 2026

Summary

  • 3 eval tests for data migration strategy (SolrReader/RFS recommendation, no-backup fallback, plain-language)
  • Clarifies RFS provenance in 04-architecture.md (originally ES→OS; SolrReader extends for Solr)
  • LLM rubric assertions only (no keyword checks)

Test plan

  • tests/scripts/run_evals.sh tests/evals/eval-blue-sky.yaml --filter-pattern "^datamig-01" passes
  • grep -c 'contains-any' tests/evals/eval-blue-sky.yaml returns 0
  • YAML validates: python3 -c "import yaml; yaml.safe_load(open('tests/evals/eval-blue-sky.yaml'))"

🤖 Generated with Claude Code

seanoc5 and others added 3 commits April 19, 2026 16:38
Update 04-architecture.md to recommend SolrReader/RFS as the primary
data migration approach when Solr backups are available. Add two eval
tests (datamig-*) verifying the advisor recommends SolrReader rather
than generic Bulk API advice.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…rtions with LLM rubrics, add less-technical test

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- Add response_format: json_object to judge config (prevents flaky
  JSON parse errors, per promptfoo#2084)
- Document max_tokens: 512 decision with guidance for increasing
- Add trailing newlines

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants