Overview
PR #344 introduced the flagd-api-testkit with a dedicated evaluator test suite in evaluator/gherkin/.
This issue tracks the migration of all SDKs to use this new testkit and eliminates duplicated evaluation
tests currently scattered across individual SDK repositories.
Current State
Goals
- Migrate all SDKs to use the centralized evaluator testkit from flagd-testbed
- Remove duplicate evaluation test definitions from each SDK
- Keep only SDK-specific integration/provider tests in each repository
- Establish clear boundaries: evaluation tests (testkit) vs. integration tests (SDK repo)
Required Changes Per SDK
For each SDK (Java, Go, JS, .NET, Python, etc.):
Example Structure
SDK Repository
├── tests/integration/ # Only provider/connection/config tests
│ └── gherkin/
├── flagd-testbed/ # Git submodule
│ └── evaluator/gherkin/ # Shared evaluator tests (via testkit)
Benefits
- ✅ Single source of truth for evaluator tests
- ✅ Faster CI/CD (evaluator tests run without Docker where possible)
- ✅ Reduced maintenance burden
- ✅ Consistent evaluation behavior across all SDKs
Out of Scope (for this issue)
- Creating new features or test scenarios
- Changes to the evaluator testkit itself (separate issues/PRs)
Overview
PR #344 introduced the
flagd-api-testkitwith a dedicated evaluator test suite inevaluator/gherkin/.This issue tracks the migration of all SDKs to use this new testkit and eliminates duplicated evaluation
tests currently scattered across individual SDK repositories.
Current State
Goals
Required Changes Per SDK
For each SDK (Java, Go, JS, .NET, Python, etc.):
Example Structure
Benefits
Out of Scope (for this issue)