Skip to content

Commit b2122dd

Browse files
ci: replace kimi-k2-thinking with kimi-k2.6 in integration test defaults (#3102)
Co-authored-by: openhands <openhands@all-hands.dev>
1 parent 12ec83f commit b2122dd

2 files changed

Lines changed: 2 additions & 2 deletions

File tree

.github/workflows/integration-runner.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -50,7 +50,7 @@ on:
5050
env:
5151
N_PROCESSES: 4 # Global configuration for number of parallel processes for evaluation
5252
# Default models for scheduled/label-triggered runs (subset of models from resolve_model_config.py)
53-
DEFAULT_MODEL_IDS: claude-sonnet-4-6,deepseek-v3.2-reasoner,kimi-k2-thinking,gemini-3.1-pro
53+
DEFAULT_MODEL_IDS: claude-sonnet-4-6,deepseek-v4-flash,kimi-k2.6,gemini-3.1-pro
5454

5555
jobs:
5656
setup-matrix:

tests/integration/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -72,7 +72,7 @@ Defined in `.github/workflows/integration-runner.yml`, this workflow runs integr
7272
2. **Manual Trigger**: Via workflow dispatch with a required reason
7373
3. **Scheduled Runs**: Daily at 10:30 PM UTC (cron: `30 22 * * *`)
7474

75-
**Test Coverage:** Runs across 6 LLM models (Claude Sonnet 4.5, GPT-5.1 Codex Max, Deepseek, Kimi K2, Gemini 3.1 Pro, Devstral 2512)
75+
**Test Coverage:** Runs across 4 LLM models (Claude Sonnet 4.6, DeepSeek V4 Flash, Kimi K2.6, Gemini 3.1 Pro)
7676

7777
### Condenser Tests Workflow
7878

0 commit comments

Comments
 (0)