Add counterfactual dataset #2327
CI.yml
on: pull_request
select-category
18s
lint-and-test
28s
Matrix: mock-evaluation
summarize-results
/
Results
18s
Annotations
1 error and 4 warnings
|
lint-and-test
Process completed with exit code 1.
|
|
lint-and-test
Node.js 20 is deprecated. The following actions target Node.js 20 but are being forced to run on Node.js 24: actions/cache@v4. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
|
|
bcbench.results.base
Result for microsoftInternal__NAV-192565__cf-1 missing metrics: execution_time, llm_duration, turn_count, prompt_tokens, completion_tokens, tool_usage
|
|
bcbench.results.base
Result for microsoftInternal__NAV-213683__cf-1 missing metrics: prompt_tokens, completion_tokens
|
|
bcbench.results.base
Result for microsoftInternal__NAV-214926__cf-1 missing metrics: turn_count, completion_tokens
|
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
evaluation-summary
Expired
|
564 Bytes |
sha256:34687b780d8095fc59b28ce5dad13c458f978a4c74abde9e3f488b650b36acb6
|
|
|
microsoftInternal__NAV-192565__cf-1
Expired
|
471 Bytes |
sha256:8f32339564f7e4c5cc2eab3c07b5b85e8b653757c8fa0638b4d788396f0c7fdb
|
|
|
microsoftInternal__NAV-210528__cf-1
Expired
|
574 Bytes |
sha256:1ee75c4560f7a807e1b5f71b4ac7689fd7e9918e43f1aa82bdfd05af2372a14e
|
|
|
microsoftInternal__NAV-213683__cf-1
Expired
|
548 Bytes |
sha256:4adf48809c3ff3b00afaa0fac782040209ebe3879968d6187f0d5cf286469082
|
|
|
microsoftInternal__NAV-214926__cf-1
Expired
|
563 Bytes |
sha256:99ef36d0baf29a46dcf6d97e1282ac0fb19a52a324cbc8ff1fc775c3a4ebd480
|
|