Commit 7804055
[Inference] Evaluations framework (elastic#227453)
introduces `@kbn/evals` to enable offline evaluation suites for
LLM-based workflows in Kibana, using:
- `@kbn/scout` and Playwright for running the evaluations
- `@kbn/inference-plugin` for handling LLM interactions
- [Phoenix](https://github.com/Arize-ai/phoenix) for storing evaluations
See:
https://github.com/dgieselaar/kibana/blob/evaluation-runner/x-pack/platform/packages/shared/kbn-evals/README.md
Other changes in this PR:
- Implement support for `exports` in package.json in
`@kbn/import-resolver` (this causes some eslint-disables to no longer be
necessary)
- Graceful shutdown for OpenTelemetry span processors
- Add `scripts/playwright.js` script which sets up babel transforms,
tracing etc
Notes:
- used o3 to implement support for `exports` (and then iterated on it)
- used o3 to generate a
[README](https://github.com/dgieselaar/kibana/blob/evaluation-runner/x-pack/platform/packages/shared/kbn-evals/README.md)
(and made some minor changes)
Dependencies:
- Added @arizeai/phoenix-client to interact with Phoenix's API. The
alternative is using REST/GraphQL directly but there's a lot of
client-side logic in the SDK.
---------
Co-authored-by: kibanamachine <42973632+kibanamachine@users.noreply.github.com>
Co-authored-by: Tiago Costa <tiago.costa@elastic.co>
Co-authored-by: Tiago Costa <tiagoffcc@hotmail.com>
Co-authored-by: Viduni Wickramarachchi <viduni.ushanka@gmail.com>1 parent 8ea4ace commit 7804055
82 files changed
Lines changed: 3233 additions & 87 deletions
File tree
- .github
- scripts
- src
- cli
- core/packages/usage-data/server-internal/src
- dev/license_checker
- platform/packages
- private
- kbn-import-resolver/src
- __fixtures__/node_modules/exports-pkg
- dist
- sub
- integration_tests
- kbn-repo-packages
- modern
- setup_node_env
- x-pack
- platform
- plugins
- private
- discover_enhanced/test/scout/ui
- painless_lab/test/scout/ui
- solutions/observability
- packages/kbn-evals-suite-obs-ai-assistant
- evals/esql
- src
- plugins
- apm/ftr_e2e
- observability_onboarding
- e2e/playwright
- test/scout/ui
- observability/test/scout/ui
- slo/test/scout/ui
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
795 | 795 | | |
796 | 796 | | |
797 | 797 | | |
| 798 | + | |
798 | 799 | | |
799 | 800 | | |
800 | 801 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
869 | 869 | | |
870 | 870 | | |
871 | 871 | | |
| 872 | + | |
872 | 873 | | |
873 | 874 | | |
874 | 875 | | |
| |||
1051 | 1052 | | |
1052 | 1053 | | |
1053 | 1054 | | |
| 1055 | + | |
1054 | 1056 | | |
1055 | 1057 | | |
1056 | 1058 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
157 | 157 | | |
158 | 158 | | |
159 | 159 | | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
160 | 163 | | |
161 | 164 | | |
162 | 165 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1134 | 1134 | | |
1135 | 1135 | | |
1136 | 1136 | | |
| 1137 | + | |
1137 | 1138 | | |
1138 | 1139 | | |
1139 | 1140 | | |
| |||
1397 | 1398 | | |
1398 | 1399 | | |
1399 | 1400 | | |
| 1401 | + | |
1400 | 1402 | | |
1401 | 1403 | | |
1402 | 1404 | | |
| |||
1542 | 1544 | | |
1543 | 1545 | | |
1544 | 1546 | | |
| 1547 | + | |
| 1548 | + | |
1545 | 1549 | | |
1546 | 1550 | | |
1547 | 1551 | | |
| |||
1942 | 1946 | | |
1943 | 1947 | | |
1944 | 1948 | | |
| 1949 | + | |
1945 | 1950 | | |
1946 | 1951 | | |
1947 | 1952 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
197 | 197 | | |
198 | 198 | | |
199 | 199 | | |
200 | | - | |
| 200 | + | |
| 201 | + | |
201 | 202 | | |
202 | 203 | | |
203 | 204 | | |
| |||
1932 | 1933 | | |
1933 | 1934 | | |
1934 | 1935 | | |
| 1936 | + | |
| 1937 | + | |
| 1938 | + | |
| 1939 | + | |
| 1940 | + | |
| 1941 | + | |
| 1942 | + | |
| 1943 | + | |
| 1944 | + | |
| 1945 | + | |
| 1946 | + | |
| 1947 | + | |
| 1948 | + | |
| 1949 | + | |
| 1950 | + | |
| 1951 | + | |
| 1952 | + | |
| 1953 | + | |
| 1954 | + | |
1935 | 1955 | | |
1936 | 1956 | | |
1937 | 1957 | | |
| |||
4357 | 4377 | | |
4358 | 4378 | | |
4359 | 4379 | | |
| 4380 | + | |
4360 | 4381 | | |
4361 | 4382 | | |
4362 | 4383 | | |
| |||
Lines changed: 2 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
7 | 7 | | |
8 | 8 | | |
9 | 9 | | |
10 | | - | |
11 | | - | |
| 10 | + | |
| 11 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
18 | | - | |
19 | | - | |
20 | | - | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
21 | 21 | | |
Lines changed: 0 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
11 | 11 | | |
12 | 12 | | |
13 | 13 | | |
14 | | - | |
15 | 14 | | |
16 | 15 | | |
17 | 16 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
96 | 96 | | |
97 | 97 | | |
98 | 98 | | |
| 99 | + | |
99 | 100 | | |
0 commit comments