Heavy Tests #891
Annotations
9 errors
AIAgentIntegrationTest.[10] LLModel(provider=OpenRouter, id=qwen/qwen3-vl-8b-instruct, capabilities=[Temperature, Speculation, Tools, Completion, Image, Standard, ToolChoice], contextLength=131072, maxOutputTokens=33000)[jvm]:
integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/agent/AIAgentIntegrationTest.kt#L464
org.opentest4j.AssertionFailedError: There should be more or equal than 2 single tool calls in a Sequential single run scenario
1 should be >= 2
AIAgentIntegrationTest.[10] LLModel(provider=OpenRouter, id=qwen/qwen3-vl-8b-instruct, capabilities=[Temperature, Speculation, Tools, Completion, Image, Standard, ToolChoice], contextLength=131072, maxOutputTokens=33000)[jvm]:
integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/agent/AIAgentIntegrationTest.kt#L451
kotlinx.serialization.json.internal.JsonDecodingException: Unexpected JSON token at offset 44: Expected EOF after parsing, but had { instead at path: $
JSON input: {"operation": "MULTIPLY", "a": 7, "b": 2} {"milliseconds": 5***}
agent-tests
Process completed with exit code 1.
MultipleLLMPromptExecutorIntegrationTest.[1] LLModel(provider=OpenAI, id=gpt-5.2, capabilities=[Completion, Temperature, Basic, Standard, Speculation, Tools, ToolChoice, Image, Document, MultipleChoices, Completions, Responses], contextLength=400000, maxOutputTokens=128000)[jvm]:
integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/executor/MultipleLLMPromptExecutorIntegrationTest.kt#L265
java.util.NoSuchElementException: Collection contains no element matching the predicate.
MultipleLLMPromptExecutorIntegrationTest.[13] LLModel(provider=MistralAI, id=mistral-medium-latest, capabilities=[Temperature, Completion, Tools, ToolChoice, Basic, Standard, Image, Document, MultipleChoices], contextLength=128000, maxOutputTokens=null)[jvm]:
integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/executor/MultipleLLMPromptExecutorIntegrationTest.kt#L205
org.opentest4j.AssertionFailedError: "this image is a png test pattern, often used to evaluate how well an image file handles transparency and color gradients when saved in the png format.
here are some key aspects of the image:
1. **png test**: the text "png test" at the bottom indicates that this image is specifically designed to test the capabilities of the png format.
2. **color and transparency**: the image contains a variety of colors and gradients, which are used to test the color handling of the png format. the background has a grid with different shades and colors, which helps in identifying any color banding or artifacts.
3. **alpha transparency**: the grid with numbers and letters (like "1***", "75", etc.) is used to test the alpha transparency feature of png. each square in the grid has a different level of transparency, allowing you to see how well the format handles varying degrees of opacity.
4. **image content**: the central image of a man with a camera is used to test how well the png format preserves detail and color accuracy in complex images. the man is depicted with a mix of solid colors and gradients, which helps in assessing the image quality.
5. **color depth and compression**: the image tests the png format's ability to maintain color depth and handle compression without losing quality.
overall, this image is a comprehensive tool for evaluating the performance of the png format in terms of color accuracy, transparency, and image quality." should include substring "test image"
expected:<test image> but was:<this image is a png test pattern, often used to evaluate how well an image file handles transparency and color gradients when saved in the png format.
here are some key aspects of the image:
1. **png test**: the text "png test" at the bottom indicates that this image is specifically designed to test the capabilities of the png format.
2. **color and transparency**: the image contains a variety of colors and gradients, which are used to test the color handling of the png format. the background has a grid with different shades and colors, which helps in identifying any color banding or artifacts.
3. **alpha transparency**: the grid with numbers and letters (like "1***", "75", etc.) is used to test the alpha transparency feature of png. each square in the grid has a different level of transparency, allowing you to see how well the format handles varying degrees of opacity.
4. **image content**: the central image of a man with a camera is used to test how well the png format preserves detail and color accuracy in complex images. the man is depicted with a mix of solid colors and gradients, which helps in assessing the image quality.
5. **color depth and compression**: the image tests the png format's ability to maintain color depth and handle compression without losing quality.
overall, this image is a comprehensive tool for evaluating the performance of the png format in terms of color accuracy, transparency, and image quality.>
multiple-llm-executor-tests
Process completed with exit code 1.
SingleLLMPromptExecutorIntegrationTest.[10] LLModel(provider=OpenRouter, id=qwen/qwen3-vl-8b-instruct, capabilities=[Temperature, Speculation, Tools, Completion, Image, Standard, ToolChoice], contextLength=131072, maxOutputTokens=33000)[jvm]:
integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/executor/SingleLLMPromptExecutorIntegrationTest.kt#L187
org.opentest4j.AssertionFailedError: expected:<calculator> but was:<nothing>
SingleLLMPromptExecutorIntegrationTest.[1] LLModel(provider=OpenAI, id=gpt-5.2, capabilities=[Completion, Temperature, Basic, Standard, Speculation, Tools, ToolChoice, Image, Document, MultipleChoices, Completions, Responses], contextLength=400000, maxOutputTokens=128000)[jvm]:
integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/executor/SingleLLMPromptExecutorIntegrationTest.kt#L309
java.util.NoSuchElementException: Collection contains no element matching the predicate.
single-llm-executor-tests
Process completed with exit code 1.
Artifacts
Produced during runtime
| Name | Size | Digest |
|---|---|---|
| reports-ubuntu-latest-agent-tests | 327 KB | sha256:2b548b989147b71a65848b0a7f7b21f907fada12aa7c8904c17d3ae4d3db4397 |
| reports-ubuntu-latest-anthropic-schema-test | 287 KB | sha256:9cfbc409cbb3afc8e8ed53a97dbc8f02d3c72a9ed32aacdd7d064d5ae69af07e |
| reports-ubuntu-latest-bedrock-credentials-test | 284 KB | sha256:b336308ab755cee7c50d363a4eadffd19e0071a5a1f0e64700aa21f47d84c24b |
| reports-ubuntu-latest-bedrock-tests | 284 KB | sha256:86bd1d0ffd47d6995cebe5babf76d831582454ddd7d9c603dcf36c3d19ecd92b |
| reports-ubuntu-latest-capabilities-tests | 295 KB | sha256:42598fde940b0237398c127f259dfb77b8b22825fd089c1960edc971cd01a358 |
| reports-ubuntu-latest-embeddings-test | 284 KB | sha256:0f2d520c573216160780881a16cc471fd6fee60f11749b719cd4d92a64109283 |
| reports-ubuntu-latest-multiple-llm-executor-tests | 306 KB | sha256:cbf5837b5673f18e78ae0e682e799403f3ff363d1cc55301312a151c68acca5a |
| reports-ubuntu-latest-other-executor-tests | 284 KB | sha256:2f7d1509008dfe51401c87ddf652727e8613ef36a1492a769517f35971bc3176 |
| reports-ubuntu-latest-single-llm-executor-tests | 306 KB | sha256:e174133060717e0baf11d7e062584824dfbde6877d01303d701074e0a837e10f |