Skip to content

Heavy Tests

Heavy Tests #891

Manually triggered December 16, 2025 13:35
Status Failure
Total duration 10m 38s
Artifacts 9

heavy-tests.yml

on: workflow_dispatch
Matrix: integration-tests
Fit to window
Zoom out
Zoom in

Annotations

9 errors
AIAgentIntegrationTest.[10] LLModel(provider=OpenRouter, id=qwen/qwen3-vl-8b-instruct, capabilities=[Temperature, Speculation, Tools, Completion, Image, Standard, ToolChoice], contextLength=131072, maxOutputTokens=33000)[jvm]: integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/agent/AIAgentIntegrationTest.kt#L451
kotlinx.serialization.json.internal.JsonDecodingException: Unexpected JSON token at offset 44: Expected EOF after parsing, but had { instead at path: $ JSON input: {"operation": "MULTIPLY", "a": 7, "b": 2} {"milliseconds": 5***}
agent-tests
Process completed with exit code 1.
MultipleLLMPromptExecutorIntegrationTest.[13] LLModel(provider=MistralAI, id=mistral-medium-latest, capabilities=[Temperature, Completion, Tools, ToolChoice, Basic, Standard, Image, Document, MultipleChoices], contextLength=128000, maxOutputTokens=null)[jvm]: integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/executor/MultipleLLMPromptExecutorIntegrationTest.kt#L205
org.opentest4j.AssertionFailedError: "this image is a png test pattern, often used to evaluate how well an image file handles transparency and color gradients when saved in the png format. here are some key aspects of the image: 1. **png test**: the text "png test" at the bottom indicates that this image is specifically designed to test the capabilities of the png format. 2. **color and transparency**: the image contains a variety of colors and gradients, which are used to test the color handling of the png format. the background has a grid with different shades and colors, which helps in identifying any color banding or artifacts. 3. **alpha transparency**: the grid with numbers and letters (like "1***", "75", etc.) is used to test the alpha transparency feature of png. each square in the grid has a different level of transparency, allowing you to see how well the format handles varying degrees of opacity. 4. **image content**: the central image of a man with a camera is used to test how well the png format preserves detail and color accuracy in complex images. the man is depicted with a mix of solid colors and gradients, which helps in assessing the image quality. 5. **color depth and compression**: the image tests the png format's ability to maintain color depth and handle compression without losing quality. overall, this image is a comprehensive tool for evaluating the performance of the png format in terms of color accuracy, transparency, and image quality." should include substring "test image" expected:<test image> but was:<this image is a png test pattern, often used to evaluate how well an image file handles transparency and color gradients when saved in the png format. here are some key aspects of the image: 1. **png test**: the text "png test" at the bottom indicates that this image is specifically designed to test the capabilities of the png format. 2. **color and transparency**: the image contains a variety of colors and gradients, which are used to test the color handling of the png format. the background has a grid with different shades and colors, which helps in identifying any color banding or artifacts. 3. **alpha transparency**: the grid with numbers and letters (like "1***", "75", etc.) is used to test the alpha transparency feature of png. each square in the grid has a different level of transparency, allowing you to see how well the format handles varying degrees of opacity. 4. **image content**: the central image of a man with a camera is used to test how well the png format preserves detail and color accuracy in complex images. the man is depicted with a mix of solid colors and gradients, which helps in assessing the image quality. 5. **color depth and compression**: the image tests the png format's ability to maintain color depth and handle compression without losing quality. overall, this image is a comprehensive tool for evaluating the performance of the png format in terms of color accuracy, transparency, and image quality.>
multiple-llm-executor-tests
Process completed with exit code 1.
single-llm-executor-tests
Process completed with exit code 1.

Artifacts

Produced during runtime
Name Size Digest
reports-ubuntu-latest-agent-tests
327 KB
sha256:2b548b989147b71a65848b0a7f7b21f907fada12aa7c8904c17d3ae4d3db4397
reports-ubuntu-latest-anthropic-schema-test
287 KB
sha256:9cfbc409cbb3afc8e8ed53a97dbc8f02d3c72a9ed32aacdd7d064d5ae69af07e
reports-ubuntu-latest-bedrock-credentials-test
284 KB
sha256:b336308ab755cee7c50d363a4eadffd19e0071a5a1f0e64700aa21f47d84c24b
reports-ubuntu-latest-bedrock-tests
284 KB
sha256:86bd1d0ffd47d6995cebe5babf76d831582454ddd7d9c603dcf36c3d19ecd92b
reports-ubuntu-latest-capabilities-tests
295 KB
sha256:42598fde940b0237398c127f259dfb77b8b22825fd089c1960edc971cd01a358
reports-ubuntu-latest-embeddings-test
284 KB
sha256:0f2d520c573216160780881a16cc471fd6fee60f11749b719cd4d92a64109283
reports-ubuntu-latest-multiple-llm-executor-tests
306 KB
sha256:cbf5837b5673f18e78ae0e682e799403f3ff363d1cc55301312a151c68acca5a
reports-ubuntu-latest-other-executor-tests
284 KB
sha256:2f7d1509008dfe51401c87ddf652727e8613ef36a1492a769517f35971bc3176
reports-ubuntu-latest-single-llm-executor-tests
306 KB
sha256:e174133060717e0baf11d7e062584824dfbde6877d01303d701074e0a837e10f