Heavy Tests · JetBrains/koog@77c8d0b

Manually triggered December 16, 2025 13:35

aozherelyeva

77c8d0b

fix-tests

Status Failure

Total duration 10m 38s

Artifacts 9

heavy-tests.yml

on: workflow_dispatch

Matrix: integration-tests

Annotations

9 errors

AIAgentIntegrationTest.[10] LLModel(provider=OpenRouter, id=qwen/qwen3-vl-8b-instruct, capabilities=[Temperature, Speculation, Tools, Completion, Image, Standard, ToolChoice], contextLength=131072, maxOutputTokens=33000)[jvm]: integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/agent/AIAgentIntegrationTest.kt#L464

org.opentest4j.AssertionFailedError: There should be more or equal than 2 single tool calls in a Sequential single run scenario 1 should be >= 2

AIAgentIntegrationTest.[10] LLModel(provider=OpenRouter, id=qwen/qwen3-vl-8b-instruct, capabilities=[Temperature, Speculation, Tools, Completion, Image, Standard, ToolChoice], contextLength=131072, maxOutputTokens=33000)[jvm]: integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/agent/AIAgentIntegrationTest.kt#L451

kotlinx.serialization.json.internal.JsonDecodingException: Unexpected JSON token at offset 44: Expected EOF after parsing, but had { instead at path: $ JSON input: {"operation": "MULTIPLY", "a": 7, "b": 2} {"milliseconds": 5***}
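
The JSON input quoted in this exception holds two objects written back to back, and the decoder stops after the first one because it expects end of input. As a minimal sketch of one way to separate such input before decoding, assuming the tool-call arguments arrive as a single concatenated string: `splitTopLevelJsonObjects` is a hypothetical helper, not part of koog or kotlinx.serialization, and the `500` in the sample is an arbitrary stand-in for the value masked in the log.

```kotlin
// Hypothetical helper, not part of koog: splits a string that holds several
// top-level JSON objects written back to back into one string per object.
fun splitTopLevelJsonObjects(input: String): List<String> {
    val objects = mutableListOf<String>()
    var depth = 0          // current nesting level of { }
    var inString = false   // true while inside a JSON string literal
    var escaped = false    // true right after a backslash inside a string
    var start = -1         // index where the current top-level object began

    for ((i, c) in input.withIndex()) {
        if (inString) {
            // Braces inside string literals must not affect the depth counter.
            when {
                escaped -> escaped = false
                c == '\\' -> escaped = true
                c == '"' -> inString = false
            }
            continue
        }
        when (c) {
            '"' -> inString = true
            '{' -> {
                if (depth == 0) start = i
                depth++
            }
            '}' -> if (depth > 0) {
                depth--
                if (depth == 0) {
                    objects += input.substring(start, i + 1)
                }
            }
        }
    }
    return objects
}

fun main() {
    // Same shape as the input in the exception; 500 is an arbitrary placeholder.
    val raw = """{"operation": "MULTIPLY", "a": 7, "b": 2} {"milliseconds": 500}"""
    splitTopLevelJsonObjects(raw).forEach(::println)
}
```

Each returned string could then be decoded on its own. Whether the model's output should be split like this or rejected outright is a design decision the log does not settle.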

agent-tests

Process completed with exit code 1.

MultipleLLMPromptExecutorIntegrationTest.[1] LLModel(provider=OpenAI, id=gpt-5.2, capabilities=[Completion, Temperature, Basic, Standard, Speculation, Tools, ToolChoice, Image, Document, MultipleChoices, Completions, Responses], contextLength=400000, maxOutputTokens=128000)[jvm]: integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/executor/MultipleLLMPromptExecutorIntegrationTest.kt#L265

java.util.NoSuchElementException: Collection contains no element matching the predicate.
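
This message is the one Kotlin's standard library `first { ... }` throws when no element of a collection matches the predicate. The sketch below reproduces that behaviour and shows a `firstOrNull` variant that reports what was actually received. The `Message` type, the `"tool_call"` role, and `firstToolCall` are hypothetical stand-ins, not koog types, and the log does not confirm which lookup the failing test line performs.

```kotlin
// Hypothetical stand-in for an LLM response message; not a koog type.
data class Message(val role: String, val content: String)

// Returns the first tool-call message, or fails with a message listing the roles received.
fun firstToolCall(messages: List<Message>): Message =
    messages.firstOrNull { it.role == "tool_call" }
        ?: throw AssertionError(
            "Expected a tool call, got roles: ${messages.map { it.role }}"
        )

fun main() {
    val messages = listOf(Message("assistant", "Here is the answer."))

    // first { ... } throws NoSuchElementException when nothing matches.
    val thrown = runCatching { messages.first { it.role == "tool_call" } }.exceptionOrNull()
    println(thrown)

    // firstOrNull plus an explicit failure names what actually came back.
    println(runCatching { firstToolCall(messages) }.exceptionOrNull()?.message)
}
```

A failure message that lists the received roles makes it easier to tell a model that answered in plain text from one that returned nothing at all.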

MultipleLLMPromptExecutorIntegrationTest.[13] LLModel(provider=MistralAI, id=mistral-medium-latest, capabilities=[Temperature, Completion, Tools, ToolChoice, Basic, Standard, Image, Document, MultipleChoices], contextLength=128000, maxOutputTokens=null)[jvm]: integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/executor/MultipleLLMPromptExecutorIntegrationTest.kt#L205

org.opentest4j.AssertionFailedError: "this image is a png test pattern, often used to evaluate how well an image file handles transparency and color gradients when saved in the png format. here are some key aspects of the image: 1. **png test**: the text "png test" at the bottom indicates that this image is specifically designed to test the capabilities of the png format. 2. **color and transparency**: the image contains a variety of colors and gradients, which are used to test the color handling of the png format. the background has a grid with different shades and colors, which helps in identifying any color banding or artifacts. 3. **alpha transparency**: the grid with numbers and letters (like "1***", "75", etc.) is used to test the alpha transparency feature of png. each square in the grid has a different level of transparency, allowing you to see how well the format handles varying degrees of opacity. 4. **image content**: the central image of a man with a camera is used to test how well the png format preserves detail and color accuracy in complex images. the man is depicted with a mix of solid colors and gradients, which helps in assessing the image quality. 5. **color depth and compression**: the image tests the png format's ability to maintain color depth and handle compression without losing quality. overall, this image is a comprehensive tool for evaluating the performance of the png format in terms of color accuracy, transparency, and image quality." should include substring "test image" expected:<test image> but was:<this image is a png test pattern, often used to evaluate how well an image file handles transparency and color gradients when saved in the png format. here are some key aspects of the image: 1. **png test**: the text "png test" at the bottom indicates that this image is specifically designed to test the capabilities of the png format. 2. 
**color and transparency**: the image contains a variety of colors and gradients, which are used to test the color handling of the png format. the background has a grid with different shades and colors, which helps in identifying any color banding or artifacts. 3. **alpha transparency**: the grid with numbers and letters (like "1***", "75", etc.) is used to test the alpha transparency feature of png. each square in the grid has a different level of transparency, allowing you to see how well the format handles varying degrees of opacity. 4. **image content**: the central image of a man with a camera is used to test how well the png format preserves detail and color accuracy in complex images. the man is depicted with a mix of solid colors and gradients, which helps in assessing the image quality. 5. **color depth and compression**: the image tests the png format's ability to maintain color depth and handle compression without losing quality. overall, this image is a comprehensive tool for evaluating the performance of the png format in terms of color accuracy, transparency, and image quality.>
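
The response above mentions both "test" and "image" but never the exact phrase "test image", which is what the assertion requires. A minimal sketch of the difference between an exact-phrase check and a per-keyword check follows. `containsAllKeywords` is a hypothetical helper, and whether looser matching is acceptable for this test is a decision for its authors.

```kotlin
// Hypothetical helper: true when every keyword occurs somewhere in the text, ignoring case.
fun containsAllKeywords(text: String, keywords: List<String>): Boolean =
    keywords.all { text.contains(it, ignoreCase = true) }

fun main() {
    // Opening words of the model response quoted in the failure above.
    val response = "this image is a png test pattern"

    println(response.contains("test image"))                        // exact phrase: false
    println(containsAllKeywords(response, listOf("test", "image"))) // individual keywords: true
}
```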

multiple-llm-executor-tests

Process completed with exit code 1.

SingleLLMPromptExecutorIntegrationTest.[10] LLModel(provider=OpenRouter, id=qwen/qwen3-vl-8b-instruct, capabilities=[Temperature, Speculation, Tools, Completion, Image, Standard, ToolChoice], contextLength=131072, maxOutputTokens=33000)[jvm]: integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/executor/SingleLLMPromptExecutorIntegrationTest.kt#L187

org.opentest4j.AssertionFailedError: expected:<calculator> but was:<nothing>

SingleLLMPromptExecutorIntegrationTest.[1] LLModel(provider=OpenAI, id=gpt-5.2, capabilities=[Completion, Temperature, Basic, Standard, Speculation, Tools, ToolChoice, Image, Document, MultipleChoices, Completions, Responses], contextLength=400000, maxOutputTokens=128000)[jvm]: integration-tests/src/jvmTest/kotlin/ai/koog/integration/tests/executor/SingleLLMPromptExecutorIntegrationTest.kt#L309

java.util.NoSuchElementException: Collection contains no element matching the predicate.

single-llm-executor-tests

Process completed with exit code 1.

Artifacts

Produced during runtime

Name	Size	Digest
reports-ubuntu-latest-agent-tests	327 KB	`sha256:2b548b989147b71a65848b0a7f7b21f907fada12aa7c8904c17d3ae4d3db4397`
reports-ubuntu-latest-anthropic-schema-test	287 KB	`sha256:9cfbc409cbb3afc8e8ed53a97dbc8f02d3c72a9ed32aacdd7d064d5ae69af07e`
reports-ubuntu-latest-bedrock-credentials-test	284 KB	`sha256:b336308ab755cee7c50d363a4eadffd19e0071a5a1f0e64700aa21f47d84c24b`
reports-ubuntu-latest-bedrock-tests	284 KB	`sha256:86bd1d0ffd47d6995cebe5babf76d831582454ddd7d9c603dcf36c3d19ecd92b`
reports-ubuntu-latest-capabilities-tests	295 KB	`sha256:42598fde940b0237398c127f259dfb77b8b22825fd089c1960edc971cd01a358`
reports-ubuntu-latest-embeddings-test	284 KB	`sha256:0f2d520c573216160780881a16cc471fd6fee60f11749b719cd4d92a64109283`
reports-ubuntu-latest-multiple-llm-executor-tests	306 KB	`sha256:cbf5837b5673f18e78ae0e682e799403f3ff363d1cc55301312a151c68acca5a`
reports-ubuntu-latest-other-executor-tests	284 KB	`sha256:2f7d1509008dfe51401c87ddf652727e8613ef36a1492a769517f35971bc3176`
reports-ubuntu-latest-single-llm-executor-tests	306 KB	`sha256:e174133060717e0baf11d7e062584824dfbde6877d01303d701074e0a837e10f`

Heavy Tests #891