feat: Add flux to built-in toolsets #1208

shlok007 · 2025-12-19T10:37:39Z

Summary by CodeRabbit

New Features
- Added a Flux CD toolset exposing read-only Flux CLIs for health, status, sources, kustomizations, helmreleases, diffs, exports, tracing, and debugging aids.
Documentation
- Added comprehensive Flux CD docs (prereqs, config snippets, examples, capabilities table, troubleshooting) and integrated into site navigation and toolset index.
Tests
- Added unit tests and fixtures covering Flux command parsing, safety rules, edge cases, and representative Flux outputs.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

CLAassistant · 2025-12-19T10:37:47Z

All committers have signed the CLA.

coderabbitai · 2025-12-19T10:37:56Z

Walkthrough

Adds Flux CD support across docs, a new Bash FluxCommand with allow/deny command lists, a flux/core toolset configuration, test fixtures modeling a Flux sync-failure scenario, and unit tests validating Flux CLI parsing and safety.

Changes

Cohort / File(s)	Summary
Documentation `docs/data-sources/builtin-toolsets/.nav.yml`, `docs/data-sources/builtin-toolsets/index.md`, `docs/data-sources/builtin-toolsets/flux.md`	Added Flux CD nav entry and a new Flux toolset document (prereqs, examples, config snippets, capabilities, troubleshooting).
Bash Flux Command & Constants `holmes/plugins/toolsets/bash/flux/__init__.py`, `holmes/plugins/toolsets/bash/flux/constants.py`	New `FluxCommand` subclass for parsing/serializing `flux` CLI usage; introduced `ALLOWED_FLUX_COMMANDS` and `DENIED_FLUX_COMMANDS` static dictionaries.
Command Registry `holmes/plugins/toolsets/bash/parse_command.py`	Imported and registered `FluxCommand()` in the Bash AVAILABLE_COMMANDS list.
Toolset Configuration `holmes/plugins/toolsets/flux.yaml`	New `flux/core` toolset config with metadata, llm_instructions, prerequisites, and many templated Flux inspection/debug/read-only tools.
Test Fixtures `tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/*`	Added fixture set (flux_check.txt, flux_events.txt, flux_get_all.txt, flux_get_kustomizations.txt, flux_get_sources_git.txt, test_case.yaml, toolsets.yaml) representing a Flux sync failure scenario.
Unit Tests `tests/plugins/toolsets/bash/test_flux_commands.py`	New parametrized tests covering safe vs blocked Flux command parsing, edge cases (grep combos, namespaces, logs options, status selectors, export), and expected error messages.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Inspect allowlist/denylist completeness and intended semantics in holmes/plugins/toolsets/bash/flux/constants.py.
Verify parser wiring in holmes/plugins/toolsets/bash/flux/__init__.py (argument parsing, validation delegation, and escape_shell_args usage).
Confirm FluxCommand() integration in holmes/plugins/toolsets/bash/parse_command.py doesn't conflict with existing commands.
Review holmes/plugins/toolsets/flux.yaml templates for parameter correctness and potential injection/templating issues.
Run the new unit tests and ensure fixtures align with the expected tool output formats.

Suggested reviewers

aantn
arikalon1
Avi-Robusta

Pre-merge checks

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 76.92% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'feat: Add flux to built-in toolsets' directly and accurately describes the main change—adding Flux CD support as a new built-in toolset across documentation, configuration, and implementation files.

📜 Recent review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 6f5dcc2 and 671f54b.

📒 Files selected for processing (15)

docs/data-sources/builtin-toolsets/.nav.yml (1 hunks)
docs/data-sources/builtin-toolsets/flux.md (1 hunks)
docs/data-sources/builtin-toolsets/index.md (1 hunks)
holmes/plugins/toolsets/bash/flux/__init__.py (1 hunks)
holmes/plugins/toolsets/bash/flux/constants.py (1 hunks)
holmes/plugins/toolsets/bash/parse_command.py (2 hunks)
holmes/plugins/toolsets/flux.yaml (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_check.txt (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_events.txt (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_get_all.txt (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_get_kustomizations.txt (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_get_sources_git.txt (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/test_case.yaml (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/toolsets.yaml (1 hunks)
tests/plugins/toolsets/bash/test_flux_commands.py (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (11)

tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_get_sources_git.txt
docs/data-sources/builtin-toolsets/index.md
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_events.txt
docs/data-sources/builtin-toolsets/.nav.yml
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/test_case.yaml
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/toolsets.yaml
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_check.txt
holmes/plugins/toolsets/bash/flux/constants.py
holmes/plugins/toolsets/bash/parse_command.py
tests/plugins/toolsets/bash/test_flux_commands.py
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_get_kustomizations.txt

🧰 Additional context used

🪛 LanguageTool

docs/data-sources/builtin-toolsets/flux.md

[style] ~71-~71: Using many exclamation marks might seem excessive (in this case: 6 exclamation marks for a text that’s 3454 characters long)
Context: ...me Kubernetes access as your Flux CLI. !!! info "Flux Version Compatibility" ...

(EN_EXCESSIVE_EXCLAMATION)

🪛 markdownlint-cli2 (0.18.1)

docs/data-sources/builtin-toolsets/flux.md

41-41: Code block style
Expected: fenced; Actual: indented

(MD046, code-block-style)

59-59: Code block style
Expected: fenced; Actual: indented

(MD046, code-block-style)

70-70: Code block style
Expected: fenced; Actual: indented

(MD046, code-block-style)

74-74: Code block style
Expected: fenced; Actual: indented

(MD046, code-block-style)

🪛 Ruff (0.14.8)

holmes/plugins/toolsets/bash/flux/__init__.py

41-41: Unused method argument: original_command

(ARG002)

41-41: Unused method argument: config

(ARG002)

52-52: Unused method argument: original_command

(ARG002)

52-52: Unused method argument: config

(ARG002)

🔇 Additional comments (11)

holmes/plugins/toolsets/bash/flux/__init__.py (4)

1-18: LGTM!

The import structure and class definition follow the established pattern for Bash command integrations. The inheritance from BashCommand with the "flux" prefix is appropriate.

20-38: LGTM!

The parser construction correctly uses argparse.REMAINDER to capture Flux's variable subcommands and options, and exit_on_error=False ensures proper error handling without abrupt exits.

40-60: LGTM!

Both methods correctly implement the BashCommand interface contract. The validate_command method safely delegates to the validator, and stringify_command properly reconstructs commands with shell escaping via escape_shell_args to prevent injection attacks.

Note: The unused parameter warnings from Ruff are false positives—these parameters are part of the interface contract inherited from BashCommand.

63-65: LGTM!

The helper function provides a clean API for parser creation and follows a clear factory pattern.

docs/data-sources/builtin-toolsets/flux.md (1)

1-133: Excellent documentation!

The documentation is comprehensive and well-structured, covering prerequisites, installation, configuration, examples, capabilities, and troubleshooting. The use of MkDocs Material features (tabs, admonitions) enhances readability.

Note: The static analysis hints about exclamation marks and code block style are false positives—the !!! syntax is standard MkDocs admonition markup, and indented code blocks within === tabs are required by MkDocs Material.

tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_get_all.txt (1)

1-15: LGTM!

The test fixture correctly simulates a Flux sync failure scenario with realistic output showing SSH authentication failure and downstream dependency issues. The format matches expected Flux CLI output.

holmes/plugins/toolsets/flux.yaml (5)

1-43: Excellent LLM instructions!

The metadata and LLM instructions provide comprehensive guidance covering Flux concepts, investigation workflows, and common issues. This will help the AI agent effectively troubleshoot GitOps deployments.

44-61: LGTM!

The health and overview tools provide a solid foundation for Flux troubleshooting with clear descriptions and appropriate commands.

63-111: LGTM!

The source and resource status tools use consistent Jinja2 templating patterns with safe conditional namespace and name filtering. The commands cover all major Flux resource types comprehensively.

113-133: LGTM! Previous issues properly addressed.

The troubleshooting tools are well-implemented. Specifically:

Line 119 correctly guards the --for flag with {% if kind and name %} to prevent malformed output (✅ fixed from previous review)

Lines 121-133 include proper parameter validation for flux_trace with required fields (✅ fixed from previous review)

135-212: LGTM! All parameter validations properly implemented.

All debug, diff, and export commands now include proper parameter schemas with:

Required parameters marked with required: true

Sensible defaults (flux-system namespace)

Clear descriptions

This addresses the previous review concern about missing parameter validation (✅ fixed from previous review).

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

📜 Review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 3f1d3a3 and c2323da.

📒 Files selected for processing (15)

docs/data-sources/builtin-toolsets/.nav.yml (1 hunks)
docs/data-sources/builtin-toolsets/flux.md (1 hunks)
docs/data-sources/builtin-toolsets/index.md (1 hunks)
holmes/plugins/toolsets/bash/flux/__init__.py (1 hunks)
holmes/plugins/toolsets/bash/flux/constants.py (1 hunks)
holmes/plugins/toolsets/bash/parse_command.py (2 hunks)
holmes/plugins/toolsets/flux.yaml (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_check.txt (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_events.txt (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_get_all.txt (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_get_kustomizations.txt (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_get_sources_git.txt (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/test_case.yaml (1 hunks)
tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/toolsets.yaml (1 hunks)
tests/plugins/toolsets/bash/test_flux_commands.py (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (3)

holmes/plugins/toolsets/bash/flux/__init__.py (5)

holmes/plugins/toolsets/bash/common/bash_command.py (1)

BashCommand (9-38)

holmes/plugins/toolsets/bash/common/config.py (1)

BashExecutorConfig (13-14)

holmes/plugins/toolsets/bash/common/stringify.py (1)

escape_shell_args (10-38)

holmes/plugins/toolsets/bash/common/validators.py (1)

validate_command_and_operations (28-44)

holmes/plugins/toolsets/bash/parse_command.py (2)

validate_command (88-94)

stringify_command (97-111)

tests/plugins/toolsets/bash/test_flux_commands.py (2)

holmes/plugins/toolsets/bash/common/config.py (1)

BashExecutorConfig (13-14)

holmes/plugins/toolsets/bash/parse_command.py (1)

make_command_safe (155-179)

holmes/plugins/toolsets/bash/parse_command.py (1)

holmes/plugins/toolsets/bash/flux/__init__.py (1)

FluxCommand (16-60)

🪛 Ruff (0.14.8)

holmes/plugins/toolsets/bash/flux/__init__.py

41-41: Unused method argument: original_command

(ARG002)

41-41: Unused method argument: config

(ARG002)

52-52: Unused method argument: original_command

(ARG002)

52-52: Unused method argument: config

(ARG002)

🔇 Additional comments (19)

docs/data-sources/builtin-toolsets/.nav.yml (1)

15-15: LGTM!

The navigation entry is correctly formatted and alphabetically positioned between Docker and GitHub.

docs/data-sources/builtin-toolsets/index.md (1)

21-21: LGTM!

The Flux CD entry is properly formatted with an appropriate icon (material-source-branch) and correctly positioned alphabetically.

tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/toolsets.yaml (1)

1-7: LGTM!

The toolset configuration correctly enables the necessary toolsets (kubernetes/core, flux/core, kubernetes/logs) for the Flux sync failure test scenario.

tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/test_case.yaml (1)

1-6: LGTM!

The test case is well-defined with a clear user prompt and specific, measurable expected output criteria that align with the accompanying fixture data showing SSH authentication failures.

tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_get_sources_git.txt (1)

1-8: LGTM!

The fixture accurately represents a realistic Flux GitRepository status output with one source experiencing SSH authentication failures and another successfully reconciled. The format is consistent with the expected tool invocation structure.

tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_check.txt (1)

1-12: LGTM!

The fixture correctly represents a successful Flux health check, showing all controllers as ready. This provides a good baseline for troubleshooting scenarios where Flux itself is healthy but specific resources are failing.

docs/data-sources/builtin-toolsets/flux.md (2)

1-130: Excellent documentation!

The Flux CD toolset documentation is comprehensive, well-structured, and covers all essential aspects:

Clear prerequisites and multi-platform installation instructions

Configuration examples for both Holmes CLI and Helm deployments

Realistic example questions for common troubleshooting scenarios

Extensive capabilities table covering 26 Flux tools

Helpful troubleshooting guidance for common issues

87-114: All documented Flux tools are implemented in the toolset configuration. The table lists 25 tools (not 26 as stated), and verification confirms all 25 are present in holmes/plugins/toolsets/flux.yaml with proper command definitions and descriptions.

tests/llm/fixtures/test_ask_holmes/176_flux_sync_failure/flux_events.txt (1)

1-12: LGTM!

The fixture effectively models a realistic Flux event timeline, showing repeated reconciliation failures due to SSH authentication issues and the cascading impact on dependent Kustomizations. The events are consistent with the other fixtures in this test scenario.

holmes/plugins/toolsets/bash/parse_command.py (1)

12-12: LGTM! Clean integration of FluxCommand.

The FluxCommand is properly imported and added to AVAILABLE_COMMANDS following the established pattern for other toolsets (kubectl, argocd, aws, etc.).

Also applies to: 37-37

holmes/plugins/toolsets/flux.yaml (1)

1-44: Excellent toolset documentation and metadata.

The LLM instructions provide clear context about Flux CD concepts, investigation workflow, and common issues. The prerequisite check and documentation links are helpful.

holmes/plugins/toolsets/bash/flux/constants.py (2)

1-48: Well-designed allowlist for read-only Flux operations.

The ALLOWED_FLUX_COMMANDS properly restricts access to safe, read-only operations:

Status queries (get, check, stats)

Observability (logs, events, trace, tree)

Read-only inspection (debug, diff, export)

The nested dictionary structure allows for future fine-grained control of subcommands.

50-76: Comprehensive blocklist prevents state-modifying operations.

The DENIED_FLUX_COMMANDS correctly blocks all potentially dangerous operations:

State changes (reconcile, suspend, resume, create, delete)

Cluster modifications (bootstrap, install, uninstall)

Artifact modifications (push, tag, build)

This ensures the toolset cannot accidentally modify the cluster state.

holmes/plugins/toolsets/bash/flux/__init__.py (3)

16-38: Clean parser implementation following established patterns.

The FluxCommand parser correctly:

Captures the main flux command (get, check, logs, etc.)

Uses nargs=argparse.REMAINDER to capture all subcommands and options

Sets exit_on_error=False for graceful error handling

40-49: Proper validation using allowlist/blocklist approach.

The validation correctly delegates to validate_command_and_operations with the ALLOWED_FLUX_COMMANDS and DENIED_FLUX_COMMANDS constants, ensuring only safe read-only operations are permitted.

The hasattr check on Line 43 is defensive programming since the parser always creates the options attribute (default=[]).

Note: The static analysis warnings about unused original_command and config parameters are false positives. These parameters are required by the BashCommand abstract interface (as shown in the relevant code snippets from bash_command.py). Based on learnings, this is the expected pattern across all command implementations.

51-60: Safe command stringification with proper escaping.

The stringify_command method correctly:

Rebuilds the command from parsed components

Uses escape_shell_args to prevent injection attacks

Handles optional options gracefully

tests/plugins/toolsets/bash/test_flux_commands.py (3)

15-142: Excellent comprehensive coverage of safe Flux commands.

The parametrized tests cover:

All read-only command categories (version, check, stats, get, logs, events, trace, tree, debug, diff, export)

Various flag combinations (-A, -n, --all-namespaces, -o, -w, --status-selector)

Multiple resource types (sources, kustomizations, helmreleases, alerts, images)

The assertion on exact string equality ensures command stringification is consistent and safe.

145-278: Thorough validation of unsafe command blocking.

The tests properly verify that all state-modifying operations are blocked:

State changes (reconcile, suspend, resume, create, delete)

Cluster modifications (bootstrap, install, uninstall)

Artifact operations (push, build)

Invalid commands/operations

The tests correctly assert both the exception type and the error message content, ensuring clear feedback to users.

280-381: Well-designed edge case tests.

The edge case tests cover important scenarios:

Command piping with grep (safe and unsafe combinations)

Namespace flag variations (short/long forms, all-namespaces)

Logs command with multiple filter options

Case sensitivity validation (prevents command spoofing)

Status selectors and export operations

These tests ensure robust handling of real-world usage patterns.

holmes/plugins/toolsets/flux.yaml

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

holmes/plugins/toolsets/flux.yaml (1)

113-115: Consider adding parameter schema for consistency.

The flux_logs command uses several optional parameters (flux_namespace, kind, name, level, since) without an explicit parameters: block. While the conditional templates handle these gracefully, adding explicit parameter definitions would improve consistency with other parameterized commands in the toolset and provide better documentation of available options.

📜 Review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c2323da and 6f5dcc2.

📒 Files selected for processing (1)

holmes/plugins/toolsets/flux.yaml (1 hunks)

🔇 Additional comments (1)

holmes/plugins/toolsets/flux.yaml (1)

124-212: Excellent parameter schema additions!

The explicit parameter blocks for flux_trace, flux_tree, flux_debug_*, flux_diff_*, and flux_export_* commands successfully address the previous review feedback. Required parameters are now properly validated before execution, preventing malformed commands from being generated when the LLM fails to provide necessary arguments. The consistent format and clear descriptions across all parameter definitions improve both maintainability and the user experience.

holmes/plugins/toolsets/flux.yaml

Signed-off-by: shlok007 <[email protected]>

aantn · 2025-12-24T17:07:14Z

Hi @shlok007, thank you for the PR.
You've added the flux toolset in two separate ways in this PR - once as a YAML toolset and once via the bash toolset as a whitelisted command. You don't need both of these! I would recommend:

Doing this only as a YAML toolset
Reducing the number of tools to a much smaller number of more powerful tools. e.g. flux_get_sources_all, flux_get_sources_git, flux_get_sources_helm, etc could all be expressed as a single flux_get command with a kind parameter

Finally, eval 176 won't actually pass - you don't actually setup the test scenario in the cluster. There are also a bunch of unrelated files.

I strongly suggest reading a few evals first to understand the format, then writing an eval of your own, then trying to make sure it passes reliably.

shlok007 mentioned this pull request Dec 19, 2025

Add Flux in built-in toolsets #1203

Open

coderabbitai bot reviewed Dec 19, 2025

View reviewed changes

holmes/plugins/toolsets/flux.yaml Show resolved Hide resolved

coderabbitai bot reviewed Dec 19, 2025

View reviewed changes

holmes/plugins/toolsets/flux.yaml Outdated Show resolved Hide resolved

feat: add flux to built-in toolsets

671f54b

Signed-off-by: shlok007 <[email protected]>

shlok007 force-pushed the flux-toolset branch from 9889b43 to 671f54b Compare December 19, 2025 11:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Add flux to built-in toolsets #1208

feat: Add flux to built-in toolsets #1208

Uh oh!

shlok007 commented Dec 19, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

CLAassistant commented Dec 19, 2025 •

edited

Loading

Uh oh!

coderabbitai bot commented Dec 19, 2025 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

aantn commented Dec 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

feat: Add flux to built-in toolsets #1208

Are you sure you want to change the base?

feat: Add flux to built-in toolsets #1208

Uh oh!

Conversation

shlok007 commented Dec 19, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

CLAassistant commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai bot commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Suggested reviewers

Pre-merge checks

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

aantn commented Dec 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

shlok007 commented Dec 19, 2025 •

edited by coderabbitai bot

Loading

CLAassistant commented Dec 19, 2025 •

edited

Loading

coderabbitai bot commented Dec 19, 2025 •

edited

Loading