Command Filter Modules

Scope

Command execution and output filtering. Every module here calls an external CLI tool (Command::new("some_tool")), transforms its stdout/stderr to reduce token consumption, and records savings via core/tracking.

Owns: all command-specific filter logic, organized by ecosystem (git, rust, js, python, go, dotnet, cloud, system). Cross-ecosystem routing (e.g., lint_cmd detecting Python and delegating to ruff_cmd) is an intra-component concern.

Does not own: the TOML DSL filter engine (that's core/toml_filter), hook interception (that's hooks/), or analytics dashboards (that's analytics/). This component writes to the tracking DB; analytics reads from it.

Boundary rule: a module belongs here if and only if it executes an external command and filters its output. Infrastructure that serves multiple modules without calling external commands belongs in core/.

When to Write a Rust Module (vs TOML Filter)

Rust modules exist here because they need capabilities TOML filters don't have: parsing structured output (JSON, NDJSON), state machine parsing across phases, injecting CLI flags (--format json), cross-command routing, or flag-aware filtering — detecting user-requested verbose flags (e.g., --nocapture) and adjusting compression accordingly (see Design Philosophy and TOML vs Rust decision table).

Ecosystem placement: Match the command's language/toolchain. Use system/ for language-agnostic commands. New ecosystem when 3+ related commands justify it.

For the full contribution checklist (including discover/rules.rs registration), see Adding a New Command Filter below.

Purpose

All command-specific filter modules that execute CLI commands and transform their output to minimize LLM token consumption. Each module follows a consistent pattern: execute the underlying command, filter its output through specialized parsers, track token savings, and propagate exit codes.

Ecosystems

Each subdirectory has its own README with file descriptions, parsing strategies, and cross-command dependencies.

git/ — git, gh, gt, diff — trailing_var_arg parsing, gh markdown filtering, gt passthrough
rust/ — cargo, runner (err/test) — Cargo sub-enum routing, runner dual-mode
js/ — npm, pnpm, vitest, lint, tsc, next, prettier, playwright, prisma — Package manager auto-detection, lint routing, cross-deps with python
python/ — ruff, pytest, mypy, pip — JSON check vs text format, state machine parsing, uv auto-detection
go/ — go test/build/vet, golangci-lint — NDJSON streaming, Go sub-enum pattern
dotnet/ — dotnet, binlog, trx, format_report — DotnetCommands sub-enum, internal helper modules
cloud/ — aws, docker/kubectl, curl, wget, psql — Docker/Kubectl sub-enums, JSON forced output
system/ — ls, tree, read, grep, find, wc, env, json, log, deps, summary, format, smart — format_cmd routing, filter levels, language detection
ruby/ — rake/rails test, rspec, rubocop — JSON injection pattern, ruby_exec() bundle exec auto-detection

Execution Flow

The shared wrappers in core/runner.rs encapsulate the execution skeleton. Modules build the Command (custom arg logic), then delegate to a runner entry point. All runners handle tracking, tee recovery, and exit code propagation automatically.

 run_streaming()       Filter applied              tee_and_hint()
      |                (per-line or post-hoc)            |
      v                       |                          v
 +---------+  stdout  +-------+-------+  filtered  +-------+
 | Spawn   |--------->| filter        |----------->| Print |
 +---------+  stderr  +---------------+            +-------+
      |        (live)                                    |
      v                                                  v
 +----------+                                    +---------+
 | raw =    |                                    | Track   |
 | stdout + |                                    | savings |
 | stderr   |                                    +---------+
 +----------+                                          |
                                                       v
                                                 +-----------+
                                                 | Ok(code)  |
                                                 | returned  |
                                                 +-----------+

Filter modes

All execution goes through core::stream::run_streaming() with one of four FilterMode variants. The runner entry points (run_filtered, run_streamed, run_passthrough) select the appropriate mode automatically — module authors don't interact with FilterMode directly.

FilterMode	How it works	Used by
`CaptureOnly`	Buffers all stdout silently, then passes the full string to `filter_fn` post-hoc. Stderr streams to terminal in real time.	`run_filtered()` (default path)
`Buffered`	Buffers all stdout, applies filter, then prints the result. Stderr streams live. Chosen automatically by `run_filtered()` when `filter_stdout_only` is set.	`run_filtered()` (stdout-only path)
`Streaming`	Feeds each stdout line to a `StreamFilter::feed_line()` as it arrives. Emitted lines print immediately. Calls `flush()` after process exits for final output.	`run_streamed()`
`Passthrough`	Inherits the parent TTY directly — no piping, no buffering. `raw`/`filtered` are empty.	`run_passthrough()`

When to use which

Scenario	Runner	FilterMode	Why
Parse structured output (JSON, tables)	`run_filtered()`	CaptureOnly/Buffered	Filter needs full text to parse structure
Long-running, line-parseable output	`run_streamed()`	Streaming	Low memory, real-time output
No filtering, just track usage	`run_passthrough()`	Passthrough	Zero overhead, inherits TTY
Custom logic (multi-command, file I/O)	Manual with `exec_capture()`	CaptureOnly	Full control over execution

Phases

Spawn — run_streaming() starts the child process with piped stdout/stderr (or inherited TTY for Passthrough)
Filter — stdout is processed per the FilterMode; stderr is forwarded to the terminal in real time via a dedicated reader thread
Print — filtered output is written to stdout (live for Streaming, post-hoc for CaptureOnly/Buffered); if tee enabled, appends recovery hint on failure
Track — timer.track() records raw vs filtered for token savings
Exit code — returns Ok(exit_code) to caller; main.rs calls process::exit(code) once

RunOptions builder:

Constructor	Behavior
`RunOptions::default()`	Combined stdout+stderr to filter, no tee
`RunOptions::with_tee("label")`	Combined filtering + tee recovery
`RunOptions::stdout_only()`	Stdout-only to filter, stderr passthrough, no tee
`RunOptions::stdout_only().tee("label")`	Stdout-only + tee recovery

Example — filtered command (recommended):

pub fn run(args: &[String], verbose: u8) -> Result<i32> {
    let mut cmd = resolved_command("mycmd");
    for arg in args { cmd.arg(arg); }
    if verbose > 0 { eprintln!("Running: mycmd {}", args.join(" ")); }

    runner::run_filtered(
        cmd, "mycmd", &args.join(" "),
        filter_mycmd_output,
        runner::RunOptions::stdout_only().tee("mycmd"),
    )
}

Exit code handling is fully automatic when using run_filtered() — the wrapper extracts the exit code (including Unix signal handling via 128+signal), tracks savings, and returns Ok(exit_code). Module authors just return the result.

Streaming filters (line-by-line):

Use runner::run_streamed() when the command is long-running or produces unbounded output that should be filtered line-by-line. Three levels of abstraction, from simplest to most flexible:

Level 1: RegexBlockFilter — regex start pattern + indent continuation (3-5 lines)

For block-based errors where blocks start with a regex match and continue on indented lines. Handles skip prefixes, block counting, and summary automatically.

use crate::core::stream::{BlockStreamFilter, RegexBlockFilter};

pub fn run(args: &[String], verbose: u8) -> Result<i32> {
    let mut cmd = resolved_command("mycmd");
    for arg in args { cmd.arg(arg); }

    let filter = RegexBlockFilter::new("mycmd", r"^error\[")
        .skip_prefixes(&["warning:", "note:"]);

    runner::run_streamed(
        cmd, "mycmd", &args.join(" "),
        Box::new(BlockStreamFilter::new(filter)),
        runner::RunOptions::with_tee("mycmd"),
    )
}

RegexBlockFilter provides: regex-based block start detection, indent-based continuation (space/tab), configurable line skipping via prefixes, and automatic summary ("mycmd: 3 blocks in output" or "mycmd: no errors found").

Level 2: BlockHandler trait — custom block detection with state tracking

When you need custom block start/continuation logic or stateful parsing beyond regex + indent. Implement the BlockHandler trait and wrap in BlockStreamFilter.

use crate::core::stream::{BlockHandler, BlockStreamFilter};

struct MyHandler { error_count: usize }

impl BlockHandler for MyHandler {
    fn should_skip(&mut self, line: &str) -> bool { line.is_empty() }
    fn is_block_start(&mut self, line: &str) -> bool {
        if line.starts_with("FAIL") { self.error_count += 1; true } else { false }
    }
    fn is_block_continuation(&mut self, line: &str, _block: &[String]) -> bool {
        line.starts_with("  ") || line.starts_with("at ")
    }
    fn format_summary(&self, _exit_code: i32, _raw: &str) -> Option<String> {
        Some(format!("{} failures\n", self.error_count))
    }
}

See cmds/rust/cargo_cmd.rs::CargoBuildHandler and cmds/js/tsc_cmd.rs::TscHandler for production examples.

Level 3: StreamFilter trait — full line-by-line control

When block-based parsing doesn't fit (e.g., state machines, multi-phase output, line transforms). Implement StreamFilter directly.

use crate::core::stream::StreamFilter;

struct MyFilter { state: State }

impl StreamFilter for MyFilter {
    fn feed_line(&mut self, line: &str) -> Option<String> {
        // Return Some(text) to emit, None to suppress
        if line.contains("error") { Some(format!("{}\n", line)) } else { None }
    }
    fn flush(&mut self) -> String { String::new() }
    fn on_exit(&mut self, exit_code: i32, raw: &str) -> Option<String> { None }
}

See cmds/rust/runner.rs::ErrorStreamFilter for a complete reference implementation (state machine that tracks error blocks across lines).

Example — passthrough command (no filtering):

pub fn run_passthrough(args: &[OsString], verbose: u8) -> Result<i32> {
    runner::run_passthrough("mycmd", args, verbose)
}

Example — manual execution (custom logic):

pub fn run(args: &[String], verbose: u8) -> Result<i32> {
    let output = resolved_command("mycmd").args(args)
        .output().context("Failed to run mycmd")?;
    let exit_code = exit_code_from_output(&output, "mycmd");
    // ... custom filtering, tracking ...
    Ok(exit_code)
}

Modules with deviations (subcommand dispatch, parser trait systems, two-command fallback, synthetic output).

Cross-Command Dependencies

lint_cmd routes to mypy_cmd or ruff_cmd when detecting Python projects
format_cmd routes to prettier_cmd or ruff_cmd depending on the formatter detected
gh_cmd imports compact_diff() from git for diff formatting (markdown helpers are defined in gh_cmd itself)

Cross-Cutting Behavior Contracts

These behaviors must be uniform across all command modules. Full audit details in docs/ISO_ANALYZE.md.

Exit Code Propagation

All module run() functions return Result<i32> where the i32 is the underlying command's exit code. main.rs calls std::process::exit(code) once at the single exit point — modules never call process::exit() directly.

Return value	Meaning	Who exits
`Ok(0)`	Command succeeded	`main.rs` exits 0
`Ok(N)`	Command failed with code N	`main.rs` exits N
`Err(e)`	RTK itself failed (not the command)	`main.rs` prints error, exits 1

How exit codes are extracted:

Execution style	Helper	Signal handling
`cmd.output()` (filtered)	`exit_code_from_output(&output, "tool")`	128+signal on Unix
`cmd.status()` (passthrough)	`exit_code_from_status(&status, "tool")`	128+signal on Unix
`run_filtered()` (wrapper)	Automatic — no manual code needed	Built-in

When using run_filtered(): exit code handling is fully automatic. The wrapper extracts the exit code, handles signals, and returns Ok(exit_code). Module authors just return the wrapper's result — no exit code logic needed.

When doing manual execution: use exit_code_from_output() or exit_code_from_status() and return Ok(exit_code). Never call process::exit(), never use .code().unwrap_or(1) (loses signal info).

Filter Failure Passthrough

When filtering fails, fall back to raw output and warn on stderr. Never block the user.

Tee Recovery

Modules that parse structured output (JSON, NDJSON, state machines) must call tee::tee_and_hint() so users can recover full output on failure.

Stderr Handling

Modules must capture stderr and include it in the raw string passed to timer.track(), so token savings reflect total output.

Tracking Completeness

All modules must call timer.track() on every path — success, failure, and fallback. Since modules return Ok(exit_code) instead of calling process::exit(), tracking always runs before the program exits.

Verbose Flag

All modules accept verbose: u8. Use it to print debug info (command being run, savings %, filter tier). Do not accept and ignore it.

Adding a New Command Filter

Adding a new filter or command requires changes in multiple places. For TOML-vs-Rust decision criteria, see CONTRIBUTING.md.

Rust module (structured output, flag injection, state machines)

Create module in src/cmds/<ecosystem>/mycmd_cmd.rs:
- Write the filter_mycmd() function (pure: &str -> String, no side effects)
- Write pub fn run(...) -> Result<i32> using runner::run_filtered() — build the Command, choose RunOptions, delegate
- Use RunOptions::stdout_only() when the filter parses structured stdout (JSON, NDJSON) — stderr would corrupt parsing
- Use RunOptions::default() when filtering combined text output
- Add .tee("label") when the filter parses structured output (enables raw output recovery on failure)
- Exit codes: handled automatically by run_filtered() — just return its result
Register module:
- Ecosystem mod.rs files use automod::dir!() — any .rs file in the directory becomes a public module automatically. No manual pub mod needed, but be aware: WIP or helper files will also be exposed. Only commit command-ready modules.
- Add variant to Commands enum in main.rs with #[arg(trailing_var_arg = true, allow_hyphen_values = true)]
- Add routing match arm in main.rs: Commands::Mycmd { args } => mycmd_cmd::run(&args, cli.verbose)?,
Add rewrite pattern — Entry in src/discover/rules.rs (PATTERNS + RULES arrays at matching index) so hooks auto-rewrite the command
Write tests — Real fixture, snapshot test, token savings >= 60% (see testing rules)
Update docs — Ecosystem README (CHANGELOG.md is auto-generated by release-please)

TOML filter (simple line-based filtering)

Create filter in src/filters/
Add rewrite pattern in src/discover/rules.rs
Write tests and update docs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Command Filter Modules

Scope

When to Write a Rust Module (vs TOML Filter)

Purpose

Ecosystems

Execution Flow

Filter modes

When to use which

Phases

Cross-Command Dependencies

Cross-Cutting Behavior Contracts

Exit Code Propagation

Filter Failure Passthrough

Tee Recovery

Stderr Handling

Tracking Completeness

Verbose Flag

Adding a New Command Filter

Rust module (structured output, flag injection, state machines)

TOML filter (simple line-based filtering)

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Command Filter Modules

Scope

When to Write a Rust Module (vs TOML Filter)

Purpose

Ecosystems

Execution Flow

Filter modes

When to use which

Phases

Cross-Command Dependencies

Cross-Cutting Behavior Contracts

Exit Code Propagation

Filter Failure Passthrough

Tee Recovery

Stderr Handling

Tracking Completeness

Verbose Flag

Adding a New Command Filter

Rust module (structured output, flag injection, state machines)

TOML filter (simple line-based filtering)