Contributing to lazy-cat

Thanks for being here. This project lives or dies on real evidence, so the contribution bar is simple: real outputs, real numbers, nothing simulated.

By contributing you agree to our code of conduct. We respond to issues and PRs within 5 working days.

The most useful contribution: a benchmark

A benchmark is a real task where Claude defaulted to the expensive path when a better one existed. If you've seen it happen in your own sessions, it belongs here.

Run the task twice in a real session — once without lazy-cat, once with. Interactive sessions, not claude -p one-shots: the failure modes lazy-cat fixes (wrong strategy, scope creep) barely show up in print mode, so print-mode comparisons under-measure.
Save the actual outputs — full code, not summaries or pseudocode.
Count characters, estimate tokens at chars ÷ 4.
Copy the format from any file in tests/ and open a PR with tests/NN-benchmark-your-scenario.md (NN = next free number).

A good benchmark has: a task someone would actually give Claude, real code for both conditions, and a clear winner with a reason. Check tests/summary.md for what's already covered — duplicates of an existing scenario are only interesting if they contradict it.

Negative results are welcome. A task where lazy-cat made things worse (or did nothing) is as valuable as a win — we publish those too, like the dark-mode case where surgical alone beat both skills combined.

Contributing a skill

New skills carry a real cost: every installed skill adds input tokens to every session. So the bar is higher than "it seems useful":

Target a failure mode the current skills don't cover. Open an issue first describing the failure mode with at least one real transcript — let's agree it's real before you write the skill.
Write the SKILL.md following the existing two as templates. The description field decides when your skill fires — make trigger and non-trigger conditions concrete (request shapes, size thresholds), not vibes.
Benchmark it in real sessions: at least 3 benchmark files showing ≥2× median improvement on its target failure mode, plus a check that existing benchmarks don't regress when your skill is installed alongside think-twice and surgical.
Include the trivial-task negative test: show your skill correctly does nothing on a task below its threshold. A skill that fires on everything is a tax, not a tool.

Code contributions (installer, stats, tooling)

git clone https://github.com/albertobarnabo/lazy-cat
cd lazy-cat
npm test          # installer + stats test suites, no network needed

Keep diffs minimal — this repo eats its own dog food. Build exactly what the issue asks for, note what you deliberately left out in the PR description.

Other ways to help

Verified results in a language we haven't tested (Go, Rust, Swift, Java)
Platform-specific install instructions (Cursor, Codex CLI, Gemini CLI)
A real-world saving from your own codebase — even a one-liner in an issue
Your lazy-cat stats week-over-week numbers (see the README) — real-world usage data beats benchmarks

The best contributions, like the best code, do exactly what's needed — nothing more.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Contributing to lazy-cat

The most useful contribution: a benchmark

Contributing a skill

Code contributions (installer, stats, tooling)

Other ways to help

FilesExpand file tree

CONTRIBUTING.md

Latest commit

History

CONTRIBUTING.md

File metadata and controls

Contributing to lazy-cat

The most useful contribution: a benchmark

Contributing a skill

Code contributions (installer, stats, tooling)

Other ways to help