chore: Updating agents instructions + Adding Claude Code ones.

nicolasnoble · nicolasnoble · commit 5fc75516fda9 · 2025-12-17T16:38:23.000-08:00
Signed-off-by: Nicolas Pixel Noble &lt;pixel@nobis-crew.org&gt;
diff --git a/.cursor/rules/rust.mdc b/.cursor/rules/rust.mdc
@@ -32,8 +32,33 @@ The project is split in 3 separate crates:
 2. `common`: Provides shared utilities and data structures for the model. Any constant definitions should be placed here. As much as possible, any shared logic should also be placed here.
 3. `server`: Implements the server-side logic and API endpoints for ModelExpress in a stand alone server.
 
+## Adding CLI Arguments
+
+Client CLI arguments are defined in a shared struct to avoid duplication:
+
+1. **Add to `ClientArgs`** in `modelexpress_common/src/client_config.rs`:
+   - This is the single source of truth for shared arguments
+   - Use `#[arg(long, env = "MODEL_EXPRESS_...")]` for environment variable support
+   - Do NOT use `-v` short flag (reserved for CLI's verbose)
+
+2. **Update `ClientConfig::load()`** in the same file:
+   - Add override logic in the "APPLY CLI ARGUMENT OVERRIDES" section
+
+3. **Do NOT duplicate in `Cli`** (`modelexpress_client/src/bin/modules/args.rs`):
+   - `Cli` embeds `ClientArgs` via `#[command(flatten)]`
+   - Only add CLI-specific arguments there (e.g., `--format`, `--verbose`)
+
+4. **Add tests** in the `tests` module of `client_config.rs`
+
 # Code quality
 
 - Do **NOT** use emojis. These are unprofessional.
 - Do not create markdown files to document code changes or decisions.
 - Do not over-comment code. Removing code is fine without adding new comments to explain why.
+
+# AI Agent Instructions
+
+When introducing new patterns, conventions, or architectural decisions that affect how code should be written, update ALL AI agent instruction files:
+- `CLAUDE.md` (Claude Code)
+- `.github/copilot-instructions.md` (GitHub Copilot)
+- `.cursor/rules/rust.mdc` (Cursor)
diff --git a/.github/copilot-instructions.md b/.github/copilot-instructions.md
@@ -30,8 +30,33 @@ The project is split in 3 separate crates:
 2. `common`: Provides shared utilities and data structures for the model. Any constant definitions should be placed here. As much as possible, any shared logic should also be placed here.
 3. `server`: Implements the server-side logic and API endpoints for ModelExpress in a stand alone server.
 
+## Adding CLI Arguments
+
+Client CLI arguments are defined in a shared struct to avoid duplication:
+
+1. **Add to `ClientArgs`** in `modelexpress_common/src/client_config.rs`:
+   - This is the single source of truth for shared arguments
+   - Use `#[arg(long, env = "MODEL_EXPRESS_...")]` for environment variable support
+   - Do NOT use `-v` short flag (reserved for CLI's verbose)
+
+2. **Update `ClientConfig::load()`** in the same file:
+   - Add override logic in the "APPLY CLI ARGUMENT OVERRIDES" section
+
+3. **Do NOT duplicate in `Cli`** (`modelexpress_client/src/bin/modules/args.rs`):
+   - `Cli` embeds `ClientArgs` via `#[command(flatten)]`
+   - Only add CLI-specific arguments there (e.g., `--format`, `--verbose`)
+
+4. **Add tests** in the `tests` module of `client_config.rs`
+
 # Code quality
 
 - Do **NOT** use emojis. These are unprofessional.
 - Do not create markdown files to document code changes or decisions.
 - Do not over-comment code. Removing code is fine without adding new comments to explain why.
+
+# AI Agent Instructions
+
+When introducing new patterns, conventions, or architectural decisions that affect how code should be written, update ALL AI agent instruction files:
+- `CLAUDE.md` (Claude Code)
+- `.github/copilot-instructions.md` (GitHub Copilot)
+- `.cursor/rules/rust.mdc` (Cursor)
diff --git a/CLAUDE.md b/CLAUDE.md
@@ -0,0 +1,103 @@
+# CLAUDE.md
+
+This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
+
+## Build and Development Commands
+
+```bash
+# Build the project
+cargo build
+
+# Build in release mode
+cargo build --release
+
+# Run the server
+cargo run --bin modelexpress-server
+
+# Run tests
+cargo test
+
+# Run integration tests (starts server, runs test client)
+./run_integration_tests.sh
+
+# Run a specific test client
+cargo run --bin test_client -- --test-model "google-t5/t5-small"
+
+# Run clippy (required before submitting code)
+cargo clippy
+
+# Generate sample configuration file
+cargo run --bin config_gen -- --output model-express.yaml
+```
+
+## Architecture
+
+ModelExpress is a Rust-based model cache management service that accelerates inference by caching HuggingFace models. It can be deployed standalone or as a sidecar alongside inference solutions like NVIDIA Dynamo.
+
+### Workspace Structure
+
+The project is a Rust workspace with three crates:
+
+- **`modelexpress_server`** (`modelexpress-server`): gRPC server providing model services
+  - `services.rs`: Implements `HealthService`, `ApiService`, and `ModelService` gRPC services
+  - `database.rs`: SQLite-based model status persistence via `ModelDatabase`
+  - `cache.rs`: Cache eviction and management
+  - Uses global `MODEL_TRACKER` (`LazyLock<ModelDownloadTracker>`) for tracking download state
+
+- **`modelexpress_client`** (`modelexpress-client`): Client library and CLI tool
+  - `lib.rs`: Main `Client` struct with gRPC clients for health, API, and model services
+  - `bin/cli.rs`: HuggingFace CLI replacement for model downloads
+  - Supports automatic fallback to direct download when server unavailable
+
+- **`modelexpress_common`** (`modelexpress-common`): Shared code and protobuf definitions
+  - `grpc/` module contains generated proto code (health, api, model)
+  - `providers/huggingface.rs`: HuggingFace download implementation
+  - `download.rs`: Provider-agnostic download orchestration
+  - `cache.rs`, `config.rs`, `client_config.rs`: Configuration types
+
+### gRPC Services
+
+Protocol definitions are in `modelexpress_common/proto/`:
+- `health.proto`: Health check endpoint
+- `api.proto`: Generic request/response API
+- `model.proto`: Model download with streaming status updates
+
+### Key Patterns
+
+- Download status tracked in SQLite database with compare-and-swap for concurrent request handling
+- Streaming gRPC responses for download progress updates via `ModelStatusUpdate`
+- `CacheConfig::discover()` finds cache configuration from environment or config files
+- Configuration layering: CLI args > environment variables > config files > defaults
+
+### Adding CLI Arguments
+
+Client CLI arguments and environment variables are defined in a shared struct to avoid duplication:
+
+1. **`ClientArgs`** in `modelexpress_common/src/client_config.rs`:
+   - Single source of truth for shared client arguments (endpoint, timeout, cache settings, etc.)
+   - Add new arguments here with `#[arg(long, env = "MODEL_EXPRESS_...")]`
+   - Avoid `-v` short flag (reserved for CLI's verbose)
+
+2. **`ClientConfig::load()`** in the same file:
+   - Apply the new argument to the config struct in the "APPLY CLI ARGUMENT OVERRIDES" section
+
+3. **`Cli`** in `modelexpress_client/src/bin/modules/args.rs`:
+   - Embeds `ClientArgs` via `#[command(flatten)]`
+   - Only add CLI-specific arguments here (e.g., `--format`, `--verbose`)
+
+4. **Tests**: Add tests in `client_config.rs` for argument parsing and config loading
+
+## Code Standards
+
+- **No `unwrap()`**: Strictly forbidden except in benchmarks. Use `match`, `?`, or `expect()` (tests only)
+- **All dependencies in root `Cargo.toml`**: Sub-crates use workspace dependencies exclusively
+- **Clippy enforced**: `cargo clippy` must pass with no warnings (multiple lints set to deny)
+- **No emojis in code**
+- **No markdown documentation files for code changes**
+
+## AI Agent Instructions
+
+When introducing new patterns, conventions, or architectural decisions that affect how code should be written, update ALL AI agent instruction files:
+- `CLAUDE.md` (Claude Code)
+- `.github/copilot-instructions.md` (GitHub Copilot)
+- `.cursor/rules/rust.mdc` (Cursor)