CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Overview

Project Dashboard is a Rails 8.2 application that automatically discovers, analyzes, and tracks all active git repositories in your local development environment. It provides a centralized view via a CLI interface using Rake tasks, extracting metadata about tech stack, deployment status, and project state.

Architecture: SQLite database + ActiveRecord models + Plain Ruby classes (lib/) + Rake tasks interface. No web UI yet (planned for v1.1).

Tech Stack

Ruby: 3.4+
Rails: 8.2 (main branch)
Database: SQLite3
Frontend (future): Tailwind 4, Hotwire (Turbo + Stimulus)
Background Jobs: Solid Queue
Cache: Solid Cache
Asset Pipeline: Propshaft + Importmap

Essential Commands

Setup

bin/setup                    # Initial setup
bin/rails db:migrate         # Run migrations

Scanning Projects

bin/rake projects:scan                           # Scan and save to DB
DRY_RUN=true bin/rake projects:scan             # Preview without saving
SCAN_ROOT_PATH=/custom/path bin/rake projects:scan  # Custom directory
SCAN_CUTOFF_DAYS=180 bin/rake projects:scan     # 6 months instead of 8
bin/rake projects:config                         # Show configuration

Rails Console

bin/rails console           # Access models and query data
# Example queries:
# Project.order(last_commit_date: :desc).first(10)
# Project.where("metadata ->> 'inferred_type' = ?", "rails-app")

Testing

bin/rails test              # Run test suite
bin/rails test:system       # Run system tests

Code Quality

bin/rubocop                 # Ruby linting (omakase style)
bin/brakeman                # Security scanning
bundle audit                # Check for gem vulnerabilities

Development Server

bin/dev                     # Start Rails server + Tailwind watch
bin/rails server            # Rails server only

Core Architecture

Data Flow

Rake Task (lib/tasks/projects.rake)
    ↓
ProjectScanner (lib/project_scanner.rb)
    ├─ Finds all .git directories recursively
    ├─ Filters by commit date (default: 8 months)
    └─ For each repo:
        ↓
    ProjectData (lib/project_data.rb)
        ├─ Extracts git metadata (commits, authors, dates)
        ├─ Detects tech stack (Gemfile, package.json, etc.)
        ├─ Infers project type (rails-app, node-app, etc.)
        ├─ Finds reference files (README, CLAUDE.md, TODO.md)
        ├─ Assesses current state (active, paused, WIP)
        └─ Returns structured metadata
        ↓
    Project Model (app/models/project.rb)
        └─ Upserts to SQLite (find_or_initialize_by path)

Key Components

lib/project_scanner.rb

Recursive directory traversal to find .git repos
Filters repos by commit recency
Skips irrelevant directories (node_modules, hidden dirs)
Orchestrates scanning loop and database persistence
Prints progress indicators and summary reports

lib/project_data.rb

Extracts all metadata for a single repository
Runs git commands safely (captures stderr)
Detects tech stack by checking signature files (Gemfile, package.json, etc.)
Infers project type, deployment status, and current state
Parses documentation files for descriptions
Graceful error handling per extraction step

app/models/project.rb

ActiveRecord model with validations
Upsert logic via create_or_update_from_data(project_data)
JSON metadata column for flexible schema

Database Schema:

create_table :projects do |t|
  t.string :path              # Unique, indexed (repository path)
  t.string :name              # Project name (directory basename)
  t.string :last_commit_date  # Indexed for sorting
  t.text :last_commit_message
  t.json :metadata            # All extracted data (tech_stack, commits, etc.)
  t.timestamps
end

Metadata Structure

The metadata JSON column stores:

last_commit_author - Most recent committer
recent_commits - Array of last 10 commits (date + message)
commit_count_8m - Commits in last 8 months
contributors - All unique authors
reference_files - Hash of docs by category (root, ai, cursor, docs)
description - Extracted from README/CLAUDE.md
current_state - "active (committed 2 days ago), 3 open tasks"
tech_stack - Array: ["ruby", "rails", "node"]
inferred_type - rails-app | node-app | python-app | docs | script | unknown
deployment_status - Indicators from Dockerfile, deploy scripts, etc.
nested_repos - Subdirectories with their own .git
errors - Array of non-fatal errors during extraction

Configuration

Environment Variables

SCAN_ROOT_PATH - Root directory to scan (default: ~/Development)
SCAN_CUTOFF_DAYS - Skip projects older than N days (default: 240 = 8 months)
DRY_RUN - Set to "true" to scan without saving (default: false)

Autoloading

lib/ directory is autoloaded (except tasks/)
ProjectScanner and ProjectData are available in rake tasks and console
Configured in config/application.rb:29 via config.autoload_lib(ignore: %w[assets tasks])

Tech Stack Detection Logic

Signature files checked (in ProjectData#detect_tech_stack):

Ruby/Rails: Gemfile (+ check for rails gem), config/routes.rb
Node/JS: package.json (+ check for react, next, vue)
Python: requirements.txt or pyproject.toml
Go: go.mod

Project type inference (in ProjectData#infer_project_type):

rails-app if Rails detected
node-app if Node detected (no Rails)
python-app if Python detected
docs if docs/ directory has files
script if .rb or .js files in root
unknown otherwise

Development Patterns

Adding New Metadata Fields

Update ProjectData#extract_metadata to extract new data
No migration needed (JSON column)
Access via project.metadata['field_name'] or SQL: metadata ->> 'field_name'

Example:

# In lib/project_data.rb
def extract_metadata
  # ... existing fields ...
  @metadata[:git_remote] = run_git_command("config --get remote.origin.url")
end

# Query in console/code
Project.where("metadata ->> 'git_remote' LIKE ?", "%github.com%")

Adding New Tech Stack

Update ProjectData#detect_tech_stack to check for new files
Optionally update infer_project_type if new category needed

Example:

def detect_tech_stack
  stack = []
  # ... existing checks ...
  stack << "rust" if File.exist?(File.join(@path, "Cargo.toml"))
  stack.uniq
end

Error Handling Philosophy

Repository-level: Git failures are caught per-repo; repo is skipped but scan continues
Graceful degradation: Missing files fallback to defaults (e.g., directory name if no README)
Error tracking: Non-fatal errors stored in metadata['errors'] array
User feedback: Progress indicators (✓ success, ⊗ skipped, ✗ error)

Performance Notes

Sequential processing: ~1-2 repos/second (no parallelization yet)
File read limits: Max 1000 lines, skip files > 500KB
Directory pruning: Aggressively skips node_modules, .git internals, hidden dirs
Database: Indexed queries on path and last_commit_date; JSON operators for metadata
Git caching: Git's own object cache helps with repeated commands
Future optimization: v1.2 will add incremental updates (only re-scan changed repos)

Common Queries

# All projects, most recent first
Project.order(last_commit_date: :desc)

# Rails projects only
Project.where("metadata ->> 'inferred_type' = ?", "rails-app")

# Active projects (< 7 days old)
Project.select { |p| p.metadata.dig('current_state')&.include?('active') }

# Projects with open TODOs
Project.select { |p| p.metadata.dig('current_state')&.include?('open task') }

# Count by type
Project.all.group_by { |p| p.metadata['inferred_type'] }.transform_values(&:count)

# Most active by commit count
Project.all.sort_by { |p| p.metadata['commit_count_8m'] || 0 }.reverse.first(10)

Future Development

v1.1 - Web UI:

Dashboard showing active projects (Hotwire + Tailwind)
Filters by type, tech stack, activity
Search by name, description
Click to open in editor/terminal

v1.2 - Incremental Updates:

Only re-scan changed repos (via mtime or git hooks)
Background job for scheduled scans (Solid Queue)

v1.3 - Enhanced Analytics:

Commit frequency graphs
Contributor leaderboards
Project relationships (shared contributors)

Design Decisions

Why SQLite? Local-first; no external dependencies; file-based portability; fast for < 10k projects.

Why JSON metadata? Schema flexibility during rapid development; easy to add fields without migrations; SQLite has good JSON operators.

Why shell git commands vs git gems? Standard across systems; no dependencies; faster for simple ops; more portable.

Why separate ProjectData from Project? Separation of concerns (extraction vs persistence); ProjectData is plain Ruby with no Rails dependencies; easier to test; could extract to gem later.

Troubleshooting

"No repositories found"

Verify SCAN_ROOT_PATH is correct
Check that directories contain .git folders
Consider if SCAN_CUTOFF_DAYS is too restrictive

"Projects not saving"

Run bin/rails db:migrate to ensure schema is current
Check write permissions on storage/ directory
Look for validation errors in rake task output

"Metadata missing fields"

Ensure git CLI is installed and in PATH
Check file read permissions in scanned directories
Inspect metadata['errors'] for specific failures

"Slow scanning"

Large repos with many files will be slower
Network-mounted directories add latency
Consider narrowing SCAN_ROOT_PATH to exclude large subtrees

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLAUDE.md

Overview

Tech Stack

Essential Commands

Setup

Scanning Projects

Rails Console

Testing

Code Quality

Development Server

Core Architecture

Data Flow

Key Components

Metadata Structure

Configuration

Environment Variables

Autoloading

Tech Stack Detection Logic

Development Patterns

Adding New Metadata Fields

Adding New Tech Stack

Error Handling Philosophy

Performance Notes

Common Queries

Future Development

Design Decisions

Troubleshooting

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

CLAUDE.md

Overview

Tech Stack

Essential Commands

Setup

Scanning Projects

Rails Console

Testing

Code Quality

Development Server

Core Architecture

Data Flow

Key Components

Metadata Structure

Configuration

Environment Variables

Autoloading

Tech Stack Detection Logic

Development Patterns

Adding New Metadata Fields

Adding New Tech Stack

Error Handling Philosophy

Performance Notes

Common Queries

Future Development

Design Decisions

Troubleshooting