User Guide: Code Cartographer

Introduction

Code Cartographer is a comprehensive static analysis tool designed to help you understand complex Python codebases by mapping the relationships between functions, variables, classes, and other code elements. This guide will walk you through how to use the enhanced version of Code Cartographer to analyze your Python projects.

Installation

Clone the repository:

git clone https://github.com/stochastic-sisyphus/code-cartographer.git
cd code-cartographer

Install dependencies (recommended: use mise):

# With mise (recommended)
mise trust
mise install
mise run install

# Or manually with pip
pip install -e .
pip install -e ".[dev]"  # For development

Install system dependencies:

# For Ubuntu/Debian
sudo apt-get install graphviz graphviz-dev

# For macOS
brew install graphviz

# For Windows
# Download and install from https://graphviz.org/download/

Running Code Cartographer

Basic Usage

The simplest way to analyze your codebase is to use the provided shell script:

./analyze_codebase.sh /path/to/your/project

This will:

Analyze your codebase
Generate visualizations
Create comprehensive reports
Save all outputs to the analysis_output directory

Advanced Usage

For more control over the analysis process, you can use the Python module directly:

from code_cartographer.core.analyzer import ProjectAnalyzer, generate_markdown, generate_dependency_graph
from pathlib import Path

# Initialize the analyzer
analyzer = ProjectAnalyzer(
    project_root=Path("/path/to/your/project"),
    exclude_patterns=["tests/", "venv/", "build/"]
)

# Run the analysis
analysis = analyzer.execute()

# Generate reports
report_path = Path("/path/to/output/analysis.md")
generate_markdown(analysis, report_path)

# Generate dependency graph
graph_path = Path("/path/to/output/dependencies.dot")
generate_dependency_graph(analysis["dependencies"], graph_path)

print(f"Report generated at: {report_path}")
print(f"Dependency graph generated at: {graph_path}")

Understanding the Results

Code Analysis Report

The main output is a comprehensive Markdown report (code_analysis_report.md) that includes:

Summary Statistics
- Total files analyzed
- Function, class, and variable counts
- Orphaned code elements
- Code complexity metrics
Orphaned Code
- Functions defined but never called
- Classes defined but never instantiated
- Variables defined but never used
Code Variants
- Similar implementations across the codebase
- Potential refactoring opportunities
Dependency Analysis
- Initialization order requirements
- Circular dependencies
- Prerequisite relationships
Variable Usage
- Variable definitions and usages
- Scope analysis
- Redefinition detection

Visualizations

The tool generates several visualizations to help you understand your codebase:

Call Graph
- Shows which functions call which other functions
- Bidirectional relationships
- Entry points and leaf nodes
Dependency Graph
- Shows prerequisites between code elements
- Helps identify initialization order requirements
Variable Usage Chart
- Shows where variables are defined and used
- Highlights orphaned variables
Class Hierarchy
- Visualizes inheritance relationships
- Shows method overrides
Initialization Sequence
- Suggests a safe order for initializing components
- Handles circular dependencies

Interpreting the Results

Identifying Code Issues

Look for:

Orphaned Functions/Classes: These might be dead code that can be removed
Circular Dependencies: These can cause initialization problems
Variables with Multiple Definitions: Potential naming conflicts
High Complexity Metrics: Functions that might need refactoring

Improving Code Structure

Use the dependency analysis to:

Reorganize code to reduce circular dependencies
Identify modules that should be split
Find opportunities for better encapsulation

Cleaning Up Code

The orphan analysis helps you:

Remove unused code
Identify forgotten implementations
Clean up unused variables

Advanced Features

Custom Exclusion Patterns

You can specify patterns to exclude from analysis:

analyzer.analyze(exclude_patterns=[
    r"\.git/",
    r"\.venv/",
    r"__pycache__/",
    r"tests/",
    r"examples/"
])

Focus on Specific Areas

To analyze only certain parts of your codebase:

analyzer.analyze(focus_paths=[
    "src/core/",
    "src/utils/important_module.py"
])

Custom Reporting

Generate specialized reports:

# Generate a report focusing on orphaned code
analyzer.generate_orphan_report(results)

# Generate a report on variable usage
analyzer.generate_variable_report(results)

Troubleshooting

Common Issues

Missing Dependencies
- Ensure all Python dependencies are installed
- Verify Graphviz is installed correctly
Memory Issues with Large Codebases
- Analyze smaller portions of the codebase
- Increase available memory
Visualization Errors
- Check Graphviz installation
- Try different output formats (PNG, SVG, PDF)
Incorrect Analysis Results
- Verify exclusion patterns aren't too broad
- Check for dynamic code generation or eval usage

Getting Help

If you encounter issues or have questions:

Check the GitHub repository for updates
Open an issue with detailed information about your problem
Consult the API documentation for advanced usage

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

User Guide: Code Cartographer

Introduction

Installation

Running Code Cartographer

Basic Usage

Advanced Usage

Understanding the Results

Code Analysis Report

Visualizations

Interpreting the Results

Identifying Code Issues

Improving Code Structure

Cleaning Up Code

Advanced Features

Custom Exclusion Patterns

Focus on Specific Areas

Custom Reporting

Troubleshooting

Common Issues

Getting Help

FilesExpand file tree

guide.md

Latest commit

History

guide.md

File metadata and controls

User Guide: Code Cartographer

Introduction

Installation

Running Code Cartographer

Basic Usage

Advanced Usage

Understanding the Results

Code Analysis Report

Visualizations

Interpreting the Results

Identifying Code Issues

Improving Code Structure

Cleaning Up Code

Advanced Features

Custom Exclusion Patterns

Focus on Specific Areas

Custom Reporting

Troubleshooting

Common Issues

Getting Help