-
-
Notifications
You must be signed in to change notification settings - Fork 11
CLI Reference
GoldenMatch provides 23 CLI commands via goldenmatch <command>. All commands support --help.
pip install goldenmatch
goldenmatch --versionMatch a target file against reference files.
goldenmatch match targets.csv --against reference.csv --config config.yaml --output-allLaunch the interactive terminal UI.
goldenmatch interactive customers.csv
goldenmatch interactive customers.csv --config config.yamlMatch new CSV records against an existing base dataset.
goldenmatch incremental base.csv --new new_records.csv --config config.yamlHandles exact matchkeys via Polars join and fuzzy matchkeys via match_one brute-force.
Build ground truth by labeling record pairs interactively. Type y (match), n (no match), or s (skip).
goldenmatch label customers.csv --config config.yaml --gt ground_truth.csvStart the MCP server for Claude Desktop integration.
goldenmatch mcp-serve --file customers.csv --config config.yamlSee MCP for tool details.
Explain why two records matched.
goldenmatch explain ID_A ID_B --run-dir results/Undo a previous merge run.
goldenmatch rollback RUN_ID --run-dir results/List previous runs available for rollback.
goldenmatch runs --run-dir results/Detect fake emails, placeholder data, and suspicious records.
goldenmatch anomaly customers.csvGenerate a before/after data quality dashboard.
goldenmatch dashboard --run-dir results/ --output dashboard.htmlWatch a database table and match new records continuously.
goldenmatch watch --table customers --connection-string "$DATABASE_URL" --interval 30
# Daemon mode with health endpoint and PID file
goldenmatch watch --table customers --connection-string "$DATABASE_URL" --daemon| Flag | Available On | Description |
|---|---|---|
--config, -c
|
dedupe, match | Path to YAML config file |
--output-all |
dedupe, match | Write golden, dupes, unique, lineage |
--output-dir |
dedupe, match | Output directory |
--llm-scorer |
dedupe | Enable LLM scoring for borderline pairs |
--llm-boost |
dedupe | LLM-labeled training + fine-tuning |
--backend ray |
dedupe, match | Use Ray distributed backend |
--preview |
dedupe | Show merge preview before writing |
--anomalies |
dedupe | Run anomaly detection |
--dashboard |
dedupe | Generate HTML dashboard |
--html-report |
dedupe | Generate HTML match report |
--diff |
dedupe | Generate diff report |
--chunked |
dedupe | Process in chunks for large files |
⚡ GoldenMatch — Entity resolution toolkit | PyPI | GitHub | Open in Colab | MIT License
🟡 Golden Suite (Monorepo)
Suite Packages
- GoldenCheck · data quality
- GoldenFlow · transforms
- GoldenPipe · orchestrator
- InferMap · schema mapping
Getting Started
- Installation
- Quick Start
- Auto-Config Controller · enhanced through v1.12
- Configuration
- Verification · new in v1.5
- CLI Reference
Core Concepts
AI Integration
Advanced
- PPRL
- Domain Packs
- Streaming / CDC
- Database Integration
- GPU & Vertex AI
- REST API
- Interactive TUI
- Web UI · new in v1.7
- Evaluation
Reference
pip install goldenmatch
npm install goldenmatch