-
-
Notifications
You must be signed in to change notification settings - Fork 12
Rest Api
bsevern edited this page Apr 6, 2026
·
1 revision
GoldenMatch includes a local HTTP server for real-time matching, cluster browsing, and data steward review.
| Method | Path | Description |
|---|---|---|
GET |
/health |
Health check |
GET |
/stats |
Dataset statistics |
POST |
/match |
Match a single record |
POST |
/match/batch |
Match multiple records |
POST |
/explain |
Explain why two records match |
GET |
/clusters |
List all clusters |
GET |
/clusters/<id> |
Get cluster detail |
GET |
/reviews |
Review queue (borderline pairs) |
GET |
/reviews/decisions |
List completed review decisions |
POST |
/reviews/decide |
Approve or reject a pair |
curl http://localhost:8080/stats{
"total_records": 5000,
"total_clusters": 847,
"singleton_count": 3200,
"match_rate": 0.12,
"avg_cluster_size": 2.1,
"max_cluster_size": 8
}curl -X POST http://localhost:8080/explain \
-H "Content-Type: application/json" \
-d '{"id_a": 42, "id_b": 108}'{
"overall_score": 0.92,
"explanation": "Strong match: name similarity 0.88 (jaro_winkler), zip exact match.",
"field_scores": [
{"field": "name", "scorer": "jaro_winkler", "score": 0.88},
{"field": "zip", "scorer": "exact", "score": 1.0}
]
}The review queue surfaces borderline pairs for data steward approval or rejection.
curl http://localhost:8080/reviews{
"reviews": [
{
"id": "rev_001",
"id_a": 42,
"id_b": 300,
"score": 0.82,
"record_a": {"name": "John Smith", "zip": "10001"},
"record_b": {"name": "J. Smithson", "zip": "10001"}
}
]
}curl -X POST http://localhost:8080/reviews/decide \
-H "Content-Type: application/json" \
-d '{"id": "rev_001", "decision": "approve"}'Decision values: "approve" or "reject".
curl http://localhost:8080/reviews/decisionsdocker run --rm -p 8080:8080 -v $(pwd):/data \
ghcr.io/benzsevern/goldenmatch:latest \
serve --file /data/customers.csv --port 8080⚡ GoldenMatch — Entity resolution toolkit | PyPI | GitHub | Open in Colab | MIT License
🟡 Golden Suite (Monorepo)
Suite Packages
- GoldenCheck · data quality
- GoldenFlow · transforms
- GoldenPipe · orchestrator
- InferMap · schema mapping
Getting Started
- Installation
- Quick Start
- Auto-Config Controller · enhanced through v1.12
- Configuration
- Verification · new in v1.5
- CLI Reference
Core Concepts
AI Integration
Advanced
- PPRL
- Domain Packs
- Streaming / CDC
- Database Integration
- GPU & Vertex AI
- REST API
- Interactive TUI
- Web UI · new in v1.7
- Evaluation
Reference
pip install goldenmatch
npm install goldenmatch