Natural language interface to US Census data for healthcare strategy teams
Transform 6-week $50K demographic consulting into 6-second $297/month queries
Foundation Data • Quick Start • Get Early Access
- DuckDB 1.4 - New async API, MERGE statements, encryption support, query profiler
- MCP HTTP Transport - External client connections (Claude Desktop, Postman)
- Interactive Data Tables - TanStack Table with sorting, filtering, pagination, drill-down
- Chart Visualizations - Recharts bar/line charts with export controls
- Claude Agent SDK - Zod schema validation, parallel region comparison, conversational context
- Document Generation - Excel and PDF reports via MCP tools
- TRUE MCP IMPLEMENTATION: Production SQL validation layer with security policies
- SQL Injection Protection: Table/column allowlists, row limits, dangerous pattern blocking
- Interactive Results: Sortable tables, filterable data, drill-down navigation
- Parallel Queries: "Compare Tampa Bay vs Phoenix" runs both simultaneously
- Conversational Context: "Now filter to income > $75K" understands prior query
- Audit Compliance: All queries logged for HIPAA/GDPR
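The allowlist, row-limit, and pattern checks listed above could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not CensusChat's actual validator: `validateSql`, `SqlPolicy`, and the `county_data` table name in the usage note are hypothetical, only table names are checked (column allowlisting is omitted), and a production layer would more likely rely on a real SQL parser than on regular expressions.

```typescript
// Hypothetical policy shape; the real security policy format is not shown in this README.
interface SqlPolicy {
  allowedTables: readonly string[];
  maxRows: number;
}

interface ValidationResult {
  ok: boolean;
  sql?: string;    // the statement to execute, possibly with a LIMIT appended
  reason?: string; // why the statement was rejected
}

// Statement separators, comments, and anything that writes or changes schema.
const DANGEROUS: readonly RegExp[] = [
  /;/,
  /--/,
  /\/\*/,
  /\b(insert|update|delete|drop|alter|create|attach|copy|pragma)\b/i,
];

function validateSql(sql: string, policy: SqlPolicy): ValidationResult {
  // Tolerate one trailing semicolon; any other semicolon is rejected below.
  const trimmed = sql.trim().replace(/;\s*$/, "");
  if (!/^select\b/i.test(trimmed)) {
    return { ok: false, reason: "only SELECT statements are allowed" };
  }
  for (const pattern of DANGEROUS) {
    if (pattern.test(trimmed)) {
      return { ok: false, reason: "blocked pattern: " + pattern.source };
    }
  }

  // Every identifier that follows FROM or JOIN must be on the allowlist.
  const tableRe = /\b(?:from|join)\s+([a-z_][a-z0-9_]*)/gi;
  const tables: string[] = [];
  let match: RegExpExecArray | null;
  while ((match = tableRe.exec(trimmed)) !== null) {
    tables.push((match[1] ?? "").toLowerCase());
  }
  if (tables.length === 0) {
    return { ok: false, reason: "no table referenced" };
  }
  for (const table of tables) {
    if (policy.allowedTables.indexOf(table) === -1) {
      return { ok: false, reason: "table not allowed: " + table };
    }
  }

  // Enforce the row limit: append one if missing, reject if it is too large.
  const limit = /\blimit\s+(\d+)\s*$/i.exec(trimmed);
  if (limit === null) {
    if (/\blimit\b/i.test(trimmed)) {
      return { ok: false, reason: "unsupported LIMIT form" };
    }
    return { ok: true, sql: trimmed + " LIMIT " + policy.maxRows };
  }
  if (Number(limit[1]) > policy.maxRows) {
    return { ok: false, reason: "LIMIT exceeds " + policy.maxRows };
  }
  return { ok: true, sql: trimmed };
}
```

With a policy that allows only a hypothetical `county_data` table and 1,000 rows, `SELECT county_name FROM county_data` comes back with `LIMIT 1000` appended, while `DROP TABLE county_data` and `SELECT * FROM secrets` are rejected.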
Current Data:
- County Level: 3,144 US counties with demographics
- Block Group Level: 239,741 block groups with 84 variables
Healthcare strategy teams wait 6+ weeks and pay $50K+ for demographic analysis that CensusChat delivers in seconds for $297/month.
"Compare Medicare eligible seniors in Tampa Bay vs Phoenix with income over $75K"
→ Parallel queries, interactive comparison table, instant Excel export
Ask questions in plain English - no SQL knowledge required
- Claude Sonnet 4 translates natural language to SQL
- MCP Validation Layer validates SQL against security policies
- DuckDB 1.4 executes validated queries on Census data
- Agent SDK handles parallel queries and conversational context
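The four steps above form a pipeline in which SQL only reaches the database after it passes validation. The sketch below shows that ordering with each stage injected as a function. `answerQuestion` and `PipelineStages` are hypothetical names, and the real model call, MCP validation, and DuckDB execution are deliberately left as placeholders rather than guessed at.

```typescript
type ResultRow = Record<string, string | number | null>;

// Each stage is passed in, so the sketch makes no claim about the real SDK calls.
interface PipelineStages {
  translate: (question: string) => Promise<string>; // natural language to SQL
  validate: (sql: string) => { ok: boolean; sql?: string; reason?: string };
  execute: (sql: string) => Promise<ResultRow[]>;   // runs the validated SQL
}

async function answerQuestion(
  question: string,
  stages: PipelineStages,
): Promise<ResultRow[]> {
  const candidate = await stages.translate(question);
  const verdict = stages.validate(candidate);
  if (!verdict.ok || verdict.sql === undefined) {
    // Rejected SQL never reaches the database.
    throw new Error("query rejected: " + (verdict.reason ?? "unknown reason"));
  }
  return stages.execute(verdict.sql);
}
```

Keeping the stages as plain functions also makes the ordering easy to test with stubs in place of the model and the database.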
- Sortable, filterable data tables
- Bar and line chart visualizations
- Drill-down from county to block groups
- Excel/PDF export with one click
Primary Users:
- Business Analysts at health systems and Medicare Advantage plans
- Strategy Teams planning market expansion and facility placement
- Healthcare Researchers conducting population health studies
const stack = {
  backend: "Node.js 20 + TypeScript + Express 5",
  frontend: "Next.js 15 + React 19 + Tailwind CSS 4",
  databases: "PostgreSQL 15 + Redis 7 + DuckDB 1.4.3",
  ai: "Claude Sonnet 4 + Agent SDK 0.2.30 + MCP SDK 1.25.3",
  interactive: "TanStack Table + Recharts + MCP Apps",
  deployment: "Docker + containerized testing"
};

- Sub-2 Second Queries with MCP validation
- Parallel Execution for comparison queries
- 82%+ Test Coverage with comprehensive test suites
- HIPAA-Ready Architecture with encryption and audit logging
See API_KEY_SETUP.md for detailed instructions.
- Anthropic API: Get key at console.anthropic.com
- Census API: Request key at api.census.gov/data/key_signup.html
- Add both keys to backend/.env
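A quick startup check can catch a missing key before the first query fails. The sketch below is an assumption-laden example: the variable names `ANTHROPIC_API_KEY` and `CENSUS_API_KEY` are guesses (API_KEY_SETUP.md is the authority on the names the backend actually reads), `missingKeys` is a hypothetical helper, and loading `backend/.env` into the environment is not shown.

```typescript
// Assumed variable names; confirm them against API_KEY_SETUP.md.
const REQUIRED_KEYS = ["ANTHROPIC_API_KEY", "CENSUS_API_KEY"] as const;

// Returns the names of required keys that are absent or blank.
function missingKeys(env: Record<string, string | undefined>): string[] {
  return REQUIRED_KEYS.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "";
  });
}
```

In a Node.js backend this would typically be called once at startup as `missingKeys(process.env)`, with the server refusing to start if the returned list is not empty.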
cd backend
./scripts/setup-database.sh # Takes 2-3 hours
./demo-setup.sh
Open http://localhost:3000 and start querying.
- Medicare eligibility calculations (age 65+ demographics)
- Senior care demographics for facility planning
- Population health indicators and social determinants
- Market analysis for health system expansion
- Click column headers to sort
- Filter rows without re-querying
- Drill down from county to block groups
- Export selected columns to Excel/CSV
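Sorting and filtering "without re-querying" means the work happens on rows already held by the client. The sketch below illustrates that idea in plain TypeScript. It is not the TanStack Table API the project uses, and `applyView`, `ViewState`, and the column names in the usage note are hypothetical.

```typescript
type TableRow = Record<string, string | number>;

interface ViewState {
  sortBy?: string;                    // column to sort on
  descending?: boolean;
  minValues?: Record<string, number>; // column name to inclusive lower bound
}

// Derives the visible rows from rows already in memory; no new query is issued.
function applyView(rows: readonly TableRow[], view: ViewState): TableRow[] {
  const bounds = view.minValues ?? {};
  const filtered = rows.filter((row) =>
    Object.keys(bounds).every((column) => {
      const value = row[column];
      const bound = bounds[column];
      return typeof value === "number" && bound !== undefined && value >= bound;
    }),
  );

  const key = view.sortBy;
  if (key === undefined) return filtered;
  const direction = view.descending ? -1 : 1;
  // filter() returned a fresh array, so sorting it leaves the input untouched.
  return filtered.sort((a, b) => {
    const x = a[key];
    const y = b[key];
    const order =
      typeof x === "number" && typeof y === "number"
        ? x - y
        : String(x).localeCompare(String(y));
    return order * direction;
  });
}
```

Calling `applyView(rows, { sortBy: "seniors", descending: true, minValues: { seniors: 200 } })` keeps only rows whose hypothetical `seniors` column is at least 200 and orders them largest first, leaving the original `rows` array unchanged.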
- "Compare Tampa Bay vs Phoenix" runs in parallel
- Side-by-side demographic breakdowns
- Automatic chart generation for comparisons
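Running a comparison in parallel amounts to starting one query per region before awaiting any of them. A minimal sketch with `Promise.all` follows. `compareRegions` and the `runQuery` callback are hypothetical stand-ins for however the Agent SDK actually dispatches each region's query.

```typescript
interface RegionResult<T> {
  region: string;
  rows: T[];
}

async function compareRegions<T>(
  regions: readonly string[],
  runQuery: (region: string) => Promise<T[]>,
): Promise<RegionResult<T>[]> {
  // map() starts every query immediately; nothing is awaited until Promise.all.
  const pending = regions.map(async (region) => ({
    region,
    rows: await runQuery(region),
  }));
  // Results come back in the same order as the input regions.
  return Promise.all(pending);
}
```

Because `Promise.all` preserves input order, the side-by-side table can rely on the first result being the first region named in the question, regardless of which query finishes first. It also rejects as soon as any one query fails; `Promise.allSettled` would be the alternative if partial results are acceptable.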
- "Show seniors in Tampa Bay"
- "Now filter to income over $75K" (understands context)
- "Export that to Excel"
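One way to picture how a follow-up such as "Now filter to income over $75K" builds on the previous question is to keep the query as structured state and merge each new instruction into it. The sketch below is only an illustration of that idea. `Conversation`, `QueryState`, and its fields are hypothetical, and the README does not say how the Agent SDK actually stores context.

```typescript
// Hypothetical structured form of a question; real state is likely richer.
interface QueryState {
  region?: string;
  minAge?: number;
  minIncome?: number;
}

class Conversation {
  private state: QueryState = {};

  // Merge a follow-up into the prior state: new fields are added,
  // repeated fields are overwritten, everything else carries over.
  refine(update: QueryState): QueryState {
    this.state = { ...this.state, ...update };
    return { ...this.state };
  }
}
```

Under these assumptions, "Show seniors in Tampa Bay" might become `refine({ region: "Tampa Bay", minAge: 65 })`, and the follow-up `refine({ minIncome: 75000 })` would return a state that still carries the region and age filter.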
- HIPAA-ready architecture with encryption at rest
- SQL injection protection with validation layer
- Complete audit trail for compliance
- No PHI storage - only aggregated demographic data
Problem We Solve:
- $2.8B spent annually on demographic consulting
- 6,090 hospitals needing demographic analysis
- 30,000+ senior care facilities planning expansion
Our Solution:
- 200x cost reduction: $50K → $297/month
- 300x speed improvement: 6 weeks → seconds
- Unlimited iterations with conversational follow-up
- Write quality TypeScript with 90%+ test coverage
- Maintain sub-2s query response times
- Follow HIPAA-ready security practices
- Run npm run secret-scan before committing
Star this repository if CensusChat helps your healthcare data work
Transforming healthcare demographic analysis from weeks to seconds
10+ years of healthcare data experience • Frustrated by $50K consulting delays