Searchy

Plug-and-play NL→SQL API. ExpressJS + TypeScript, Bun, Postgres-first with an adapter interface for future databases. It exposes two REST endpoints:

/query – takes a natural-language phrase, generates a safe SELECT-only SQL using an LLM (OpenAI-compatible JSON mode), executes it with guardrails, and returns JSON rows.
/explain – takes the same body and returns a concise explanation answering schema/relationship questions about your DB (no SQL is generated or executed).

Architecture

Client → /query (Express)
  ├─ Validate request, pick dbUrl
  ├─ LRU pool + PostgresAdapter
  ├─ Relation Cards cache (TTL)
  │    ├─ listRelations/describeRelation
  │    └─ listRelationships → join_hints
  ├─ pickTopK(cards, phrase)
  ├─ LLM(JSON mode) → {sql, params}
  ├─ SQL Guard: SELECT-only, LIMIT<=MAX
  ├─ set statement_timeout
  ├─ runSelect(sql, params)
  └─ Return { rows, rowCount, sql }

Quickstart (Bun)

Prereqs: Bun installed, Postgres URL.

bun install
bun run dev

curl -s localhost:7679/query \
  -H 'content-type: application/json' \
  -d '{"phrase":"last 10 paid orders with emails","dbUrl":"postgres://user:pass@host:5432/db"}' | jq .

Environment variables (.env example):

OPENAI_API_KEY=sk-...
LLM_MODEL=gpt-4o-mini
PG_URL=postgres://readonly@host:5432/db   # optional default
CARDS_TTL_SECONDS=900
STATEMENT_TIMEOUT_MS=3000
MAX_LIMIT=1000
PORT=7679
LOG_LEVEL=info
MOCK_LLM=0            # set to 1 in tests/dev to avoid network
MAX_RESPONSE_BYTES=5242880

Notes:

dbUrl in request overrides PG_URL. One must be set.
If you don’t set OPENAI_API_KEY, set MOCK_LLM=1 for deterministic local runs/tests.

Docker

docker build -t sqlgpt-express .
docker run --rm -p 7679:7679 -e OPENAI_API_KEY=$OPENAI_API_KEY sqlgpt-express

Compose with demo Postgres and seed:

docker-compose up --build
# API on :7679; DB on :5432; read-only role: app_ro/app_ro_pass

Endpoints

POST /query

Request:

{ "phrase": "top customers in floripa by revenue", "dbUrl": "postgres://user:pass@host:5432/db" }

Response:

{ "rows": [...], "rowCount": 123, "sql": "SELECT ... LIMIT 1000" }

POST /explain

Request:

{ "phrase": "how do orders link to customers?", "dbUrl": "postgres://user:pass@host:5432/db", "context": "Q: what tables exist? A: public.orders, public.customers" }

Response:

{ "answer": "orders.customer_id references customers.id", "references": ["public.orders","public.customers"] }

Security Defaults

SELECT-only gate; reject anything else.
Force LIMIT (cap at MAX_LIMIT).
statement_timeout per query.
Optional response size cap (MAX_RESPONSE_BYTES, default 5MB). If exceeded, rows are truncated and a truncated: true flag is added.
Centralized error handler; never echo stack traces.

Adapters

Implement IntrospectionAdapter from src/adapters/db.ts for a new DB:

testConnection()
listRelations() → user schemas only
describeRelation(name) → columns, PKs, indexed flags, kind, estimates
listRelationships() → FK graph for join hints
setTimeoutMs(ms) → apply per-connection timeout
runSelect(sql, params) → execute SELECT-only

Add a file like src/adapters/mysql.ts implementing the interface and wire getAdapter() accordingly.

LLM Wrapper

src/llm/llm.ts provides a provider-agnostic interface. Default is OpenAI-compatible JSON mode with temperature: 0. For tests/dev, enable MOCK_LLM=1 to avoid network calls.

For /query, the system prompt embeds top-K Relation Cards with hard rules (SELECT-only, join_hints, LIMIT<=1000, schema-qualified names) and requires strict JSON output: {"sql":"...","params":[...]}.
For /explain, the system prompt focuses on schema/relationship explanations and requires strict JSON output: {"answer":"...","references":[...]}.

Tests

Run: bun test

tests/unit.guard.test.ts – SQL guard behavior
tests/e2e.query.test.ts – Spins an Express server and hits /query. Uses MOCK_LLM=1. Requires TEST_PG_URL or PG_URL to be set; otherwise skips.
tests/e2e.explain.test.ts – Spins an Express server and hits /explain. Uses MOCK_LLM=1. Requires TEST_PG_URL or PG_URL to be set; otherwise skips.

Pragmatic choices

Statement timeout is set via SET statement_timeout on a borrowed client, then reset to DEFAULT.
Relation estimates rely on pg_class.reltuples only; sufficient for ranking. No ANALYZE is triggered.
pickTopK uses keyword scoring over names, columns, and join hints; deterministic and cheap.
Response truncation is approximate based on JSON byte length and keeps the first N rows.

License

MIT – see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
docker/postgres		docker/postgres
public		public
src		src
tests		tests
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
bun.lock		bun.lock
bunfig.toml		bunfig.toml
docker-compose.yml		docker-compose.yml
index.ts		index.ts
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Searchy

Quickstart (Bun)

Docker

Endpoints

Security Defaults

Adapters

LLM Wrapper

Tests

Pragmatic choices

License

searchy

About

Uh oh!

Releases

Packages

Languages

License

IgorSilvestre/searchy

Folders and files

Latest commit

History

Repository files navigation

Searchy

Quickstart (Bun)

Docker

Endpoints

Security Defaults

Adapters

LLM Wrapper

Tests

Pragmatic choices

License

searchy

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages