News Summarization Proof of Concept

A Next.js + TypeScript project leveraging OpenAI or Ollama for ingesting, embedding, clustering, and exploring news articles.

Setup

1. Clone the repo and install dependencies

git clone https://github.com/jeromecovington/silver-skates.git
cd silver-skates
yarn install

npm install -g bun

2. Configure environment

Create a .env.local file at the project root:

NEWS_API_KEY=your_newsapi_key
INGEST_SECRET=your_custom_token
DATABASE_URL=postgresql://postgres:postgres@localhost:5432/mydb

Postgres Setup

Option A: Docker (recommended for local dev)

docker run --name news-postgres \
  -e POSTGRES_PASSWORD=postgres \
  -e POSTGRES_DB=mydb \
  -p 5432:5432 \
  -d postgres:15

Option B: Local Postgres install

Make sure Postgres is running and create the mydb database:

createdb mydb

Prisma

1. Generate Prisma client

npx prisma@6 generate

2. Run DB migration

npx prisma@6 migrate dev --name init

Ingest > Cluster > Preview

✅ Ingest articles from NewsAPI

Preferred method: script

bun run ingest

Deprecated method: api call

curl -X POST http://localhost:3000/api/ingest \
  -H "Authorization: Bearer your_custom_token"

This will:

Fetch new articles
Deduplicate
Extract keywords
Generate MiniLM embeddings
Store in Postgres

Cluster articles by semantic similarity

bun run cluster

This uses K-Means clustering on embeddings and stores cluster IDs in each article.

Summarize articles with GPT or local model

GPT (default)

bun run summarize

This uses OpenAI’s gpt-3.5-turbo model to generate concise 2–3 sentence summaries for articles that do not yet have a summary. Summaries are stored in the summary field of each article and surfaced via the /api/preview endpoint.

Local Model

LLM_MODE=local \
LLM_BASE_URL=http://localhost:11434 \
LLM_MODEL=llama3.1:8b \
bun run summarize

This assumes Ollama running locally or on your LAN, and installation of the llama3.1:8b model.

Pipeline

Ingestion, clustering, and summarizing can be run sequentially using:

bun run pipeline

Preview recent articles

curl "http://localhost:3000/api/preview?token=your_custom_token"

Returns latest articles, including keywords and cluster assignments.

Web App

Start the application locally, e.g.

yarn run dev

Cluster exploration

http://localhost:3000/clusters

Chat inteface

http://localhost:3000/chat

Notes

Embeddings generated via @xenova/transformers (MiniLM)
Clustering via ml-kmeans
Keywords via TF-IDF from natural

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
docs		docs
prisma		prisma
public		public
src		src
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bunfig.toml		bunfig.toml
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json
tsconfig.scripts.json		tsconfig.scripts.json
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

News Summarization Proof of Concept

Setup

1. Clone the repo and install dependencies

2. Configure environment

Postgres Setup

Option A: Docker (recommended for local dev)

Option B: Local Postgres install

Prisma

1. Generate Prisma client

2. Run DB migration

Ingest > Cluster > Preview

✅ Ingest articles from NewsAPI

Preferred method: script

Deprecated method: api call

Cluster articles by semantic similarity

Summarize articles with GPT or local model

GPT (default)

Local Model

Pipeline

Preview recent articles

Web App

Cluster exploration

Chat inteface

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

News Summarization Proof of Concept

Setup

1. Clone the repo and install dependencies

2. Configure environment

Postgres Setup

Option A: Docker (recommended for local dev)

Option B: Local Postgres install

Prisma

1. Generate Prisma client

2. Run DB migration

Ingest > Cluster > Preview

✅ Ingest articles from NewsAPI

Preferred method: script

Deprecated method: api call

Cluster articles by semantic similarity

Summarize articles with GPT or local model

GPT (default)

Local Model

Pipeline

Preview recent articles

Web App

Cluster exploration

Chat inteface

Notes

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages