Releases · padmanabhan-r/GrooveForge

What's New

Added

Sound Match — play any song, Gemini extracts the sonic fingerprint (BPM, key, mood, texture, instrumentation), and GrooveForge generates something completely original in the same vibe. Artist name and song title never leave your device.
Generated History — every track saved to localStorage. Replay, rename, or download MP3 across sessions.
Track delete — remove individual entries from history.
Datasets section in README — LP-MusicCaps-MSD (513K), FMA (106K), MSD full (1M), MusicCaps (5.5K).
Copyright-Safe by Design section — documents the four-layer safety approach.

Changed

Embedding split: offline pipeline runs all-MiniLM-L6-v2 locally on CPU (~1K vecs/sec); query-time embedding served via OpenRouter — identical 384-dim vectors, no model weights in the Railway container.
Data pipeline section condensed; Blueprint Schema removed from README (canonical in CLAUDE.md).
License: CC BY 4.0.

Fixed

Blueprint list scroll broken — min-h-0 on the flex child so overflow-y-auto takes effect.
pydantic-settings rejected POSTGRES_URL with extra_forbidden — extra="ignore" added to Settings so pipeline-only env vars don't break API startup.
Generate button hidden when blueprint list grew long — list is now scrollable, button always visible.
Visual border separator added between blueprint list and generation controls.
Hardcoded PostgreSQL URL removed from pipeline scripts; reads POSTGRES_URL from environment.

Full changelog: https://github.com/padmanabhan-r/GrooveForge/blob/main/CHANGELOG.md

[1.0.0] — 2026-04-13

GrooveForge is a retrieval-augmented music creation system — THE ULTIMATE AI TOOLKIT FOR ORIGINAL MUSIC CREATION.

What It Does

Music discovery is broken. Song titles tell you nothing about feel. Artist names lock you into what you already know. Playlists are curated by someone else's taste. GrooveForge starts where every other tool stops: with the vibe itself.

Instead of describing music in the abstract, you search by the actual structural properties that make music sound the way it does — key, tempo, mood, instrumentation, lyrical themes. GrooveForge gives you four powerful ways to create:

Vibe Graph — Click genre, mood, tempo, key, and theme nodes to compose a vibe. Every node selection tightens the search query.
Text-to-Music — Type a natural-language description: "moody synthwave, 110 BPM, instrumental". Your description is embedded with all-MiniLM-L6-v2 and searched with hybrid ANN + BM25 retrieval across both namespaces.
Lyrics-to-Music — Paste original lyrics. Gemini analyzes emotional tone, themes, energy level, and rhythmic structure. The derived traits drive blueprint retrieval. Lyrics are placed only in ElevenLabs lines fields — never mixed with style guidance.
Sound Match — Upload or record a reference clip. Gemini analyzes it for BPM, key, mood, texture, and instrumentation. Those derived traits drive blueprint retrieval — no audio file is ever stored or processed beyond the analysis step.

At its core, GrooveForge indexes 620,000+ audio blueprints enriched with features that define a song's DNA: genre, mood, key, tempo, energy, danceability, acousticness, valence, instrumentalness, and vocal characteristics.

How It Works

1. Describe your vibe — Select nodes, type a description, paste lyrics, or record a clip.

2. Retrieve blueprints — Input is embedded with all-MiniLM-L6-v2 and sent as a hybrid ANN + BM25 query to Turbopuffer across two namespaces simultaneously (lp_msd_minilm + fma_minilm). Both namespaces are queried concurrently via asyncio.gather, and all four ranked result lists (ANN + BM25 per namespace) are merged using Reciprocal Rank Fusion (RRF, k=60) to produce a unified, deduplicated ranking of the closest 5–10 blueprints.

3. Aggregate traits — Blueprint metadata is collapsed into a generation profile: average BPM, dominant key/mode, most-frequent genre and mood clusters, merged instrumentation list.

4. Generate your track — Gemini synthesizes a grounded text prompt (simple mode) or structured composition plan (advanced mode) strictly from the retrieved blueprint traits. ElevenLabs Music API produces the audio. Lyrics are placed only in ElevenLabs lines fields per section, never mixed with style guidance.

Every generated track comes with a visible reasoning trail — the exact blueprint cards and aggregated profile that drove the generation. No black boxes. No hallucinated characteristics.

Generation Modes

Simple (Prompt) — Fast iteration. One text prompt derived from aggregated blueprint traits sent to ElevenLabs.

Advanced (Composition Plan) — Structured songs with section-level control (intro/verse/chorus/bridge/outro), lyric placement per section, local style guides, and a negative_global_styles field to suppress unwanted traits.

Both modes support Review Before Generate — a dry-run that shows you the exact prompt or composition plan before committing to an ElevenLabs call.

Data Pipeline

The blueprint index was built in three offline stages:

Stage 1 — Raw data → PostgreSQL

load_msd_full.py: MSD SQLite (1M tracks) + HDF5 (tempo/key/mode/loudness) → msd_songs table
Views lp_musiccaps_msd_v and fma_all_v sit above the raw tables, exposing structured columns

Stage 2 — PostgreSQL views → Blueprint Parquets

ingest_blueprints.py: lp_musiccaps_msd_v (513,977 rows) → blueprints_lp_msd.parquet; fma_all_v (106,574 rows) → blueprints_fma.parquet
For MSD: tags parsed into genre/mood/themes via vocabulary sets; vocal type inferred from tag strings; energy derived from loudness; text field assembled from captions + tags + key/mode/BPM + artist
For FMA: genre from genre_top; mood from echonest valence threshold; vocal type from instrumentalness threshold

Stage 3 — Parquets → Turbopuffer

embed_blueprints.py: sentence-transformers/all-MiniLM-L6-v2 embedding (384-dim, L2-normalized); 256 rows per encode, 500 rows per upsert; checkpointed to data/.embed_checkpoints/
Schema: text (full-text search enabled), plus filterable string and numeric attributes
Two namespaces: lp_msd_minilm (513,977 records) + fma_minilm (106,574 records)

Turbopuffer Retrieval

Query embedding via all-MiniLM-L6-v2 through OpenRouter. Each namespace runs a hybrid ANN + BM25 query via multi_query. Both namespaces queried concurrently with asyncio.gather. Results merged with RRF (k=60) — deduplication across 4 ranked lists (lp_ann, lp_bm25, fma_ann, fma_bm25). Metadata filters (BPM range, vocal_type) applied to both ANN and BM25 sub-queries.

Artist Firewall

Artist names and song titles from retrieved blueprints are never passed to Gemini or ElevenLabs. Only derived traits (genre, BPM, key, mood, instrumentation, energy) are used.

Generated History

Every track generated is saved locally (localStorage). Replay any track, rename it, or download the MP3. History persists across sessions.

Tech Stack

Frontend: React 18, TypeScript, Vite, TailwindCSS, Framer Motion, react-flow, Radix UI
Backend: FastAPI, Python 3.13+, uv, Pydantic, asyncio
AI & Music: ElevenLabs Music API (prompt + composition-plan), Google Gemini 2.5-flash
Retrieval: Turbopuffer — ANN + BM25 hybrid, metadata filters, RRF merge across 2 namespaces
Embeddings: all-MiniLM-L6-v2 via OpenRouter API (384-dim)
Data Sources: LP-MusicCaps-MSD (513K), Free Music Archive (106K), MSD full (1M), MusicCaps (5.5K)
Deployment: Railway (backend) + Vercel (frontend)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

What's New

Added

Changed

Fixed

Uh oh!

Choose a tag to compare

Sorry, something went wrong.

Sorry, something went wrong.

Uh oh!

No results found

[1.0.0] — 2026-04-13

What It Does

How It Works

Generation Modes

Data Pipeline

Turbopuffer Retrieval

Artist Firewall

Generated History

Tech Stack

Uh oh!

Releases: padmanabhan-r/GrooveForge

The Blueprint Drop

Uh oh!

v1.2.0 — The Blueprint Drop

What's New

Added

Changed

Fixed

Uh oh!

First Public Release

[1.0.0] — 2026-04-13

What It Does

How It Works

Generation Modes

Data Pipeline

Turbopuffer Retrieval

Artist Firewall

Generated History

Tech Stack

Uh oh!