- Clean up the original lyrical-miracles repo into a structured v2 layout (content/data/media/platform).
- Set up Cloudflare deployment scripts: R2 uploads, D1 schema seeding, Vectorize/AI ingestion.
- Prepare infra for automatic CMS updates and future RAG/chatbot integration.
-
Repo Restructure
- Created
lyrical-miracles-v2/with clear directories (content/,media/,data/,platform/, etc.). - Copied all HAWK-ARS assets and docs into the new structure; added pipeline documentation and structure docs.
- Created
-
Automation Scripts
scripts/normalize_assets.py: slug enforcement + asset map.scripts/build_manifest.py: aggregates content/media/catalog metadata.scripts/publish_cloudflare.sh: manifests, R2 uploads, D1 schema + seed (useslyrical-env.sh).
-
Vectorize / Worker Path
- Added Cloudflare Vectorize index (
lyrical-miracles) and ingestion CLI (scripts/autorag/ingest.py). - Deployed
platform/vectorize-worker/exposing/ingest(embeddings + upserts) and/search(query endpoint). - CLI now supports
--worker-urlto post chunk batches via the Worker.
- Added Cloudflare Vectorize index (
-
Docs & Env Updates
docs/PIPELINE.md,docs/EverLights_autorag_guidebook.mddescribe the end-to-end workflow (normalize → manifest → R2 → D1 → Vectorize ingest/query).lyrical-env.sh/.examplecontain all required env vars (CF account/token, R2 bucket, D1 DB, Vectorize worker URL, AI model).
-
GitHub Sync
- Repo committed and pushed to
https://github.com/OmniversalMediaLLC/lyrical-miraclesv2.git. - Large binaries excluded via
.gitignore; Worker + scripts are committed.
- Repo committed and pushed to
dist/manifest.jsongenerated with 200+ content entries.- D1 database (
lyrical-miracles) seeded via publish script. - R2 bucket
lyrical-miracles-v2holds audio/images (synced manually earlier). - Vectorize index populated via Worker ingestion (
CF_VECTORIZE_WORKER_URL=https://lyrical-vector-ingest.omniversalmail.workers.dev/ingest). - Worker
/search?q=...returns matches for site search/chat.
-
Workers API / CMS Exporters
- Create Worker endpoints that serve structured data (tracks/releases) for WordPress/Ghost/Drupal or static builds.
- Implement exporters under
platform/exporters/to push manifest data into each CMS.
-
Front-End Wiring
- Build a search/chat widget (using
/search) and integrate into the site. - Wire CMS/site to the new API so lyric pages auto-populate.
- Build a search/chat widget (using
-
CI/CD
- Add GitHub Actions to run
normalize_assets,build_manifest, optional tests, and maybe auto-trigger ingestion or Worker deploys.
- Add GitHub Actions to run
-
Further AI Features
- Expand Worker
/searchto include more filters or combined QA. - Add chat endpoint (RAG) using Vectorize matches + Workers AI text generation.
- Expand Worker
- To rehydrate on a new machine/server, simply
git clone https://github.com/OmniversalMediaLLC/lyrical-miraclesv2.git, copy.exampleenv tolyrical-env.sh, fill secrets, then run the helper scripts. - Worker deploy commands live in
platform/vectorize-worker; remember to redeploy after changes (wrangler deploy). - The ingestion CLI (
scripts/autorag/ingest.py) requiresrequestsand Python 3.10+.