This is a streamlined guide to get you up and running quickly with importing YC Winter 2026 companies.
- Neon database credentials configured
- n8n webhook URL configured
- npm dependencies installed
- Database connection tested
# 1. Install dependencies (if not already done)
cd backend && npm install
# 2. Test database connection
npm run db:test
# 3. Import YC Winter 2026 companies
npm run import:yc-w26The script will:
- Find 40+ companies from YC Winter 2026 batch
- Extract details for each company:
- Company name
- Description
- Website URL
- Sector/category
- Founders' LinkedIn profiles
- Store in Neon database with duplicate detection
- Trigger n8n workflows (if configured) for:
- AI report generation
- Company evaluation
- Scraping: ~2-3 minutes for 40 companies
- Rate limiting: 3 seconds between requests
- Total time: 5-7 minutes
Your .env file is already configured with:
# Neon Database
DATABASE_URL=postgresql://neondb_owner:npg_2kZoxdber7Fl@ep-ancient-wildflower-a4c1trj9-pooler.us-east-1.aws.neon.tech/neondb?sslmode=require&channel_binding=require
# n8n Webhooks
N8N_WEBHOOK_BASE_URL=https://bubblyducks.app.n8n.cloud/webhookTo test the scraper without importing to database:
npm run test:scraperThis will:
- Scrape the first page of Winter 2026 companies
- Show details for the first company
- Verify scraping is working correctly
The import script provides real-time progress:
[1/40] Processing: Veriad
Fetching details for veriad...
✓ Imported successfully
[2/40] Processing: Corelayer
Fetching details for corelayer...
⊘ Skipped (already_exists)
[3/40] Processing: Chasi
Fetching details for chasi...
✓ Imported successfully
Once complete, you can:
-
View in database:
psql 'postgresql://neondb_owner:npg_2kZoxdber7Fl@ep-ancient-wildflower-a4c1trj9-pooler.us-east-1.aws.neon.tech/neondb?sslmode=require&channel_binding=require' SELECT name, website FROM companies WHERE yc_batch = 'Winter 2026';
-
Check n8n workflows: Log in to https://bubblyducks.app.n8n.cloud and verify workflows are processing
-
View in frontend: Start the frontend app and browse companies
- YC website structure may have changed
- Try running
npm run test:scraperto diagnose
- Run
npm run db:testto verify connection - Check
.envfile for correctDATABASE_URL
- Verify workflows are activated in n8n dashboard
- Check
N8N_WEBHOOK_BASE_URLin.env - This is non-critical - companies will still be imported
See the comprehensive guide: IMPORT_GUIDE.md