Transform any YouTube/Bilibili video into structured AI-powered notes — just by chatting with Claude.
Drop a video URL in Claude Code, and get back:
- Structured markdown notes with key points, chapters, and code examples
- Smart formatting for technical content (commands, code blocks, examples)
- Privacy-first processing - audio never leaves your machine
- Ultra-low cost - ~$0.02 per video (only AI summary costs money)
Perfect for: Tutorial videos, tech talks, lectures, documentation videos
System Requirements:
Python Dependencies:
cd scripts
pip install -r requirements.txt- Visit OpenRouter.ai
- Sign up and create an API key
- Add $5-10 credits (enough for 200+ videos)
cp .env.example .env
# Edit .env and add: OPENROUTER_API_KEY=sk-or-your-key-hereThat's it! The skill is ready to use.
Just talk naturally to Claude:
English:
Convert this video to notes: https://www.youtube.com/watch?v=a9eR1xsfvHg
Summarize this tutorial in English: https://www.youtube.com/watch?v=...
Please save the notes to my Desktop/notes folder
中文:
请将这个视频转换为笔记:https://www.bilibili.com/video/BV1xx411c7XZ
帮我总结这个教程,用中文
把笔记保存到我的文档文件夹
Claude will:
- ✅ Check your environment automatically
- ✅ Download and process the video
- ✅ Generate beautiful markdown notes
- ✅ Save to file if you ask
See real example: examples/github-spec-kit-notes.md
Every note includes:
- Clear title based on video content
- Video metadata (source, duration, author, timestamp)
- Core takeaways summary (3-7 bullet points)
- Structured sections with hierarchical headings
- Technical terms formatted as
inline code - Code examples with syntax highlighting
- Smart examples when concepts are complex
| Feature | Details |
|---|---|
| Platforms | YouTube, Bilibili, Xiaohongshu |
| Languages | Auto-detect, or specify: Chinese, English, Japanese, etc. |
| Video Length | Up to 2 hours (configurable) |
| Speed | ~2-3 minutes for an 8-minute video |
| Cost | ~$0.02-0.05 per video (AI summary only) |
| Privacy | Audio processed locally with Whisper |
AI Models Available:
google/gemini-2.5-flash(default - fast & cheap)anthropic/claude-3.5-sonnet(best quality)openai/gpt-4-turbo(balanced)
Edit .env to customize:
# Required
OPENROUTER_API_KEY=sk-or-your-key-here
# Optional - customize these if needed
AI_MODEL=google/gemini-2.5-flash # Which AI to use
WHISPER_MODEL=base # Whisper model: tiny/base/small/medium/large
DEFAULT_LANGUAGE=zh # Default language: zh/en/auto
OUTPUT_DIRECTORY=. # Where to save notes| Model | Speed | Accuracy | RAM | Best For |
|---|---|---|---|---|
tiny |
⚡⚡⚡ | ⭐⭐ | 1GB | Quick drafts |
base |
⚡⚡ | ⭐⭐⭐ | 1GB | Default - balanced |
small |
⚡ | ⭐⭐⭐⭐ | 2GB | Better accuracy |
medium |
🐌 | ⭐⭐⭐⭐⭐ | 5GB | High quality |
Video URL → yt-dlp → FFmpeg → Whisper (local) → OpenRouter API → Markdown
If you prefer running scripts directly:
python scripts/process_video.py \
--url "https://www.youtube.com/watch?v=..." \
--language zh \
--save-to-file \
--output-path "./notes"MIT License - See LICENSE
Built with yt-dlp · FFmpeg · OpenAI Whisper · OpenRouter
⭐ Star this repo if it helps you!