Skip to content

neurox-io/ai-media-studio-cli

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🎨 AI MEDIA STUDIO CLI

AI Media Studio CLI

πŸš€ Professional Multi-Modal AI Media Generation Tool

Generate videos, images, and music with Google's AI models using simple text prompts

Version Python License Developer

Multi-Modal Images Music Professional


πŸ–ΌοΈ Experience AI Media Studio CLI

✨ Beautiful Terminal Interface for AI Media Generation

AI Media Studio CLI Interface

🎬 Professional Multi-Modal AI Media Generation Tool with an intuitive terminal interface

🎯 Launch Interactive Mode

# Start the beautiful interactive experience
ai-studio interactive

Interactive Features:

  • 🎨 Smart Media Selection - Choose between Video, Image, or Music generation
  • πŸ€– AI Model Showcase - Compare capabilities of different models
  • πŸ“ Visual Parameter Config - Real-time validation and previews
  • πŸ’‘ Prompt Writing Guide - Built-in tips for each media type
  • πŸ“Š Generation Preview - See what you're about to create
  • 🎯 Intelligent Recommendations - AI-powered suggestions
  • πŸ“ Automatic Organization - Smart file management by media type

🎯 Why AI Media Studio CLI?

✨ One Tool, All Media Types

  • 🎬 Video Generation with Google's latest Veo models
  • πŸ–ΌοΈ Image Creation with Imagen (coming soon)
  • 🎡 Music Composition with MusicLM (planned)
  • 🎭 Beautiful Interactive Mode for easy media generation
  • πŸ“ Smart Organization - Auto-sorts all your generated media

πŸš€ Built for Creators

  • ⚑ Fast Generation - Optimized for quick results
  • 🎨 Professional Quality - Using Google's best AI models
  • πŸ’» Developer Friendly - Clean CLI with intuitive commands
  • πŸ”„ Extensible - Easy to add new models and features

πŸš€ Quick Start Guide

πŸ“¦ Installation

# Clone the repository
git clone https://github.com/Abdulrahman-Elsmmany/ai-media-studio-cli.git
cd ai-media-studio-cli

# Install dependencies with UV (recommended)
uv sync

# Alternative: Install with pip
pip install -e .

βš™οΈ Configuration

Create your .env file with Google AI credentials:

# πŸ”‘ Google AI API Configuration
GOOGLE_CLOUD_PROJECT=your-project-id
GOOGLE_CLOUD_LOCATION=us-central1
GOOGLE_GENAI_USE_VERTEXAI=True
GOOGLE_API_KEY=your-google-api-key

# πŸͺ£ Google Cloud Storage Configuration
GOOGLE_CLOUD_STORAGE_BUCKET=your-bucket-name
GOOGLE_CLOUD_STORAGE_PATH=videos

🎯 Get Started in 30 Seconds

# 🎬 Generate your first video
ai-studio generate video -p "a cinematic sunset over mountains"

# 🎭 Or use interactive mode (recommended)
ai-studio interactive

✨ Multi-Modal AI Capabilities

🎬 Advanced Video Generation

  • Google Veo 2.0 & 3.0 models
  • 5-8 second high-quality videos
  • Video extension and continuation
  • Multiple aspect ratios for all platforms

πŸ–ΌοΈ Professional Image Creation

  • Google Imagen models (coming soon)
  • Multiple resolutions up to 4K
  • Style control and customization
  • Batch generation for workflows

🎡 AI Music Composition

  • Google MusicLM integration (planned)
  • Custom length and style control
  • Genre-specific generation
  • High-quality audio output

πŸ–₯️ Premium User Experience

  • Unified CLI interface for all media types
  • Interactive mode with beautiful UI
  • Real-time progress tracking
  • Smart file organization and downloads
  • Automatic media organization (videos/, images/, audios/)
  • Concurrent downloads with progress tracking

🎯 Usage Examples

🎬 Quick Video Generation

# Simple video generation
ai-studio generate video -p "a majestic eagle soaring over mountains"

# With specific model and settings
ai-studio generate video \
  --prompt "cinematic drone shot of ocean waves at sunset" \
  --model veo3-001 \
  --aspect-ratio 16:9

πŸ“ Automatic Media Organization

# All generated media is auto-organized:
downloaded_media/
β”œβ”€β”€ videos/    # .mp4, .avi, .mov files
β”œβ”€β”€ images/    # .jpg, .png, .gif files  
β”œβ”€β”€ audios/    # .mp3, .wav, .flac files
└── unknown/   # Other file types

🎯 Advanced Examples

🎬 Professional Video Generation

# πŸŒ… Cinematic landscape video
ai-studio generate video \
  --prompt "golden hour cinematic shot of a serene lake with mountains reflected in still water" \
  --model veo3-001 \
  --aspect-ratio 16:9 \
  --resolution 1080 \
  --videos 2 \
  --duration 8

# πŸ“± Social media vertical video
ai-studio generate video \
  --prompt "trendy coffee shop aesthetic with latte art being created" \
  --model veo2-001 \
  --aspect-ratio 9:16 \
  --resolution 720 \
  --duration 6

βž• Video Extension Workflows

# πŸ”— Extend video from Google Cloud Storage
ai-studio generate video \
  --prompt "the butterfly gracefully lands on a blooming flower petal" \
  --model veo2-001 \
  --extend-video "gs://your-bucket/nature-scene.mp4"

# πŸ“ Extend local video file
ai-studio generate video \
  --prompt "the sunset transforms into a starry night sky" \
  --model veo2-001 \
  --extend-video "./videos/sunset-base.mp4"

πŸ–ΌοΈ Image Generation (Coming Soon)

# 🎨 High-resolution artwork
ai-studio generate image \
  --prompt "abstract digital art with vibrant colors and geometric patterns" \
  --model imagen-3-ultra \
  --resolution 2048x2048 \
  --style artistic

# 🏒 Professional photography
ai-studio generate image \
  --prompt "modern office interior with natural lighting" \
  --model imagen-3-001 \
  --resolution 1920x1080 \
  --style photorealistic

πŸ€– AI Model Ecosystem

🎬 Video Models

Model 🎯 Best For Videos Duration πŸš€ Special Features
veo2-001 🎨 Creative & Flexible 4 5-8s βž• Video Extension, πŸ–ΌοΈ Image-to-Video
veo3-001 🎬 Professional & Stable 4 8s ✨ AI Prompt Enhancement
veo3-preview πŸ”¬ Latest Features 4 8s πŸ–ΌοΈ Image-to-Video, πŸ†• Beta Features

πŸ–ΌοΈ Image Models (Coming Soon)

Model 🎯 Best For Images Resolution πŸš€ Special Features
imagen-3-ultra 🎨 Ultra High Quality 12 Up to 4K 🎨 Style Control, ⚑ Fast Generation
imagen-3-001 πŸ“Έ Photorealistic 8 Up to 2K πŸ“· Photo-realistic, 🎭 Face Generation

🎡 Music Models (Planned)

Model 🎯 Best For Length Quality πŸš€ Special Features
musiclm-v2 🎼 Composition 30-120s Hi-Fi 🎹 Instrument Control, 🎡 Genre Specific

🎨 Creative Prompt Engineering

πŸ“ Universal Prompt Structure

[STYLE] + [SUBJECT] + [ACTION] + [SETTING] + [TECHNICAL] + [MOOD]

βœ… Professional Examples by Media Type

🎬 Video Prompts

# Cinematic
"Cinematic wide shot of a lone figure walking through misty forest path, golden morning light filtering through ancient trees, slow dolly forward, mysterious atmosphere"

# Documentary
"Documentary-style close-up of artisan hands crafting pottery on spinning wheel, natural lighting, steady camera, focused concentration"

πŸ–ΌοΈ Image Prompts (Coming Soon)

# Artistic
"Abstract expressionist painting with bold brushstrokes, vibrant blues and oranges, dynamic composition, oil on canvas texture"

# Photographic
"Professional headshot of businesswoman in modern office, soft natural lighting, shallow depth of field, confident expression"

🎡 Music Prompts (Planned)

# Instrumental
"Uplifting piano melody with string accompaniment, major key, 120 BPM, inspiring and motivational mood"

# Ambient
"Ethereal ambient soundscape with nature sounds, gentle synthesizer pads, relaxing meditation music"

❌ Avoid These Mistakes

❌ "make video"                   # Too vague, no media type specified
❌ "cool image of stuff"          # Lacks specific details
❌ "amazing epic best music"      # Over-hyped without substance

πŸ› οΈ Professional Development

πŸ“ Extensible Architecture

ai-media-studio-cli/
β”œβ”€β”€ 🎬 ai_media_studio_cli/
β”‚   β”œβ”€β”€ main.py              # Unified CLI application
β”‚   β”œβ”€β”€ ui_components.py     # Beautiful UI components
β”‚   β”œβ”€β”€ models_config.py     # Multi-modal AI configurations
β”‚   β”œβ”€β”€ model_manager.py     # Dynamic model handling
β”‚   β”œβ”€β”€ download.py          # Smart media download & organization
β”‚   β”œβ”€β”€ animations.py        # Progress & loading animations
β”‚   β”œβ”€β”€ generators/
β”‚   β”‚   β”œβ”€β”€ video.py         # Video generation logic
β”‚   β”‚   β”œβ”€β”€ image.py         # Image generation (coming soon)
β”‚   β”‚   └── music.py         # Music generation (planned)
β”œβ”€β”€ πŸ“š docs/
β”‚   β”œβ”€β”€ ADDING_NEW_MODELS.md # Developer guide
β”‚   β”œβ”€β”€ VIDEO_GENERATION.md  # Video-specific docs
β”‚   └── ROADMAP.md           # Future feature roadmap
β”œβ”€β”€ πŸ§ͺ tests/                # Comprehensive test suite
β”‚   β”œβ”€β”€ test_download.py     # Download functionality tests
β”‚   └── test_models.py       # Model integration tests
β”œβ”€β”€ βš™οΈ pyproject.toml        # Modern Python packaging
└── πŸ“– README.md             # This documentation

πŸš€ Performance & Scalability

  • Modular architecture for easy extension to new AI models
  • Async processing for all media types and downloads
  • Smart caching to reduce API costs
  • Batch processing for efficient generation workflows
  • Memory optimization for large media files
  • Plugin system for third-party model integration
  • Concurrent downloads with progress tracking
  • Automatic file organization by media type
  • GCS cleanup to minimize storage costs

πŸ—ΊοΈ Roadmap & Future Features

🎯 Phase 1: Video Foundation βœ…

  • βœ… Google Veo 2.0 & 3.0 integration
  • βœ… Video extension capabilities
  • βœ… Professional CLI interface
  • βœ… Smart media download & organization
  • βœ… Automatic folder structure (videos/, images/, audios/)
  • βœ… Concurrent downloads with progress tracking

🎯 Phase 2: Image Generation 🚧

  • πŸ”„ Google Imagen integration
  • πŸ”„ Multiple resolution support
  • πŸ”„ Style control and customization
  • πŸ”„ Batch image processing

🎯 Phase 3: Music Composition πŸ“‹

  • πŸ“‹ Google MusicLM integration
  • πŸ“‹ Genre and style control
  • πŸ“‹ Custom length generation
  • πŸ“‹ Audio format optimization

🎯 Phase 4: Advanced Features πŸ“‹

  • πŸ“‹ Multi-modal workflows (video + music)
  • πŸ“‹ Template system for common use cases
  • πŸ“‹ Cloud storage integration (AWS, Azure)
  • πŸ“‹ API rate limiting and optimization
  • πŸ“‹ Advanced prompt engineering tools

🎯 Advanced Configuration

πŸ”§ Environment Variables

# πŸ”‘ Required - Google AI Configuration
GOOGLE_CLOUD_PROJECT=your-project-id
GOOGLE_CLOUD_LOCATION=us-central1
GOOGLE_GENAI_USE_VERTEXAI=True
GOOGLE_API_KEY=your-google-api-key

# πŸͺ£ Required - Google Cloud Storage
GOOGLE_CLOUD_STORAGE_BUCKET=your-bucket-name
GOOGLE_CLOUD_STORAGE_PATH=videos

πŸ”‘ Google AI API Setup

  1. Get your Google AI API Key:

    • Visit Google AI Studio
    • Create a new API key for your project
    • Add it to your .env file as GOOGLE_API_KEY
  2. Configure Google Cloud Project:

    • GOOGLE_CLOUD_PROJECT: Your Google Cloud project ID
    • GOOGLE_CLOUD_LOCATION: Recommended: us-central1
    • GOOGLE_GENAI_USE_VERTEXAI: Set to True for production use

πŸͺ£ Google Cloud Storage Setup

The tool requires a GCS bucket for temporary video storage during generation:

  1. Create a GCS bucket in your Google Cloud project
  2. Set environment variables:
    • GOOGLE_CLOUD_STORAGE_BUCKET: Your bucket name (e.g., my-ai-videos)
    • GOOGLE_CLOUD_STORAGE_PATH: Path within bucket (optional, defaults to videos)
  3. Ensure permissions: Your service account needs Storage Object Admin role

πŸ“Š Intelligent Media Management

Generated content is automatically:

  • πŸ“ Organized by media type (videos/, images/, audios/)
  • 🏷️ Tagged with generation metadata
  • 🧹 Cleaned up from cloud storage (optional)
  • πŸ“ˆ Tracked with detailed analytics
  • πŸ”„ Versioned for iterative workflows
  • ⚑ Downloaded concurrently with progress tracking
  • 🎯 Sorted by file extension into appropriate folders
  • πŸ“¦ Supports 20+ media formats (MP4, JPG, MP3, etc.)

πŸ“ Smart Media Download & Organization

The CLI features an intelligent download system that automatically organizes your generated content:

🎯 Automatic Organization

# Downloads are automatically organized by media type
downloaded_media/
β”œβ”€β”€ videos/     # .mp4, .avi, .mov, .mkv, .wmv, .flv, .webm, .m4v, .3gp
β”œβ”€β”€ images/     # .jpg, .jpeg, .png, .gif, .bmp, .tiff, .svg, .webp, .ico
β”œβ”€β”€ audios/     # .mp3, .wav, .flac, .aac, .ogg, .wma, .m4a, .opus
└── unknown/    # Unrecognized file types

⚑ Performance Features

  • Concurrent Downloads: Multiple files downloaded simultaneously
  • Progress Tracking: Real-time progress bars with ETA
  • Resume Support: Automatic retry on network interruptions
  • GCS Cleanup: Optional cloud storage cleanup after download
  • Memory Efficient: Streaming downloads for large files

πŸ”§ Customization Options

# Disable automatic organization
ai-studio generate video --no-organize

# Custom download directory
ai-studio generate video --output-dir "my-custom-folder"

# Keep files in cloud storage (no cleanup)
ai-studio generate video --keep-cloud-files

🀝 Contributing to the Future

We welcome contributions that push the boundaries of AI media generation:

🎯 Contribution Areas

  1. 🎬 Video Generation: New models, effects, transitions
  2. πŸ–ΌοΈ Image Creation: Style transfer, artistic filters
  3. 🎡 Music Composition: Instrument separation, rhythm generation
  4. πŸ–₯️ User Experience: Interface improvements, workflow optimization
  5. πŸ”§ Technical: Performance, architecture, new integrations

πŸ† Code Standards

  • Type hints for all functions across all modules
  • Comprehensive docstrings with examples
  • Unit tests with >95% coverage for new features
  • Integration tests for AI model endpoints
  • Performance benchmarks for generation workflows

πŸ“ž Support & Community

🌟 Get Help & Connect

Issues Discussions Documentation

🎨 Media Showcase

Share your AI-generated content with the community! Gallery


πŸ“„ License & Attribution

This project is licensed under the MIT License - see the LICENSE file for complete details.

Third-party acknowledgments:

  • πŸ€– Google AI for Veo, Imagen, and MusicLM model access
  • 🎨 Rich for beautiful terminal UI
  • ⚑ Typer for modern CLI framework
  • πŸ”§ UV for fast Python package management

🎨 AI MEDIA STUDIO CLI

The future of AI media generation in your terminal

Created with ❀️ by Abdulrahman Elsmmany

GitHub

LinkedIn


⭐ Star this repository if it helped you create amazing AI content!

Let's build the future of AI media generation together πŸš€

πŸŽ¬πŸ–ΌοΈπŸŽ΅ Videos β€’ Images β€’ Music - All Powered by AI

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages