Skip to content

artickc/veo-vibe-banana

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

1 Commit
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🍌 Veo Vibe Banana

License: MIT MCP TypeScript

An MCP (Model Context Protocol) server for generating images and videos using Google Imagen and Veo through CLIProxyAPI.

🎯 Compatibility

This server is primarily designed for VibeProxy - a Windows application for AI model proxying:

Download from Microsoft Store

Also compatible with:

✨ Features

  • πŸ–ΌοΈ Image Generation: Generate images using Google Imagen via gemini-3-pro-image-preview model
  • 🎬 Video Generation: Generate videos using Google Veo (requires CLIProxyAPI with Veo support - see Issue #94)
  • 🎨 Reference Images: Support for multiple reference images for style transfer, image editing, and image-to-video generation

⚠️ Note: Video generation requires CLIProxyAPI to have Veo models enabled. Check your CLIProxyAPI /v1/models endpoint to verify if veo-* models are available.

πŸ“¦ Installation

npm install
npm run build

βš™οΈ Configuration

The server can be configured using environment variables:

Variable Description Default
CLIPROXY_ENDPOINT CLIProxyAPI endpoint URL http://127.0.0.1:8317/v1
CLIPROXY_AUTH_TOKEN Authentication token sk-cliproxyapi-dummy
OUTPUT_DIR Directory for saving generated files ./generated

πŸš€ Usage

Running the MCP Server

npm start
# or
node dist/index.js

πŸ”§ MCP Configuration

Add to your MCP client configuration (e.g., opencode.json):

{
  "mcp": {
    "veo-vibe-banana": {
      "type": "local",
      "command": ["node", "H:/Lucru/MCP/veo-vibe-banana/dist/index.js"],
      "enabled": true,
      "environment": {
        "CLIPROXY_ENDPOINT": "http://127.0.0.1:8317/v1",
        "OUTPUT_DIR": "./generated"
      }
    }
  }
}

πŸ› οΈ Available Tools

πŸ–ΌοΈ generate_image

Generate an image using Google Imagen.

Parameters:

  • prompt (required): Text description of the image to generate
  • reference_images: Array of reference images (file paths, URLs, or data URLs)
  • model: Model to use (default: gemini-3-pro-image-preview)
  • aspect_ratio: 1:1, 16:9, 9:16, 4:3, 3:4 (default: 1:1)
  • image_size: 1K, 2K, 4K (default: 1K)
  • output_dir: Directory to save the image
  • save_to_file: Whether to save to file (default: true)

Example - Basic:

{
  "prompt": "A cute cat playing with a ball of yarn",
  "aspect_ratio": "16:9",
  "image_size": "2K"
}

Example - With Reference Images:

{
  "prompt": "Transform this image into a watercolor painting style",
  "reference_images": [
    "/path/to/source/image.png",
    "https://example.com/style-reference.jpg"
  ],
  "aspect_ratio": "1:1"
}

🎬 generate_video

Generate a video using Google Veo.

Parameters:

  • prompt (required): Text description of the video to generate
  • reference_images: Array of reference images for image-to-video generation
  • model: Model to use (default: veo-3.1-generate-preview)
    • veo-3.1-generate-preview - Latest preview
    • veo-3.1-fast-generate-preview - Faster generation
    • veo-3.0-generate-001 - Stable version
    • veo-3.0-fast-generate-001 - Fast stable version
  • duration_seconds: 4, 6, 8 (default: 8)
  • aspect_ratio: 16:9, 9:16 (default: 16:9)
  • resolution: 720p, 1080p (default: 720p)
  • negative_prompt: What NOT to include in the video
  • output_dir: Directory to save the video
  • save_to_file: Whether to save to file (default: true)

Example - Basic:

{
  "prompt": "A cinematic shot of a majestic lion walking through the savannah at sunset",
  "duration_seconds": "8",
  "aspect_ratio": "16:9",
  "resolution": "1080p",
  "negative_prompt": "cartoon, low quality, blurry"
}

Example - Image-to-Video:

{
  "prompt": "Animate this character walking forward with a gentle breeze",
  "reference_images": ["/path/to/character.png"],
  "duration_seconds": "4",
  "resolution": "1080p"
}

🎨 Reference Images

Reference images can be provided in multiple formats:

Format Example
File path (absolute) /home/user/images/photo.png
File path (relative) ./images/photo.png
URL https://example.com/image.jpg
Data URL data:image/png;base64,iVBORw0KGgo...

Use Cases

Image Generation with References:

  • Style transfer: Apply the style of reference images to new content
  • Image editing: Modify or enhance existing images
  • Composition: Combine elements from multiple reference images

Video Generation with References (Image-to-Video):

  • Animate still images: Bring photos or artwork to life
  • Character animation: Animate a character from a reference image
  • Scene transitions: Create video transitions between images

πŸ“‘ API Format

This MCP server uses the CLIProxyAPI format as defined in CLIProxyAPIPlus Issue #94.

Image Generation Request

{
  "model": "gemini-3-pro-image-preview",
  "messages": [{
    "role": "user",
    "content": [
      {"type": "image_url", "image_url": {"url": "data:image/png;base64,..."}},
      {"type": "text", "text": "A cute cat"}
    ]
  }],
  "modalities": ["image", "text"],
  "image_config": {
    "aspect_ratio": "1:1",
    "image_size": "1K"
  }
}

Video Generation Request (Image-to-Video)

{
  "model": "veo-3.1-generate-preview",
  "messages": [{
    "role": "user",
    "content": [
      {"type": "image_url", "image_url": {"url": "data:image/png;base64,..."}},
      {"type": "text", "text": "Animate this image with gentle motion"}
    ]
  }],
  "modalities": ["video", "text"],
  "video_config": {
    "duration_seconds": "8",
    "aspect_ratio": "16:9",
    "resolution": "720p"
  }
}

πŸ’» Development

# Run in development mode
npm run dev

# Build
npm run build

πŸ€– AI Assistant Integration

This project includes a SKILL.md file that provides comprehensive guidance for AI assistants working with image and video generation. This file is compatible with AI-powered IDEs and CLI tools such as:

  • πŸ”Ή Codex - AI coding assistant
  • πŸ”Ή Claude - Anthropic's AI assistant
  • πŸ”Ή OpenCode - AI-powered code editor
  • πŸ”Ή Cursor - AI-first code editor
  • πŸ”Ή Other IDEs supporting SKILL.md files

The SKILL.md file helps AI assistants:

  • βœ… Understand when and how to use image/video generation tools
  • βœ… Craft optimal prompts for better generation results
  • βœ… Choose appropriate parameters (aspect ratio, resolution, etc.)
  • βœ… Handle reference images for style transfer and editing
  • βœ… Follow best practices for generation workflows

πŸ’‘ For AI Assistants: Load the SKILL.md file when users request image or video generation to ensure optimal tool usage and prompt engineering.

πŸ“„ License

MIT

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors