🧠 Vibe Mind - Multimodal Prompt Enrichment Tool

Translate your ideas — across text, images, and files — into high-quality, designer-grade prompts for any AI vibe-coding platform.

🔍 What It Does

This tool helps users who work visually or conceptually — not just through words — create clearer, more effective prompts for AI design and coding workflows (e.g., V0, Magic Pattern, or similar vibe coding tools).

It automatically:

📸 Extracts key insights from images, UI screenshots, or files
📝 Refines messy or incomplete text input
🧩 Merges all input into a structured, professional-quality prompt
💡 Optimizes for better AI output with fewer tokens

🚧 Why It Matters

Not every designer, developer, or maker is a prompt expert. Many users express their intent through images, sketches, Figma files, or rough notes — but today's AI tools often require precise, text-only input to perform well.

This project bridges that gap.

🌱 Our Vision

While we're not yet doing full context engineering, this project is a step in that direction. We aim to:

Make it easy for anyone to express intent in a multimodal way — and automatically translate that into agent-friendly context.

This aligns with emerging trends in AI research around prompt structuring, context-aware agents, and efficient multimodal design workflows.

✨ Current Features

🎯 Context-Aware Analysis Profiles: Specialized workflows for different vibe-coding scenarios
📁 Modular Profile System: Create, edit, and manage custom analysis profiles as JSON files
🔄 Dynamic Prompt Generation: Automatically structure insights into AI-ready prompts
📝 Professional Report Templates: Generate structured outputs optimized for downstream AI tools
🛠️ Interactive Workflow: User-friendly menu system for multimodal input processing
📊 Multiple Input Types: Support for local files, URLs, and various media formats
🧩 Template Customization: Each profile can define its own prompt structure and analysis steps

🚀 Quick Start

Install Dependencies:
```
pip install -r requirements.txt
```

Set Up API Key:

# Create .env file
echo "OPENAI_API_KEY=your_key_here" > .env

Start Enhancing Your Prompts:

# Quick demo with UI screenshot analysis
python quick_start.py

# Full multimodal prompt enrichment workflow
python vibe_mind.py

# Direct context extraction (for developers)
python chat_agent_with_image_analysis.py

Create Custom Workflows:
- Run python vibe_mind.py
- Choose "Manage Profiles" to create vibe-coding specific workflows
- Customize analysis steps for your target AI platform (V0, Magic Pattern, etc.)
- Generate optimized prompts for better AI output

🎯 Usage Examples

Simple Analysis

from chat_agent_with_image_analysis import ImageAnalysisChatAgent

agent = ImageAnalysisChatAgent()
result = agent.analyze_image("image.jpg", "What's in this image?")
print(result)

Multimodal Prompt Enrichment

from vibe_mind import StructuredImageAnalyzer

# Initialize the multimodal context enhancer
enhancer = StructuredImageAnalyzer()

# Interactive workflow for vibe-coding platforms
# - Process images, files, and text input
   # - Select target AI platform (V0, Magic Pattern, etc.)
# - Generate optimized, structured prompts

Custom Vibe-Coding Workflow

# Create a V0.dev-optimized workflow
v0_workflow = {
    'name': 'V0 Component Generator',
    'target_platform': 'v0.dev',
    'description': 'Convert UI concepts to V0-ready component prompts',
    'extraction_steps': [
        'Identify component structure and hierarchy',
        'Extract design tokens and styling patterns',
        'Map interactive elements and state management',
        'Determine responsive behavior and props'
    ],
    'prompt_template': '''Create a React component with the following specifications:

## Component Structure
{structure_analysis}

## Styling & Design
{styling_analysis}

## Functionality
{interaction_analysis}

## Props & State
{props_analysis}''',
    'optimization_rules': {
        'max_tokens': 1000,
        'focus_areas': ['component_architecture', 'styling_specificity', 'responsive_design']
    }
}

enhancer.create_profile('V0 Component Generator', v0_workflow)

# Process multimodal input → AI-ready prompt
result = enhancer.analyze_with_profile(
    image_url='ui_screenshot.png',
    profile=v0_workflow,
    custom_text='Create a dashboard card component with hover effects'
)

🔧 Configuration

Vibe Mind uses OpenAI's GPT-4O Mini for multimodal analysis. Configure your API access:

# .env file
OPENAI_API_KEY=your_openai_api_key_here

# Optional: Customize for your preferred AI platform
TARGET_PLATFORM=v0.dev  # or magic_pattern, lovable, etc.
MAX_PROMPT_TOKENS=1000
OPTIMIZATION_LEVEL=balanced  # or speed, quality

🎯 Vibe-Coding Workflows

🛠️ Custom Workflow Creation

Create workflows optimized for your specific AI vibe-coding platform:

Target Platform Setup: Define your AI tool (V0, Magic Pattern, Lovable, etc.)
Context Extraction Rules: Specify what insights to extract from multimodal input
Prompt Structure: Design the optimal format for your target AI
Token Optimization: Configure efficiency rules for better performance

📋 Workflow Structure

Each workflow profile contains:

Target Platform: Specific AI tool or platform (V0, Magic Pattern, etc.)
Extraction Steps: Multimodal analysis pipeline
Prompt Template: AI platform-optimized format
Optimization Rules: Token efficiency and clarity guidelines
Output Format: Structured, AI-ready prompt

// V0.dev Example
{
  "name": "V0 Component Builder",
  "target_platform": "v0.dev",
  "description": "Convert UI screenshots to V0-optimized component prompts",
  "extraction_steps": [
    "Identify component structure and hierarchy",
    "Extract styling and design tokens",
    "Determine interactive elements and states",
    "Map data flow and props"
  ],
  "prompt_template": "Create a React component...{structured_analysis}",
  "optimization_rules": {
    "max_tokens": 1000,
    "focus_areas": ["functionality", "styling", "responsiveness"]
  }
}

// Magic Pattern Example
{
  "name": "Magic Pattern UI Generator",
  "target_platform": "magic_pattern",
  "description": "Convert design concepts to Magic Pattern-optimized prompts",
  "extraction_steps": [
    "Analyze visual hierarchy and layout patterns",
    "Extract color schemes and typography",
    "Identify interactive components and behaviors",
    "Map responsive design requirements"
  ],
  "prompt_template": "Generate a UI pattern with these specifications...{structured_analysis}",
  "optimization_rules": {
    "max_tokens": 800,
    "focus_areas": ["design_patterns", "visual_consistency", "user_experience"]
  }
}

📦 Coming Soon

🔌 API for Programmatic Access: Integrate multimodal prompt enrichment into your existing workflows
🎯 V0.dev Integration Demo: Direct integration with popular vibe-coding platforms
📤 Plugin Support: Drag-and-drop for image/file upload and processing
🧠 Advanced Context Models: More sophisticated visual insight extraction

🤝 Contributing

Have ideas? Want to collaborate or experiment?

Open an issue or submit a pull request — we're open to co-building.

🎨 Example Use Cases

UI Screenshot → V0 Component

# Input: Dashboard screenshot + "Create a data visualization component"
# Output: Optimized V0 prompt with component structure, styling, and props

Design Mockup → Magic Pattern Integration

# Input: Figma export + interaction notes
# Output: Structured prompt for responsive component generation

Sketch → Technical Specification

# Input: Hand-drawn wireframe + feature requirements
# Output: Detailed technical prompt for AI coding assistant

Concept Art → Lovable App Builder

# Input: App concept sketches + user flow descriptions
# Output: Structured prompt for full-stack app generation

Acknowledgment

Built with CAMEL Framework • Ready for Demo

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.venv		.venv
extension		extension
frontend		frontend
openai		openai
public		public
.gitignore		.gitignore
CHROME_WEB_STORE_PUBLISHING.md		CHROME_WEB_STORE_PUBLISHING.md
INSTALLATION.md		INSTALLATION.md
LICENSE		LICENSE
OPENAI_BACKEND_INTEGRATION.md		OPENAI_BACKEND_INTEGRATION.md
PRE_PUBLICATION_CHECKLIST.md		PRE_PUBLICATION_CHECKLIST.md
Permission.md		Permission.md
RAILWAY_DEPLOYMENT.md		RAILWAY_DEPLOYMENT.md
README.md		README.md
SETUP.md		SETUP.md
STORE_LISTING.md		STORE_LISTING.md
SWITCHING_ENVIRONMENTS.md		SWITCHING_ENVIRONMENTS.md
TROUBLESHOOTING.md		TROUBLESHOOTING.md
reload-extension.js		reload-extension.js
start_backend.sh		start_backend.sh
start_dev.sh		start_dev.sh
start_openai_backend.sh		start_openai_backend.sh
stop_backend.sh		stop_backend.sh
test_analysis.py		test_analysis.py
test_extension_api.html		test_extension_api.html
vibe-mind-extension-v1.0.0.zip		vibe-mind-extension-v1.0.0.zip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 Vibe Mind - Multimodal Prompt Enrichment Tool

🔍 What It Does

🚧 Why It Matters

🌱 Our Vision

✨ Current Features

🚀 Quick Start

🎯 Usage Examples

Simple Analysis

Multimodal Prompt Enrichment

Custom Vibe-Coding Workflow

🔧 Configuration

🎯 Vibe-Coding Workflows

🛠️ Custom Workflow Creation

📋 Workflow Structure

📦 Coming Soon

🤝 Contributing

🎨 Example Use Cases

UI Screenshot → V0 Component

Design Mockup → Magic Pattern Integration

Sketch → Technical Specification

Concept Art → Lovable App Builder

Acknowledgment

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

Douglasymlai/vibemind

Folders and files

Latest commit

History

Repository files navigation

🧠 Vibe Mind - Multimodal Prompt Enrichment Tool

🔍 What It Does

🚧 Why It Matters

🌱 Our Vision

✨ Current Features

🚀 Quick Start

🎯 Usage Examples

Simple Analysis

Multimodal Prompt Enrichment

Custom Vibe-Coding Workflow

🔧 Configuration

🎯 Vibe-Coding Workflows

🛠️ Custom Workflow Creation

📋 Workflow Structure

📦 Coming Soon

🤝 Contributing

🎨 Example Use Cases

UI Screenshot → V0 Component

Design Mockup → Magic Pattern Integration

Sketch → Technical Specification

Concept Art → Lovable App Builder

Acknowledgment

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages