Skip to content

Sai-Deekshith-06/Vision-Tagger-AI

Repository files navigation

VisionTagger-AI

VisionTagger AI is an AI-powered image tagging and metadata generation tool that leverages Google's Gemini Vision Pro to automate the image annotation process. This system generates accurate, descriptive tags and structured metadata in JSON format, eliminating the need for manual image tagging.

Vision.Tagger.-.Made.with.Clipchamp.1.mp4

🔹 Features

  • 🏷️ Automatic Image Tagging – Extract meaningful tags for any image.
  • 📝 JSON Metadata Generation – Get structured metadata for images.
  • 🔍 AI-Powered Accuracy – Uses Google Gemini Vision Pro for precise results.
  • 📂 Supports Multiple Image Formats – PNG, JPG, and more.
  • 🌐 Web-Based Application – Simple and user-friendly UI.

🔹 Tech Stack

  • Frontend: HTML, CSS, JavaScript
  • Backend: Python (Flask)
  • API Used: Google Gemini Vision Pro (Paid API – using trial version)

🔹 Setup Instructions

🛠 Prerequisites

  • Install Python 3.x
  • Install dependencies: - pip install -r requirements.txt 🚀 Running the Application - python app.py

🔹 API Access ⚠ Note: We are using a paid API on a trial basis. If you encounter an API-related error, contact the developers for an updated API key.

🔹 Contributing Pull requests are welcome! Feel free to submit issues or feature requests.

🔹 License MIT License – Use it freely!

About

Using Google's Vision Pro API

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published