A Streamlit demo that allows you to upload videos and chat with them using Google's Gemini AI with multimodal capabilities.
- 📹 Video Upload: Support for multiple video formats (MP4, AVI, MOV, MKV, WEBM)
- 🤖 AI-Powered Chat: Ask questions about your video content using Gemini's advanced video understanding
- 💬 Interactive Interface: Clean chat interface with streaming responses
- 🔄 Session Management: Maintain chat history and video context
- ⚡ Real-time Processing: Upload and process videos with progress feedback
-
Install Dependencies
pip install -r requirements.txt
-
Get Gemini API Key
- Visit Google AI Studio
- Create a new API key
- Keep it secure - you'll enter it in the app
-
Run the Application
streamlit run app.py
- Enter API Key: Input your Gemini API key in the sidebar
- Upload Video: Choose a video file (supported formats listed above)
- Wait for Processing: The video will be uploaded and processed by Gemini
- Start Chatting: Ask questions about your video content!
- "What is happening in this video?"
- "Summarize the main events"
- "Who are the people in this video?"
- "What objects can you see?"
- "Describe the setting and environment"
- "What actions are taking place?"
- Video Processing: Uses Gemini's File API for video upload and processing
- Multimodal AI: Combines video understanding with natural language processing
- File Size Limits: Large files (>100MB) may take longer to process
- Supported Formats: MP4, AVI, MOV, MKV, WEBM
- Video processing time depends on file size and complexity
- Large files may fail to upload or process
- API rate limits may apply based on your Gemini API plan
- Some video formats may not be supported
- Upload Fails: Check video format and file size
- Processing Stuck: Wait a few minutes, large files take time
- API Errors: Verify your API key is correct and has sufficient quota
- No Response: Try refreshing the page and re-uploading the video
- Streamlit - Web app framework
- Google Gemini API - Multimodal AI capabilities
- Python - Backend processing
Part of the AI Engineering Hub - Building practical AI applications