YouTube AI Transcript Assistant

A Chrome Extension + AI Backend that allows users to ask questions about any YouTube video. It fetches the video's transcript, chunks it, embeds it with SentenceTransformers, stores it in FAISS, and uses Gemini AI to answer questions contextually.
Ask anything about the video. No need to scrub through the timeline!
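The chunking step mentioned above can be sketched as follows. This is a minimal illustration, not the project's actual implementation: the function name `chunk_text`, the word-based splitting, and the default sizes are all assumptions chosen for the example.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 40) -> list[str]:
    """Split text into word-based chunks; neighbours share `overlap` words.

    Hypothetical helper: the real backend may chunk by characters,
    sentences, or tokens instead.
    """
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")

    words = text.split()
    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # this window already reached the end of the transcript
    return chunks
```

Overlapping windows are a common choice because a sentence cut at a chunk boundary still appears whole in one of the two neighbouring chunks.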
Demo
screen-recording-2025-06-15-150244_QvE2gRoQ.mp4
Tech Stack

Frontend: Chrome Extension (HTML, JS, CSS)
Backend: FastAPI
Transcript Fetching: yt-dlp, .vtt conversion
Embeddings: sentence-transformers
Vector Store: FAISS
LLM: langchain-google-genai using Gemini
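As an illustration of the .vtt conversion step, here is a minimal sketch that turns WebVTT subtitle text into plain transcript text using only the standard library. The function name `vtt_to_text` is hypothetical, and the project's own converter may work differently. The sketch skips the header and NOTE/STYLE/REGION blocks, drops timing lines and inline tags, and collapses consecutive duplicate lines, which auto-generated captions often contain.

```python
import html
import re

# Inline cue markup such as <c>, </c>, or <00:00:00.500>
TAG = re.compile(r"<[^>]+>")


def vtt_to_text(vtt: str) -> str:
    """Convert WebVTT content to a single plain-text string."""
    # Cues and header blocks are separated by blank lines.
    blocks = re.split(r"\n\s*\n", vtt.replace("\r\n", "\n").strip())
    out: list[str] = []
    for block in blocks:
        lines = block.split("\n")
        if lines[0].startswith(("WEBVTT", "NOTE", "STYLE", "REGION")):
            continue  # not a cue
        # The cue text is everything after the timing line.
        for i, line in enumerate(lines):
            if "-->" in line:
                text_lines = lines[i + 1:]
                break
        else:
            continue  # no timing line found, so not a cue
        for line in text_lines:
            clean = html.unescape(TAG.sub("", line)).strip()
            # Skip empty lines and immediate repeats of the previous line.
            if clean and (not out or out[-1] != clean):
                out.append(clean)
    return " ".join(out)
```

The resulting string can then be passed to whatever chunking step the backend uses.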
Features

Chrome Extension to interact with any YouTube video
Automatic transcript download + parsing
Vector-based semantic search using FAISS
Context-aware AI answers via Gemini API
Seamless integration between frontend & backend
Real-time processing with loading states
Modern glassmorphism UI design
Responsive extension popup interface
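To make the semantic search and context-aware answering concrete, the sketch below retrieves the most similar chunks for a question and assembles a prompt from them. It is deliberately not FAISS: it uses brute-force cosine similarity in plain Python so it runs without dependencies, and the vectors are toy values rather than SentenceTransformers output. The names `cosine`, `top_k`, and `build_prompt`, as well as the prompt wording, are assumptions for illustration only.

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_k(query_vec, chunk_vecs, chunks, k=3):
    """Return the k chunks whose vectors are most similar to the query."""
    scored = sorted(
        zip(chunks, chunk_vecs),
        key=lambda pair: cosine(query_vec, pair[1]),
        reverse=True,
    )
    return [chunk for chunk, _ in scored[:k]]


def build_prompt(question: str, context_chunks: list[str]) -> str:
    """Assemble an LLM prompt from retrieved transcript chunks (hypothetical wording)."""
    context = "\n\n".join(context_chunks)
    return (
        "Answer the question using only this transcript context.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )
```

In the real backend, a FAISS index would replace `top_k` and the prompt would be sent to Gemini. The overall flow (embed the question, retrieve nearby chunks, prompt the model with them) is what the sketch is meant to show.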
Running the Project

1. Backend Setup

    cd backend
    python -m venv venv
    source venv/bin/activate  # or venv\Scripts\activate on Windows
    pip install -r requirements.txt
    uvicorn main:app --reload

Make sure .env contains:

    GOOGLE_API_KEY=your_google_gemini_api_key

2. Chrome Extension Setup

1. Open chrome://extensions/
2. Enable Developer mode
3. Click Load unpacked
4. Select the extension/ folder
5. Click the extension icon on YouTube and ask a question!
API Setup

Get a Google Gemini API Key

1. Go to Google AI Studio
2. Create a new API key
3. Add it to the .env file in the backend folder
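Before starting the server, it can help to confirm that the key was actually added. The sketch below parses .env-style text with the standard library and checks that `GOOGLE_API_KEY` is set to something other than the placeholder. The helper names `parse_env` and `has_api_key` are hypothetical, and the backend itself may load the file differently (for example through a dotenv library).

```python
def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=value lines, ignoring blanks and # comments."""
    values: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def has_api_key(text: str, name: str = "GOOGLE_API_KEY") -> bool:
    """True if the key is present, non-empty, and not the README placeholder."""
    value = parse_env(text).get(name, "")
    return bool(value) and value != "your_google_gemini_api_key"
```

A typical use would be reading `backend/.env` with `open(...).read()` and passing the contents to `has_api_key`.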
Usage

1. Navigate to any YouTube video
2. Click the extension icon in your browser
3. Type your question about the video content
4. Get AI-powered answers based on the transcript!
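Behind these steps, the backend needs to know which video the question refers to. The sketch below shows one way to extract a video ID from common YouTube URL shapes and pair it with the question. The helper names and the `video_id` / `question` field names are assumptions for illustration; the actual request format used between the extension and the backend is not documented here.

```python
from typing import Optional
from urllib.parse import parse_qs, urlparse

YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com"}


def extract_video_id(url: str) -> Optional[str]:
    """Return the video ID from a YouTube URL, or None if none is found."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if host == "youtu.be":
        # Short links carry the ID as the first path segment.
        return parsed.path.lstrip("/").split("/")[0] or None
    if host in YOUTUBE_HOSTS:
        if parsed.path == "/watch":
            return parse_qs(parsed.query).get("v", [None])[0]
        for prefix in ("/shorts/", "/embed/"):
            if parsed.path.startswith(prefix):
                return parsed.path[len(prefix):].split("/")[0] or None
    return None


def build_request(url: str, question: str) -> dict:
    """Build a hypothetical JSON payload for the backend."""
    video_id = extract_video_id(url)
    if video_id is None:
        raise ValueError(f"not a recognised YouTube video URL: {url}")
    return {"video_id": video_id, "question": question}
```

For example, `build_request("https://youtu.be/dQw4w9WgXcQ", "Summarize the video in 3 bullet points")` would produce a payload carrying the ID `dQw4w9WgXcQ` and the question text.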
Example Questions:

"What are the main points discussed?"
"Summarize the video in 3 bullet points"
"What does the speaker say about [specific topic]?"
"At what timestamp is [specific topic] mentioned?"