
Rehearse AI

🏆 Hackathon Project: Built for the Gemini x Pipecat Virtual Hackathon: Build Adaptive Agents with Real-Time Intelligence

Practice real-life conversations with AI and get constructive feedback on your communication skills.

Overview

Rehearse AI is a web application where users can practice conversations with AI personas simulating real people (recruiters, managers, dates, customers, etc.). After each conversation, the AI provides detailed feedback on clarity, empathy, conciseness, and filler words, and suggests alternative phrasings.

Note: Currently only the web application is functional. The mobile app is a work in progress - see apps/mobile/README.md for details.

Architecture

```mermaid
flowchart TB
    User[User Browser] --> WebApp[React Web App]
    WebApp --> Daily[Daily.co WebRTC]
    Daily <--> Bot[Pipecat Bot]
    Bot --> STT[Speechmatics/Deepgram STT]
    Bot --> LLM[OpenAI/Gemini LLM]
    Bot --> TTS[OpenAI/Gemini TTS]
    Bot -.Optional.-> Tavus[Tavus Video Avatar]
    Bot -.Optional.-> Mem0[Mem0 Memory]
```

Flow: User speaks → Daily.co → Pipecat Bot → STT → LLM → TTS → Audio Response → User
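The flow above can be modeled as a chain of processors that each transform a frame and pass it on. The sketch below is plain Python to illustrate the data flow only; the stage names and stub functions are assumptions, not the actual Pipecat API, and the real bot wires real services (Daily transport, STT, LLM, TTS) into its pipeline.

```python
from dataclasses import dataclass, field
from typing import Callable

# Illustrative model of the stage ordering; not the Pipecat API.
@dataclass
class Pipeline:
    stages: list[tuple[str, Callable]] = field(default_factory=list)

    def add_stage(self, name: str, fn: Callable) -> "Pipeline":
        self.stages.append((name, fn))
        return self

    def run(self, frame):
        # Each stage consumes the previous stage's output
        for _name, fn in self.stages:
            frame = fn(frame)
        return frame

# Stub stages standing in for the real STT/LLM/TTS services
pipeline = (
    Pipeline()
    .add_stage("stt", lambda audio: f"transcript({audio})")
    .add_stage("llm", lambda text: f"reply({text})")
    .add_stage("tts", lambda text: f"audio({text})")
)

print(pipeline.run("user_audio"))  # → audio(reply(transcript(user_audio)))
```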

Pipecat Pipeline

The Pipecat bot processes conversations through a real-time pipeline:

  1. Audio Input → Daily.co receives audio from the user's browser via WebRTC
  2. Speech-to-Text (STT) → Speechmatics or Deepgram transcribes audio to text
    • Detects filler words ("um", "uh", "like") for feedback analysis
    • Provides real-time transcription with low latency
  3. Language Model (LLM) → OpenAI GPT-4o-mini or Google Gemini generates responses
    • Maintains conversation context and persona characteristics
    • Accesses Mem0 memory (optional) for personalized feedback
    • Adapts to the scenario (job interview, customer interaction, etc.)
  4. Text-to-Speech (TTS) → OpenAI or Google Gemini converts response to natural speech
    • Tavus video avatar (optional) provides synchronized video
    • Low-latency streaming for natural conversation flow
  5. Audio Output → Daily.co streams audio (and video) back to user's browser

The pipeline runs continuously during the conversation, enabling natural back-and-forth dialogue with minimal delay.
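The filler-word detection mentioned in the STT step could be done with a simple word-boundary scan over the transcript. This is a hypothetical sketch; the filler list and tokenization here are assumptions, not the bot's actual implementation.

```python
import re

# Assumed filler list for illustration; the real bot may track more.
FILLER_WORDS = {"um", "uh", "like", "you know"}

def count_fillers(transcript: str) -> dict[str, int]:
    """Count filler-word occurrences in an STT transcript."""
    text = transcript.lower()
    counts: dict[str, int] = {}
    for filler in FILLER_WORDS:
        # \b word boundaries so e.g. "likely" does not match "like"
        n = len(re.findall(rf"\b{re.escape(filler)}\b", text))
        if n:
            counts[filler] = n
    return counts

print(count_fillers("Um, I like, uh, think that's likely fine"))
```

Counts like these can then be folded into the post-session feedback report alongside the LLM's qualitative analysis.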

Features

  • 🎙️ Voice Input/Output: Natural speech-to-text and text-to-speech conversation flow
  • 🎭 Multiple Personas: Practice with different characters (recruiter, manager, date, customer)
  • 📊 Detailed Feedback: Get constructive analysis on tone, content, and areas for improvement
  • 📱 Responsive Design: Clean, minimal UI that works well on desktop and mobile browsers
  • 💾 Session History: Review your last 3 conversation sessions
  • 🔄 Try Again: Retry scenarios to improve your approach
  • 🎥 Video Avatars (Optional): Realistic video avatars via Tavus
  • 🧠 AI Memory (Optional): Personalized feedback tracking via Mem0
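The "last 3 sessions" history behaves like a bounded buffer that evicts the oldest entry. A minimal in-memory sketch follows; in the app the sessions are actually persisted (the Supabase directory suggests a database-backed store), so this is illustrative only.

```python
from collections import deque

# Keep only the three most recent session IDs; older ones are
# evicted automatically when a fourth is appended.
history: deque[str] = deque(maxlen=3)

for session_id in ["s1", "s2", "s3", "s4"]:
    history.append(session_id)

print(list(history))  # → ['s2', 's3', 's4']
```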

Tech Stack

  • Frontend: React 18, TypeScript, Vite, TailwindCSS
  • Backend: Daily.co (WebRTC), Pipecat (Python bot)
  • AI Services: OpenAI GPT-4o-mini or Google Gemini (LLM), Speechmatics/Deepgram (STT), OpenAI/Gemini (TTS)
  • Optional: Tavus (video avatars), Mem0 (AI memory)

Quick Start

  1. Clone and Install:

    git clone <your-repo-url>
    cd rehearse-ai
    npm install

  2. Configure Environment Variables:

    cp apps/web/env.example .env
    cd pipecat-agent && cp .env.example .env

    Edit both .env files with your API keys. See docs/API_KEYS.md for details.

  3. Install Backend Dependencies:

    cd pipecat-agent
    uv sync

  4. Start Backend and Frontend:

    # Terminal 1: Start backend
    cd pipecat-agent && python local_api.py

    # Terminal 2: Start frontend
    npm run dev --workspace=web

  5. Open Browser: Navigate to http://localhost:5173

For detailed setup instructions, see SETUP.md.

Documentation

Project Structure

rehearse-ai/
├── apps/
│   ├── web/                 # React + Vite web application
│   └── mobile/              # React Native app (Work in Progress)
├── packages/
│   └── shared/              # Shared business logic and types
├── pipecat-agent/           # Python bot backend
├── supabase/                # Database and edge functions
└── docs/                    # Documentation

Usage

  1. Start a Session: Enter who you want to talk to and the scenario
  2. Conversation: Use the microphone button to speak - the AI will respond with voice
  3. End Session: Stop the conversation when ready
  4. Get Feedback: Review AI-generated feedback on your performance
  5. Try Again: Retry the same scenario or start a new one

Building for Production

npm run build:web
# Output: apps/web/dist/

Note: Production deployment is not yet implemented. The application currently runs in local development mode only.

Future Enhancements

  • PWA support for mobile installation
  • Additional personas and scenarios
  • Advanced feedback metrics and scoring
  • Export conversation transcripts
  • Support for multiple languages
  • React Native mobile app version
