Live AI-generated Real-Time Translation System

Introduction

This project demonstrates a powerful real-time translation system that automatically transcribes and translates live speech into multiple languages. Perfect for multilingual conferences, educational sessions, or international meetings.

Key Features

Real-Time Translation: Instant translation of live speech
Multi-Language Support: Currently supports English, French, German, Spanish, and Japanese
Single Host System: Optimized for one speaker with multiple listeners
Language Preferences: Each listener can choose their preferred language
High-Quality Speech Recognition: Powered by Deepgram's advanced STT
Neural Translation: Utilizing Gemini API for accurate translations

Technical Stack

🌐 LiveKit: Real-time communication infrastructure
🤖 LiveKit Agents: Backend processing and coordination
👂 Deepgram: Speech-to-text processing
🌍 Google Gemini AI: Neural machine translation
⚡ Next.js: Frontend framework

System Architecture

Room Creation & Management
- Automatic agent joining on room creation
- Dynamic host detection and mic stream subscription
Speech Processing Pipeline
- Real-time audio streaming
- Speech-to-text conversion via Deepgram
- Neural translation processing
Translation Distribution
- Language-specific routing
- Real-time caption delivery
- Multi-user synchronization

Running the demo

Run the agent

cd server
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
cp .env.example .env
add values for keys in .env
python main.py dev

Run the client

cd client/web
pnpm i
cp .env.example .env.local
add values for keys in .env.local
pnpm dev
open a browser and navigate to http://localhost:3000

Known Limitations

Single host restriction per session
Occasional UI glitches when multiple browser windows are open
STT performance may degrade with multiple concurrent connections

Extending the System

You can easily add support for additional languages by modifying the language configuration in the agent code. The system is designed to be modular and extensible.

Need Professional Implementation?

Looking to implement a similar system for your organization? We specializes in building custom AI-powered solutions.

🔗 Contact Us for Professional Implementation

Documentation

For more information about LiveKit Agents and their capabilities, visit: LiveKit Agents Documentation

License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
client/web		client/web
server		server
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Live AI-generated Real-Time Translation System

Introduction

Key Features

Technical Stack

System Architecture

Running the demo

Run the agent

Run the client

Known Limitations

Extending the System

Need Professional Implementation?

Documentation

License

About

Uh oh!

Releases

Packages

Languages

hiteshchouhan22/AI-Real-Time-Translation-System

Folders and files

Latest commit

History

Repository files navigation

Live AI-generated Real-Time Translation System

Introduction

Key Features

Technical Stack

System Architecture

Running the demo

Run the agent

Run the client

Known Limitations

Extending the System

Need Professional Implementation?

Documentation

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages