AutoDub

AutoDub is an AI-powered tool that automatically translates and dubs YouTube videos into other languages, adjusting video speed per speech segment so the picture stays aligned with the dubbed audio. It combines speech recognition (Whisper), translation (DeepL), and voice cloning (XTTS v2) to produce natural-sounding dubbed videos.

Features

Automatic Video Processing: Downloads YouTube videos using yt-dlp and extracts audio automatically
Speech Recognition: Uses Whisper AI for accurate speech-to-text transcription
Voice Separation: Splits original audio into vocal and instrumental tracks using Spleeter
Neural Translation: Supports high-quality translation through DeepL API
Voice Cloning: Uses XTTS v2 for natural-sounding voice synthesis that matches the original speaker
Intelligent Video Speed Adjustment: Automatically adjusts video speed per speech segment to maintain lip-sync
Background Music Preservation: Maintains original background music and sound effects
Multi-language Support: Can translate and dub into multiple target languages

Prerequisites

Python 3.8+
CUDA-capable GPU (recommended for faster processing)
FFmpeg installed and added to system PATH
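
As a quick sanity check before installing, the prerequisites above can be verified from Python. This is a minimal sketch, not part of AutoDub itself: the function name check_prerequisites is hypothetical, and it only checks the Python version and whether an ffmpeg executable is on PATH (it does not probe for a GPU).

```python
import shutil
import sys


def check_prerequisites(min_python=(3, 8)):
    """Return a list of problems found; an empty list means all checks passed.

    Hypothetical helper for illustration, not part of the AutoDub code base.
    """
    problems = []

    # Compare the running interpreter against the minimum supported version.
    if sys.version_info < min_python:
        required = ".".join(str(part) for part in min_python)
        problems.append(f"Python {required}+ is required")

    # shutil.which returns None when the executable is not found on PATH.
    if shutil.which("ffmpeg") is None:
        problems.append("ffmpeg was not found on PATH")

    return problems


if __name__ == "__main__":
    issues = check_prerequisites()
    for issue in issues:
        print(issue)
    if not issues:
        print("All prerequisite checks passed")
```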

Installation

Clone the repository:

git clone https://github.com/frrobledo/AutoDub.git
cd AutoDub

Install required packages:

pip install -r requirements.txt

Install additional dependencies:

apt-get install ffmpeg  # for Debian-based systems

For other operating systems, refer to the FFmpeg installation guide.

Set up API keys:
- Create a DeepL API account and add your API key to the configuration
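
This README does not say where the configuration lives, so the snippet below is only one possible approach: reading the key from an environment variable instead of hard-coding it. The variable name DEEPL_API_KEY and the function load_deepl_key are assumptions made for illustration, not names taken from the project.

```python
import os


def load_deepl_key(env_var="DEEPL_API_KEY"):
    """Read the DeepL API key from the environment.

    The variable name is an assumption; adapt it to however the
    project's configuration actually stores the key.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"Set the {env_var} environment variable to your DeepL API key"
        )
    return key
```

Keeping the key out of source files makes it harder to commit it to the repository by accident.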

Project Structure

├── tools/
│   ├── audio_synthesis.py     # Voice cloning and audio processing
│   ├── transcriber.py         # Speech recognition and translation
│   ├── video_editing.py       # Video speed adjustment and editing
│   ├── video_downloader.py    # YouTube video downloading
│   ├── audio_splitter_ffmpeg.py # Audio separation
│   └── logger.py             # Logging utilities
├── main.py                   # Main execution script
└── README.md

Usage

Run the main script:

python main.py

Enter the YouTube URL when prompted.
The script will automatically:
- Download the video
- Extract and transcribe the audio
- Separate speech from background audio
- Translate the speech
- Clone the voice in the target language
- Adjust video speed for lip-sync
- Combine everything into the final video
Find the output video in the final_output directory.

How It Works

Video Processing:
- Downloads YouTube video using yt-dlp
- Extracts audio track
- Separates vocals from background using Spleeter
Speech Processing:
- Transcribes speech using Whisper AI
- Detects spoken language automatically
- Translates text using DeepL API
Voice Synthesis:
- Clones original voice using XTTS v2
- Generates speech in target language
- Matches timing of original speech segments
Video Adjustment:
- Analyzes duration of original vs. translated speech
- Adjusts video speed per segment for lip-sync
- Preserves original background audio
- Combines all elements into final video
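
The per-segment speed adjustment in the last step can be sketched as a ratio of durations. This is an illustrative sketch under stated assumptions, not the code in tools/video_editing.py: the Segment class, the speed_factor function, and the clamping limits of 0.5 and 2.0 are all hypothetical. The idea is that if the dubbed speech is longer than the original segment, the video is slowed down (factor below 1), and if it is shorter, the video is sped up (factor above 1).

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One speech segment (hypothetical structure for illustration)."""
    start: float            # segment start in the original video, in seconds
    end: float              # segment end in the original video, in seconds
    dubbed_duration: float  # length of the synthesized speech, in seconds


def speed_factor(segment, min_factor=0.5, max_factor=2.0):
    """Return the playback speed multiplier for the video during a segment.

    factor = original duration / dubbed duration, clamped to
    [min_factor, max_factor] so the picture never plays absurdly
    slow or fast. The clamp limits are assumptions.
    """
    original = segment.end - segment.start
    # Degenerate segments are left at normal speed.
    if original <= 0 or segment.dubbed_duration <= 0:
        return 1.0
    factor = original / segment.dubbed_duration
    return max(min_factor, min(max_factor, factor))


if __name__ == "__main__":
    segments = [
        Segment(0.0, 4.0, 5.0),  # dubbed speech is longer: slow the video down
        Segment(4.0, 6.0, 1.0),  # dubbed speech is shorter: speed the video up
        Segment(6.0, 7.0, 4.0),  # extreme mismatch: factor is clamped
    ]
    for seg in segments:
        print(f"{seg.start:.1f}-{seg.end:.1f}s -> x{speed_factor(seg):.2f}")
```

Clamping trades perfect timing for watchability: a clamped segment will not match the dubbed audio exactly, which is consistent with the very slow or fast segments noted under Known Limitations.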

Configuration

The project creates several directories for processing:

downloads/: Downloaded YouTube videos
original_audios/: Extracted audio files
output_audio/: Processed audio segments
final_output/: Final dubbed videos
logs/: Processing logs
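
For reference, creating this layout up front could look like the sketch below. The directory names come from the list above; the function name ensure_work_dirs and the base parameter are hypothetical, and the project may create the directories differently.

```python
from pathlib import Path

# Directory names as listed in this README.
WORK_DIRS = (
    "downloads",
    "original_audios",
    "output_audio",
    "final_output",
    "logs",
)


def ensure_work_dirs(base="."):
    """Create the working directories under base if they do not exist.

    Returns the list of directory paths. Hypothetical helper for
    illustration, not part of the AutoDub code base.
    """
    base_path = Path(base)
    paths = []
    for name in WORK_DIRS:
        path = base_path / name
        # exist_ok=True makes the call safe to repeat on every run.
        path.mkdir(parents=True, exist_ok=True)
        paths.append(path)
    return paths
```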

Known Limitations

Output video quality depends on the quality of the source YouTube video
For some languages, audio generation can produce artifacts and segments that play noticeably too slow or too fast
Processing time varies with video length and hardware
Output quality varies between target languages

Contributing

Contributions are welcome! Please feel free to submit pull requests or create issues for bugs and feature requests.

Acknowledgments

Whisper AI for speech recognition
XTTS v2 for voice cloning
Spleeter for audio separation
DeepL for neural translation
yt-dlp for video downloading

Contact

For questions or support, please create an issue in the GitHub repository.
