GitHub - jianchang512/pyvideotrans: Translate the video from one language to another and embed dubbing & subtitles.

简体中文

Recall.ai - Meeting Transcription API

If you’re looking for a transcription API for meetings, consider checking out Recall.ai , an API that works with Zoom, Google Meet, Microsoft Teams, and more. Recall.ai diarizes by pulling the speaker data and separate audio streams from the meeting platforms, which means 100% accurate speaker diarization with actual speaker names.

Video Translation & Dubbing Tool

This is a powerful open-source video translation / audio transcription / speech synthesis tool, dedicated to seamlessly converting videos from one language to another, complete with dubbed audio and subtitles.

Core Features at a Glance

Fully Automatic Video/Audio Translation: Intelligently recognizes and transcribes voices in audio/video, generates source language subtitles, translates them to the target language, performs dubbing, and finally merges the new audio and subtitles into the original video—all in one go.
Voice Transcription / Audio & Video to Subtitles: Batch transcribes human speech from video or audio files into SRT subtitle files with precise time codes.
Speech Synthesis / Text-to-Speech (TTS): Utilizes various advanced TTS channels to generate high-quality, natural-sounding voiceovers for your text or SRT subtitle files.
SRT Subtitle Translation: Supports batch translation of SRT subtitle files, preserving original timestamps and formatting, while providing multiple bilingual subtitle styles.
Real-time Speech-to-Text: Supports real-time microphone monitoring to convert speech into text.

How It Works

Before getting started, please ensure you understand the core working mechanism of this software:

First, the human voice in the audio or video is converted into a subtitle file via the [Speech Recognition Channel]. Next, this subtitle file is translated into the target language via the [Translation Channel]. Then, the translated subtitles are used to generate audio via the selected [Dubbing Channel]. Finally, the subtitles, audio, and original video are embedded and aligned to complete the video translation process.

Can Handle: Any audio or video containing human speech, regardless of whether it has embedded subtitles.
Cannot Handle: Videos containing only background music and hardcoded subtitles but no spoken voice. This software also cannot directly extract hardcoded subtitles from video frames.

Pre-packaged Version (Windows 10/11 Only, MacOS/Linux Use Source Code)

Packed using PyInstaller. No antivirus evasion or signing has been applied; antivirus software may flag it as a virus. Please add it to your trust list or deploy from source.

Click to download the pre-packaged version, unzip it to a directory with no spaces in the path, and double-click sp.exe.
Unzip to an English path, ensuring the path contains no spaces. After unzipping, double-click sp.exe (If you encounter permission issues, right-click and run as administrator).
Note: You must unzip the file before use. Do not run it directly from inside the compressed archive, and do not move the sp.exe file to another location after unzipping.

Source Code Deployment

Recommended: Install using uv. If you don't have uv yet, check the official installation guide.

Prerequisites for MacOS/Linux

MacOS: Execute the following commands to install the required libraries:
```
brew install libsndfile

brew install ffmpeg

brew install git
```
Linux: Install ffmpeg using sudo yum install -y ffmpeg or apt-get install ffmpeg.
Create a folder with no spaces in its name. Open a terminal in that folder and execute:
```
git clone https://github.com/jianchang512/pyvideotrans
cd pyvideotrans
```
Alternatively, download the source code directly from https://github.com/jianchang512/pyvideotrans by clicking the green Code button, unzip it, and navigate to the directory containing sp.py.
Run uv sync to install modules. Depending on your network connection, this may take anywhere from a few minutes to over ten minutes.
Run uv run sp.py to launch the software interface.

Source Deployment Troubleshooting

By default, the software uses ctranslate2 version 4.x, which only supports CUDA 12.x. If your CUDA version is lower than 12 and you cannot upgrade, please execute the following commands to uninstall ctranslate2 and reinstall a compatible version:

uv remove ctranslate2

uv add ctranslate2==3.24.0

Tutorials and Documentation

Please visit https://pyvideotrans.com

Software Preview

Acknowledgements

This program relies primarily on the following open-source projects:

Name		Name	Last commit message	Last commit date
Latest commit History 969 Commits
.github		.github
docs		docs
models		models
videotrans		videotrans
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
api.py		api.py
down_hf-mirror.py		down_hf-mirror.py
down_huggingface.py		down_huggingface.py
ffmpeg.txt		ffmpeg.txt
law.txt		law.txt
pyproject.toml		pyproject.toml
run-cuda.bat		run-cuda.bat
run-test.bat		run-test.bat
run.bat		run.bat
runapi.bat		runapi.bat
single_exec.txt		single_exec.txt
sp.py		sp.py
testcuda.py		testcuda.py
update_ffmpeg.bat		update_ffmpeg.bat
uv.lock		uv.lock
version.json		version.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Repository files navigation

Recall.ai - Meeting Transcription API

Video Translation & Dubbing Tool

Core Features at a Glance

How It Works

Pre-packaged Version (Windows 10/11 Only, MacOS/Linux Use Source Code)

Source Code Deployment

Source Deployment Troubleshooting

Tutorials and Documentation

Software Preview

Acknowledgements

About

Uh oh!

Releases 75

Sponsor this project

Uh oh!

Uh oh!

Contributors 15

Languages

Uh oh!

License

jianchang512/pyvideotrans

Folders and files

Latest commit

History

Repository files navigation

Recall.ai - Meeting Transcription API

Video Translation & Dubbing Tool

Core Features at a Glance

How It Works

Pre-packaged Version (Windows 10/11 Only, MacOS/Linux Use Source Code)

Source Code Deployment

Source Deployment Troubleshooting

Tutorials and Documentation

Software Preview

Acknowledgements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 75

Sponsor this project

Uh oh!

Uh oh!

Contributors 15

Languages