If you’re looking for a transcription API for meetings, consider checking out Recall.ai , an API that works with Zoom, Google Meet, Microsoft Teams, and more. Recall.ai diarizes by pulling the speaker data and separate audio streams from the meeting platforms, which means 100% accurate speaker diarization with actual speaker names.
This is a powerful open-source video translation / audio transcription / speech synthesis tool, dedicated to seamlessly converting videos from one language to another, complete with dubbed audio and subtitles.
- Fully Automatic Video/Audio Translation: Intelligently recognizes and transcribes voices in audio/video, generates source language subtitles, translates them to the target language, performs dubbing, and finally merges the new audio and subtitles into the original video—all in one go.
- Voice Transcription / Audio & Video to Subtitles: Batch transcribes human speech from video or audio files into SRT subtitle files with precise time codes.
- Speech Synthesis / Text-to-Speech (TTS): Utilizes various advanced TTS channels to generate high-quality, natural-sounding voiceovers for your text or SRT subtitle files.
- SRT Subtitle Translation: Supports batch translation of SRT subtitle files, preserving original timestamps and formatting, while providing multiple bilingual subtitle styles.
- Real-time Speech-to-Text: Supports real-time microphone monitoring to convert speech into text.
Before getting started, please ensure you understand the core working mechanism of this software:
First, the human voice in the audio or video is converted into a subtitle file via the [Speech Recognition Channel]. Next, this subtitle file is translated into the target language via the [Translation Channel]. Then, the translated subtitles are used to generate audio via the selected [Dubbing Channel]. Finally, the subtitles, audio, and original video are embedded and aligned to complete the video translation process.
- Can Handle: Any audio or video containing human speech, regardless of whether it has embedded subtitles.
- Cannot Handle: Videos containing only background music and hardcoded subtitles but no spoken voice. This software also cannot directly extract hardcoded subtitles from video frames.
Packed using PyInstaller. No antivirus evasion or signing has been applied; antivirus software may flag it as a virus. Please add it to your trust list or deploy from source.
-
Click to download the pre-packaged version, unzip it to a directory with no spaces in the path, and double-click
sp.exe. -
Unzip to an English path, ensuring the path contains no spaces. After unzipping, double-click
sp.exe(If you encounter permission issues, right-click and run as administrator). -
Note: You must unzip the file before use. Do not run it directly from inside the compressed archive, and do not move the
sp.exefile to another location after unzipping.
Recommended: Install using
uv. If you don't haveuvyet, check the official installation guide.
-
Prerequisites for MacOS/Linux
MacOS: Execute the following commands to install the required libraries:
brew install libsndfile brew install ffmpeg brew install git
Linux: Install
ffmpegusingsudo yum install -y ffmpegorapt-get install ffmpeg. -
Create a folder with no spaces in its name. Open a terminal in that folder and execute:
git clone https://github.com/jianchang512/pyvideotrans cd pyvideotransAlternatively, download the source code directly from https://github.com/jianchang512/pyvideotrans by clicking the green Code button, unzip it, and navigate to the directory containing
sp.py. -
Run
uv syncto install modules. Depending on your network connection, this may take anywhere from a few minutes to over ten minutes. -
Run
uv run sp.pyto launch the software interface.
- By default, the software uses
ctranslate2version 4.x, which only supports CUDA 12.x. If your CUDA version is lower than 12 and you cannot upgrade, please execute the following commands to uninstallctranslate2and reinstall a compatible version:
uv remove ctranslate2
uv add ctranslate2==3.24.0Please visit https://pyvideotrans.com
This program relies primarily on the following open-source projects: