Mp3toText automatically transcribes Vietnamese audio files into text.
Its simple interface lets users turn full conversations into usable text in a few clicks.
- 📝 Choose between Whisper (OpenAI) or PhoWhisper (VinAI) models, downloaded locally.
- ⚡ Automatically transcribe audio into text that can then be fed into downstream systems.
- 🔄 Speaker gender detection (male vs female) using clustering based on voice frequency (Hz).
- 🔄 Export results with a clear and intuitive display.
- 🖥️ User-friendly interface, no programming skills required.
- 🛠️ Compare models: users can evaluate whether OpenAI’s Whisper or VinAI’s PhoWhisper transcribes Vietnamese more accurately.
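One common way to make the model comparison above concrete is word error rate (WER): the word-level edit distance between a model's output and a hand-checked reference transcript, divided by the number of reference words. The sketch below is an illustration only, not necessarily how Mp3toText compares models. The transcripts and the names `model_a` and `model_b` are made-up placeholders, not real outputs from Whisper or PhoWhisper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must not be empty")

    # prev[j] holds the edit distance between the reference words seen so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Placeholder strings for illustration, not measured model output.
reference = "xin chào các bạn"
outputs = {
    "model_a": "xin chào cắc bạn",
    "model_b": "xin chào các bạn",
}
scores = {name: word_error_rate(reference, text) for name, text in outputs.items()}
best = min(scores, key=scores.get)  # lowest WER wins
```

A lower WER means fewer word-level mistakes. A fair comparison needs the same audio and the same reference transcript for both models.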
📄 View PhoWhisper Paper (VinAI)
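The feature list mentions gender detection by clustering on voice frequency. A minimal sketch of that idea, assuming each speech segment has already been reduced to one mean pitch value in Hz, is a two-cluster 1-D k-means that labels the lower-pitched cluster "male" and the higher-pitched one "female". The function name `cluster_pitches` and the sample values are hypothetical, and the tool's actual clustering method may differ.

```python
def cluster_pitches(pitches_hz, iterations=20):
    """Two-cluster 1-D k-means over per-segment mean pitch values (Hz)."""
    if len(pitches_hz) < 2:
        raise ValueError("need at least two segments to form two clusters")

    # Start the centroids at the extremes so they are as far apart as possible.
    low, high = min(pitches_hz), max(pitches_hz)
    for _ in range(iterations):
        # Assign each segment to the nearer centroid (0 = low, 1 = high).
        labels = [0 if abs(p - low) <= abs(p - high) else 1 for p in pitches_hz]
        low_group = [p for p, lab in zip(pitches_hz, labels) if lab == 0]
        high_group = [p for p, lab in zip(pitches_hz, labels) if lab == 1]
        if not high_group:  # every value is identical; nothing to split
            break
        new_low = sum(low_group) / len(low_group)
        new_high = sum(high_group) / len(high_group)
        if (new_low, new_high) == (low, high):  # converged
            break
        low, high = new_low, new_high

    # Assumption: the lower-pitched cluster corresponds to male voices.
    genders = ["male" if lab == 0 else "female" for lab in labels]
    return genders, (low, high)


# Made-up mean pitch values for six segments of one conversation.
segment_pitches = [110.0, 205.0, 118.0, 220.0, 125.0, 198.0]
genders, centroids = cluster_pitches(segment_pitches)
```

This approach only separates two groups by relative pitch. It will mislabel recordings with a single speaker or with two speakers of the same gender, so a sanity check on the distance between the two centroids would be a sensible addition.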
- Prepare your audio file in a format such as `.mp3` or `.mp4` (example: `fileAudio.mp3`).
- Open the application `Mp3toText.exe`.
- Select the audio file to process.
- Click Run to start automatic transcription.
- View results directly on the interface or from the output file.
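For readers curious about what the Run step might do under the hood, here is a rough sketch that uses the open-source `openai-whisper` package, which also requires `ffmpeg` on the PATH. The helper names `transcribe_vietnamese` and `save_transcript`, the `small` model size, and the `.txt` output naming are assumptions for illustration. The application's internals and output format may differ.

```python
from pathlib import Path


def transcribe_vietnamese(audio_path: str, model_name: str = "small") -> str:
    """Transcribe one audio file with a locally downloaded Whisper model."""
    # Imported lazily so the rest of this module works without the package.
    import whisper  # pip install openai-whisper

    model = whisper.load_model(model_name)
    # language="vi" skips auto-detection and forces Vietnamese decoding.
    result = model.transcribe(audio_path, language="vi")
    return result["text"].strip()


def save_transcript(text: str, audio_path: str) -> Path:
    """Write the transcript next to the audio file (hypothetical naming)."""
    out_path = Path(audio_path).with_suffix(".txt")
    out_path.write_text(text, encoding="utf-8")
    return out_path
```

With the package installed, `save_transcript(transcribe_vietnamese("fileAudio.mp3"), "fileAudio.mp3")` would write `fileAudio.txt` beside the input file.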
- Clone the repository:

      git clone https://github.com/Phuc75nguyen/MP3toText.git
      cd MP3toText

- Set up a virtual environment (Windows):

      python -m venv venv
      venv\Scripts\activate

- Build with PyInstaller in VS Code:

      python -m PyInstaller --noconfirm --onefile --windowed --name MP3toText --icon=app.ico --add-data "app.ico;." app.py

- Run the app:

      python app.py
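The `--add-data` separator in the build command (`;`) is the Windows form. As a hedged alternative, the same build can be driven from a small Python script that picks the separator with `os.pathsep`, so the command also works on macOS and Linux. The helper names `pyinstaller_args` and `build` are hypothetical, and the script assumes PyInstaller is installed in the active virtual environment.

```python
import os


def pyinstaller_args(entry="app.py", name="MP3toText", icon="app.ico"):
    """Build the PyInstaller argument list used in the command above."""
    # --add-data takes "SOURCE<sep>DEST"; os.pathsep is ';' on Windows and
    # ':' elsewhere, matching the platform-specific form PyInstaller accepts.
    data_spec = f"{icon}{os.pathsep}."
    return [
        "--noconfirm",
        "--onefile",
        "--windowed",
        "--name", name,
        f"--icon={icon}",
        "--add-data", data_spec,
        entry,
    ]


def build():
    """Run PyInstaller in-process with the arguments above."""
    # Imported lazily so the argument helper works without PyInstaller.
    import PyInstaller.__main__

    PyInstaller.__main__.run(pyinstaller_args())
```

Calling `build()` from the repository root should produce the same executable as the command-line invocation.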





