🚀 Python Desktop Tool – Mp3toText

Mp3toText is a Python desktop tool that transcribes Vietnamese audio files into text.
Its simple interface lets users turn full conversations into usable text in just a few clicks.

✨ Key Features

📝 Choose between the Whisper (OpenAI) and PhoWhisper (VinAI) models, which are downloaded locally.
⚡ Automatically transcribe audio into text that can then be fed into other systems.
🔄 Speaker gender detection (male vs female) using clustering based on voice frequency (Hz).
🔄 Export results with a clear and intuitive display.
🖥️ User-friendly interface, no programming skills required.
🛠️ Compare models: users can evaluate whether OpenAI’s Whisper or VinAI’s PhoWhisper transcribes Vietnamese more accurately.
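
The gender detection feature above is described only as clustering on voice frequency (Hz). The sketch below is one possible reading of that idea, not the code in app.py: it assumes a median pitch value has already been estimated for each speech segment, splits those values into two groups with a plain 1-D k-means (k=2), and labels the lower-pitch group as male. The function names and the sample pitch values are hypothetical.

```python
from statistics import mean


def two_means_1d(values, iterations=20):
    """Split 1-D values into a low and a high cluster with k-means (k=2)."""
    low, high = min(values), max(values)  # start from the two extremes
    for _ in range(iterations):
        low_group = [v for v in values if abs(v - low) <= abs(v - high)]
        high_group = [v for v in values if abs(v - low) > abs(v - high)]
        if not high_group:  # every value is identical, nothing to split
            break
        new_low, new_high = mean(low_group), mean(high_group)
        if (new_low, new_high) == (low, high):  # centroids stopped moving
            break
        low, high = new_low, new_high
    return low, high


def label_speakers(pitches_hz):
    """Label each segment by its nearest centroid (assumption: lower = male)."""
    low, high = two_means_1d(pitches_hz)
    return ["male" if abs(p - low) <= abs(p - high) else "female"
            for p in pitches_hz]


# Hypothetical per-segment median pitch values in Hz (illustrative only).
segment_pitches = [118.0, 125.0, 210.0, 198.0, 121.0, 225.0]
labels = label_speakers(segment_pitches)
print(labels)
```

Because the split is relative, a recording with a single speaker or with two speakers of the same gender would still be divided into two groups (or labeled entirely as male when all values are equal), so a real implementation would likely need an absolute pitch threshold as a sanity check.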

📖 Approach

Whisper Model – OpenAI

PhoWhisper Model – VinAI

📄 View PhoWhisper Paper (VinAI)
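
As a rough illustration of how the two models can be called from Python, the sketch below wraps both behind one function. It assumes the `openai-whisper` package for Whisper and the Hugging Face `transformers` pipeline for PhoWhisper, with ffmpeg available on the system for decoding audio. The README does not state which libraries or model sizes app.py actually uses, so the `transcribe` helper, the `"small"` Whisper size, and the `vinai/PhoWhisper-small` checkpoint are assumptions chosen for the example.

```python
def transcribe(audio_path, backend="whisper", model_name=None):
    """Return the transcript of audio_path using the chosen backend.

    Imports are done lazily so that only the selected backend's
    dependencies need to be installed.
    """
    if backend == "whisper":
        import whisper  # assumption: pip install openai-whisper

        model = whisper.load_model(model_name or "small")
        result = model.transcribe(audio_path, language="vi")
        return result["text"].strip()

    if backend == "phowhisper":
        from transformers import pipeline  # assumption: transformers + torch

        asr = pipeline(
            "automatic-speech-recognition",
            model=model_name or "vinai/PhoWhisper-small",
            chunk_length_s=30,  # split long recordings into 30 s windows
        )
        return asr(audio_path)["text"].strip()

    raise ValueError(f"unknown backend: {backend!r}")


if __name__ == "__main__":
    # fileAudio.mp3 is the example file name used in this README.
    for name in ("whisper", "phowhisper"):
        print(name, "->", transcribe("fileAudio.mp3", backend=name))
```

The first call for each backend downloads the model weights, which matches the "downloaded locally" behavior described under Key Features.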

📸 Application Interface

Main screen on launch

Processing data with models

Final transcription results

🛠️ How to Use

Prepare an audio file in a supported format such as .mp3 or .mp4 (example: fileAudio.mp3).
Open the application MP3toText.exe.
Select the audio file to process.
Click Run to start automatic transcription.
View results directly on the interface or from the output file.
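
Once both models have produced a transcript for the same file, the comparison mentioned under Key Features can be made concrete with word error rate (WER) against a hand-corrected reference. The sketch below is a minimal, assumed approach and is not part of the application: `word_error_rate` is a hypothetical helper that computes word-level edit distance, and the sample sentences are made up for illustration.

```python
import unicodedata


def _words(text):
    """Normalize to NFC (Vietnamese diacritics), lowercase, and split."""
    return unicodedata.normalize("NFC", text).lower().split()


def word_error_rate(reference, hypothesis):
    """WER = word-level edit distance divided by reference length."""
    ref, hyp = _words(reference), _words(hypothesis)
    if not ref:
        raise ValueError("reference transcript is empty")
    prev = list(range(len(hyp) + 1))  # distances for an empty reference
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


# Made-up example: one reference and two hypothetical model outputs.
reference = "xin chào các bạn"
output_a = "xin chào các bạn"
output_b = "xin chao bạn"
wer_a = word_error_rate(reference, output_a)
wer_b = word_error_rate(reference, output_b)
print(wer_a, wer_b)
```

A lower WER means the transcript is closer to the reference. Punctuation is not stripped here, so transcripts should be cleaned consistently before comparing.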

📦 Installation

Clone the repository:

git clone https://github.com/Phuc75nguyen/MP3toText.git
cd MP3toText

Set up a virtual environment (Windows):

python -m venv venv
venv\Scripts\activate

Build the executable with PyInstaller (for example, from the VS Code terminal):

python -m PyInstaller --noconfirm --onefile --windowed --name MP3toText --icon=app.ico --add-data "app.ico;." app.py

Run the app from source:
```
python app.py
```

❤️ Made with Tons of Love (Tan Phuc)
