VoiceMD - Voice Analysis Application

A modern, offline voice analysis tool that predicts speaker characteristics based on acoustic features.

Features

🎨 Modern Interface - Clean, intuitive GUI with modern design
🔄 Multi-Model Support - Switch between different trained models in real-time
💾 Fully Offline - No internet required after initial setup
🎤 Multiple Formats - Supports WAV, MP3, OGG, FLAC, M4A
📦 Easy Installation - Simple pip install, models auto-download

Installation

Requirements

Python 3.8 - 3.13 (Python 3.14+ not yet supported - dependencies numba/llvmlite don't support 3.14. Use Python 3.13 or earlier)
~500 MB disk space (including models and dependencies)
Internet connection for first-time model download only

Quick Install (Recommended - No Git Required!)

pip install voicemd-gui

Then run:

voicemd-gui

Models (~4.4 MB) will automatically download from GitHub Releases on first launch.

Alternative Install (From GitHub)

If you want the latest development version:

pip install git+https://github.com/Honey181/voicemd.git

Manual Install (From Source)

Download the repository as ZIP from GitHub
Extract it
Open terminal in the extracted folder
Run: pip install .
Run: voicemd-gui

Manual Installation

If you prefer to run from source:

# Clone the repository
git clone https://github.com/Honey181/voicemd.git
cd voicemd

# Install dependencies
pip install -r requirements_app.txt

# Run the app
python app_gui.py

Models will auto-download on first run, or manually run:

python download_models.py

Usage

Select a model from the dropdown (Small Dataset or CommonVoice)
Browse for an audio file (WAV, MP3, OGG, FLAC, M4A)
Click "Analyze Voice"
View results instantly

Uninstallation

Remove the Application

pip uninstall voicemd-gui -y

Remove Downloaded Models (Optional)

The model files (~4.4 MB) are stored separately and will remain after uninstalling. To completely remove them:

Windows:

Remove-Item -Recurse -Force "$env:USERPROFILE\.voicemd"

macOS/Linux:

rm -rf ~/.voicemd

Location of models:

When installed via pip: ~/.voicemd/models/ (or %USERPROFILE%\.voicemd\models\ on Windows)
When running from source: Project root directory

Multi-Model Support

VoiceMD includes two trained models that can be switched at runtime:

Small Dataset Model - Faster, good for general use
CommonVoice Model - More robust, handles diverse accents

Switch between models using the dropdown in the app. No restart required!

Troubleshooting

Models Won't Download

Download manually:

Go to https://github.com/Honey181/voicemd/releases
Download both .pt model files
Place them in the project root directory

Or use the download script:

python download_models.py

Import Errors

Update dependencies:

pip install --upgrade -r requirements_app.txt

Audio Loading Issues

The app uses soundfile/librosa (no FFmpeg required). If you still get errors:

Windows: choco install ffmpeg
macOS: brew install ffmpeg
Linux: sudo apt install ffmpeg

LLVM Version Compatibility (Linux)

If you get an error like llvmlite only officially supports LLVM 20 during installation:

Option 1 - Use Conda (Recommended):

conda create -n voicemd python=3.10
conda activate voicemd
conda install -c conda-forge llvmlite numba
pip install git+https://github.com/Honey181/voicemd.git

Option 2 - Install specific LLVM version:

# Ubuntu/Debian
sudo apt install llvm-14 llvm-14-dev
export LLVM_CONFIG=/usr/bin/llvm-config-14
pip install git+https://github.com/Honey181/voicemd.git

Option 3 - Use older Python:

# Python 3.9 or 3.10 have better llvmlite compatibility
python3.10 -m pip install git+https://github.com/Honey181/voicemd.git

Technical Details

Framework: PyTorch for model inference
GUI: Tkinter for cross-platform interface
Audio Processing: librosa, soundfile, torchaudio
No FFmpeg Required: Uses soundfile/librosa for audio loading

License

See LICENSE file for full details.

Credits

Original Project: VoiceMD by @jerpint (Jeremy Pinto)

Enhanced by: @Honey181 - Modern UI and easy installation

This project builds upon the excellent work of the original VoiceMD team. All credit for the model architecture, training pipeline, and core functionality goes to them.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Name		Name	Last commit message	Last commit date
Latest commit History 107 Commits
config/hooks		config/hooks
examples		examples
tests		tests
voicemd		voicemd
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
app_config.yaml		app_config.yaml
app_gui.py		app_gui.py
app_predictor.py		app_predictor.py
download_models.py		download_models.py
models_config.yaml		models_config.yaml
pyproject.toml		pyproject.toml
requirements_app.txt		requirements_app.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VoiceMD - Voice Analysis Application

Features

Installation

Requirements

Quick Install (Recommended - No Git Required!)

Alternative Install (From GitHub)

Manual Install (From Source)

Manual Installation

Usage

Uninstallation

Remove the Application

Remove Downloaded Models (Optional)

Multi-Model Support

Troubleshooting

Models Won't Download

Import Errors

Audio Loading Issues

LLVM Version Compatibility (Linux)

Technical Details

License

Credits

Contributing

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

VoiceMD - Voice Analysis Application

Features

Installation

Requirements

Quick Install (Recommended - No Git Required!)

Alternative Install (From GitHub)

Manual Install (From Source)

Manual Installation

Usage

Uninstallation

Remove the Application

Remove Downloaded Models (Optional)

Multi-Model Support

Troubleshooting

Models Won't Download

Import Errors

Audio Loading Issues

LLVM Version Compatibility (Linux)

Technical Details

License

Credits

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages