this is a webui for the openai audio speech endpoint, as served by https://github.com/matatonic/openedai-speech Inspired by https://github.com/jhj0517/Whisper-WebUI (webui for selfhosted STT) I wanted to create the same for STT.
pull the latest image ´´´ dockcer pull ghcr.io/mreitschuster/speech-webui:latest ´´´
- git clone this repo
- adjsut the env variables in docker-compose.yaml
- docker compose up
git clone the repo run with ´´´ SERVER_URL="http://localhost:11435/v1/audio/speech" VOICES="alloy,echo,fable,onyx,nova,shimmer" MODELS="tts-1-hd,tts-1" python app.py ´´´