A web application for transcribing and summarizing audio files using OpenAI's Whisper model.
- Upload audio files (MP3, WAV, M4A, FLAC)
- Transcribe audio to text using Whisper
- Generate customized summaries based on user prompts
- Modern, responsive user interface
- Drag-and-drop file upload
-
Clone the repository:
git clone https://github.com/yourusername/whisper-summary.git cd whisper-summary -
Create a virtual environment and activate it:
python -m venv .venv source .venv/bin/activate # On Windows: .venv\Scripts\activate -
Install the required dependencies:
pip install -r requirements.txt -
Run the application:
python main.py -
Open your browser and navigate to
http://localhost:8000
GET /: Web interface for audio transcriptionGET /api: API informationPOST /transcribe/: Endpoint for audio transcription and summarization
See the LICENSE file for details.