Releases
v0.5.0
v0.5.0 - Parakeet Transcription Engine
Latest
Compare
Sorry, something went wrong.
No results found
ykdojo
released this
29 Jan 23:28
What's New
Parakeet Transcription Engine
Added FluidAudio Parakeet as alternative to WhisperKit
Parakeet v2 : ~110x realtime, 1.69% WER, English
Parakeet v3 : ~210x realtime, 1.8% WER, 25 languages
Faster and more accurate than Whisper on benchmarks
Features
Voice-to-Text : Cmd+Opt+Z (offline) or Cmd+Opt+X (Gemini cloud)
Text-to-Speech : Cmd+Opt+S with Gemini Live streaming
Screen Recording : Cmd+Opt+C with visual context transcription
History : Cmd+Opt+A to view past transcriptions
Paste Last : Cmd+Opt+V to re-paste last transcription
Settings
Unified model selector showing all engines in one list
Download models directly from Settings
Engine preference persists across restarts
Requirements
macOS 14.0+
Gemini API key (for TTS and cloud transcription)
ffmpeg (for screen recording)
You can’t perform that action at this time.