desk-talk

Transcription for your desktop.

A modern GUI application that records your speech while you hold a key down and types the transcription when you release it.

Important

⚠️ This video contains sound and is intended to be listened to with audio on. ⚠️

transcribe.mp4

Features

✨ Push-to-Talk Transcription - Hold a key, speak, release to paste
🎯 System Tray Integration - Runs quietly in the background
⚙️ Modern GUI Settings - Beautiful, easy-to-use configuration interface
📊 WPM Statistics - Track your words per minute with rolling averages
🔒 Secure API Storage - API keys stored safely in Windows Credential Manager
🎤 Multiple Audio Devices - Choose any microphone on your system
🌐 OpenAI or Local - Use OpenAI's Whisper API or run models locally
💾 Persistent Settings - All preferences saved between sessions

Setup

Make sure ffmpeg is installed and added to your PATH
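
If you want to confirm the ffmpeg requirement programmatically, the check can be sketched roughly as below. This is a minimal illustration, not code from desk-talk: the helper name `is_on_path` is hypothetical, and it simply tries to launch the program with ffmpeg's `-version` flag and treats any launch failure as "not found".

```rust
use std::process::{Command, Stdio};

/// Hypothetical helper (not part of desk-talk): returns true if `program`
/// can be launched from the current PATH and exits successfully.
/// `-version` makes ffmpeg print its version and exit immediately.
fn is_on_path(program: &str) -> bool {
    Command::new(program)
        .arg("-version")
        .stdin(Stdio::null())
        .stdout(Stdio::null()) // discard the version banner
        .stderr(Stdio::null())
        .status()
        .map(|status| status.success())
        .unwrap_or(false) // spawn error, e.g. the program is not on PATH
}
```

Calling `is_on_path("ffmpeg")` before starting a recording session would let an application show a clear setup message instead of failing later.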

Quickstart

1. Download and run the latest release
2. Click the tray icon in your system tray to open settings
3. Configure your settings:
   - Set your push-to-talk key (e.g., Scroll Lock)
   - Choose your microphone
   - Enter your OpenAI API key OR enable local transcription
4. Click "Start Transcription"
5. Hold your PTT key, speak, and release!
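
The hold, speak, release cycle can be thought of as a small two-state machine. The sketch below is only an illustration under assumptions: `KeyEvent`, `Action`, and `PushToTalk` are hypothetical names, and how desk-talk actually receives key events or handles key auto-repeat is not documented here.

```rust
/// Hypothetical key events; a real application would receive these
/// from a global keyboard hook for the configured push-to-talk key.
#[derive(Debug, Clone, Copy, PartialEq)]
enum KeyEvent {
    Pressed,
    Released,
}

/// What the application should do in response to a key event.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Action {
    StartRecording,
    StopAndTranscribe,
    Ignore,
}

/// Tracks whether a recording is currently in progress.
#[derive(Debug, Default)]
struct PushToTalk {
    recording: bool,
}

impl PushToTalk {
    fn handle(&mut self, event: KeyEvent) -> Action {
        match (self.recording, event) {
            (false, KeyEvent::Pressed) => {
                self.recording = true;
                Action::StartRecording
            }
            (true, KeyEvent::Released) => {
                self.recording = false;
                Action::StopAndTranscribe
            }
            // Repeated press events while the key is held, or a release
            // with no matching press, change nothing.
            _ => Action::Ignore,
        }
    }
}
```

Ignoring repeated press events matters because many operating systems resend key-down events while a key is held, which would otherwise restart the recording.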

Note

You can manage your OpenAI API keys at https://platform.openai.com/api-keys

Using the GUI

System Tray

Left-click the tray icon to open the settings window
Right-click for quick Start/Stop/Quit options

Settings Tabs

General Tab:

Configure push-to-talk key
Select audio input device
Toggle capitalization and spacing options
Choose between paste mode (default) or typing mode

Transcription Tab:

OpenAI Mode: Enter your API key for cloud transcription
Local Mode: Download and run Whisper models on your computer
- Available models: tiny-en, tiny, base-en, base, small-en, small, medium-en, medium, large-v1, large-v2, large-v3
- Larger models = better accuracy but slower processing

Statistics Tab:

View your current words per minute
Track rolling average WPM (last 1000 samples)
Monitor total words transcribed
See total recording time

WPM Display

After each transcription, the console displays:

WPM: 132.2 (27 words over 12.25s) | Avg: 118.7

Current WPM - Speed of this transcription
Word count - Number of words spoken
Duration - Time from key press to release
Rolling average - Average of your last 1000 transcriptions
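
For readers curious how these numbers relate, here is a minimal sketch of a WPM tracker with a bounded rolling window. It is written under assumptions rather than taken from the desk-talk source: `WpmTracker` is a hypothetical type, words are counted by splitting on whitespace, and the rolling average is assumed to be the plain mean of the stored per-transcription WPM values.

```rust
use std::collections::VecDeque;
use std::time::Duration;

/// Hypothetical tracker (not from the desk-talk source) that keeps the
/// most recent `capacity` WPM samples.
struct WpmTracker {
    samples: VecDeque<f64>,
    capacity: usize,
}

impl WpmTracker {
    fn new(capacity: usize) -> Self {
        let capacity = capacity.max(1); // keep at least one sample
        Self {
            samples: VecDeque::with_capacity(capacity),
            capacity,
        }
    }

    /// Records one transcription and returns (current WPM, rolling average).
    /// `held` is the time from key press to key release.
    fn record(&mut self, text: &str, held: Duration) -> (f64, f64) {
        // Assumption: a "word" is any whitespace-separated token.
        let words = text.split_whitespace().count() as f64;
        let minutes = held.as_secs_f64() / 60.0;
        let wpm = if minutes > 0.0 { words / minutes } else { 0.0 };

        // Drop the oldest sample once the window is full.
        if self.samples.len() == self.capacity {
            self.samples.pop_front();
        }
        self.samples.push_back(wpm);

        let avg = self.samples.iter().sum::<f64>() / self.samples.len() as f64;
        (wpm, avg)
    }
}
```

With `WpmTracker::new(1000)`, a 30-word transcription held for 15 seconds gives 30 / (15 / 60) = 120 WPM, and the average moves only gradually as new samples push old ones out of the window.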

Building from Source

# Install Rust and dependencies
cargo build --release

# The executable will be in target/release/desk-talk.exe
